libndpi.git - Open Source Deep Packet Inspection Software Toolkit

	Commit message (Collapse)	Author	Age
...
*	fuzz: extend coverage (#2073)	Ivan Nardi	2023-08-20
\|
*	Add an heuristic to detect fully encrypted flows (#2058)	Ivan Nardi	2023-07-26
\| \| \| \| \| \| \| \|	A fully encrypted session is a flow where every bytes of the payload is encrypted in an attempt to “look like nothing”. The heuristic needs only the very first packet of the flow. See: https://www.usenix.org/system/files/sec23fall-prepub-234-wu-mingshi.pdf A basic, but generic, inplementation of the popcpunt alg has been added
*	fuzz: extend fuzzing coverage (#2052)	Ivan Nardi	2023-07-18
\| \| \| \|	Added/merged some traces. Improved Socks identification
*	fuzz: add fuzzer for DGA detection code (#2042)	Ivan Nardi	2023-07-13
\|
*	fuzz: extend fuzzing coverage (#2040)	Ivan Nardi	2023-07-11
\| \| \| \| \| \| \| \| \|	Some notes: * libinjection: according to https://github.com/libinjection/libinjection/issues/44, it seems NULL characters are valid in the input string; * RTP: `rtp_get_stream_type()` is called only for RTP packets; if you want to tell RTP from RTCP you should use `is_rtp_or_rtcp()`; * TLS: unnecessary check; we already make the same check just above, at the beginning of the `while` loop
*	STUN: keep monitoring/processing STUN flows (#2012)	Ivan Nardi	2023-06-21
\| \| \| \|	Look for RTP packets in the STUN sessions. TODO: tell RTP from RTCP
*	Fix some memory errors triggered by allocation failures (#1995)	Ivan Nardi	2023-05-29
\| \| \| \| \| \| \| \| \| \| \| \|	Some low hanging fruits found using nallocfuzz. See: https://github.com/catenacyber/nallocfuzz See: https://github.com/google/oss-fuzz/pull/9902 Most of these errors are quite trivial to fix; the only exception is the stuff in the uthash. If the insertion fails (because of an allocation failure), we need to avoid some memory leaks. But the only way to check if the `HASH_ADD_*` failed, is to perform a new lookup: a bit costly, but we don't use that code in any critical data-path.
*	ndpiReader: improve printing of payload statistics (#1989)	Ivan Nardi	2023-05-29
\| \| \| \| \|	Add a basic unit test Fix an endianess issue
*	Add an heuristic to detect/ignore some anomalous TCP ACK packets (#1948)	Ivan Nardi	2023-04-25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In some networks, there are some anomalous TCP flows where the smallest ACK packets have some kind of zero padding. It looks like the IP and TCP headers in those frames wrongly consider the 0x00 Ethernet padding bytes as part of the TCP payload. While this kind of packets is perfectly valid per-se, in some conditions they might be treated by the TCP reassembler logic as (partial) overlaps, deceiving the classification engine. Add an heuristic to detect these packets and to ignore them, allowing correct detection/classification. This heuristic is configurable. Default value: * in the library, it is disabled * in `ndpiReader` and in the fuzzers, it is enabled (to ease testing) Credit to @vel21ripn for the initial patch. Close #1946
*	fuzz: extend fuzzers coverage (#1952)	Ivan Nardi	2023-04-25
\|
*	Improved debug output. (#1951)	Toni	2023-04-21
\| \| \| \| \| \|	* try to get rid of some `printf(..)`s as they do not belong to a shared library * replaced all `exit(..)`s with `abort()`s to indicate an abnormal process termination Signed-off-by: Toni Uhlig <matzeton@googlemail.com>
*	Simplify `ndpi_internal_guess_undetected_protocol()` (#1941)	Ivan Nardi	2023-04-12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	`ndpi_guess_undetected_protocol()/ndpi_internal_guess_undetected_protocol()` is a strange function: * it is exported by the library and it is actively used by `ntopng` * it is intrinsecally ipv4-only * it returns basically something like "classification_by_ip"/"classification_by_port" (these information have already been calculated in `ndpi_do_guess()`...) * it access the bittorrent LRU caches (similarly to `ndpi_detection_giveup()` but without all the other caches...) So: * make the interface IPv4/6 agnostic * use the classifications already available This work will allow to make the Bittorrent caches IPV6-aware (see 81e1ea5). Handle Dropbox classification "by-port" in the "standard" way.
*	Test multiple `ndpiReader` configurations (#1931)	Ivan Nardi	2023-04-06
\| \| \| \| \| \| \| \| \|	Extend internal unit tests to handle multiple configurations. As some examples, add tests about: * disabling some protocols * disabling Ookla aggressiveness Every configurations data is stored in a dedicated directory under `tests\cfgs`
*	fuzz: add a new fuzzer triggering the payload analyzer function(s) (#1926)	Ivan Nardi	2023-04-04
\|
*	fuzz: add fuzzer to test internal gcrypt code (#1920)	Ivan Nardi	2023-04-04
\|
*	Update libinjection code (#1918)	Ivan Nardi	2023-04-04
\| \| \| \| \| \| \| \| \| \|	Update libinjection code to the current master https://github.com/libinjection/libinjection/commit/7e4b74e824dc3f0623ba4894df2a0d817351ec39 The goal is to finally fix #1820 See: https://github.com/libinjection/libinjection/issues/33 Update the corpus of the libinjection fuzzers Close #1820
*	Ookla: rework detection (#1922)	Ivan Nardi	2023-03-30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The logic of the LRU cache has been changed: once we know an ip has connected to an Ookla server, all the following (unknown) flows (for a short time interval) from the same ip to the port 8080 are treated as Ookla ones. Most of the changes in this commit are about introducing the concept of "aggressive detection". In some cases, to properly detect a protocol we might use some statistical/behavior logic that, from one side, let us to identify the protocol more often but, from the other side, might lead to some false positives. To allow the user/application to easily detect when such logic has been triggered, the new confidence value `NDPI_CONFIDENCE_DPI_AGGRESSIVE` has been added. It is always possible to disable/configure this kind of logic via the API. Detection of Ookla flows using plain TLS over port 8080 is the first example of aggressive detection in nDPI. Tested with: * Android 9.0 with app 4.8.3 * Ubuntu 20.04 with Firefox 110 * Win 10 with app 1.15 and 1.16 * Win 10 with Chrome 108, Edge 108 and Firefox 106
*	fuzz: add a new fuzzer to test TLS certificates (#1901)	Ivan Nardi	2023-03-20
\|
*	fuzz: extend fuzz coverage (#1888)	Ivan Nardi	2023-02-16
\|
*	fuzz: some improvements and add two new fuzzers (#1881)	Ivan Nardi	2023-02-09
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Remove `FUZZING_BUILD_MODE_UNSAFE_FOR_PRODUCTION` define from `fuzz/Makefile.am`; it is already included by the main configure script (when fuzzing). Add a knob to force disabling of AESNI optimizations: this way we can fuzz also no-aesni crypto code. Move CRC32 algorithm into the library. Add some fake traces to extend fuzzing coverage. Note that these traces are hand-made (via scapy/curl) and must not be used as "proof" that the dissectors are really able to identify this kind of traffic. Some small updates to some dissectors: CSGO: remove a wrong rule (never triggered, BTW). Any UDP packet starting with "VS01" will be classified as STEAM (see steam.c around line 111). Googling it, it seems right so. XBOX: XBOX only analyses UDP flows while HTTP only TCP ones; therefore that condition is false. RTP, STUN: removed useless "break"s Zattoo: `flow->zattoo_stage` is never set to any values greater or equal to 5, so these checks are never true. PPStream: `flow->l4.udp.ppstream_stage` is never read. Delete it. TeamSpeak: we check for `flow->packet_counter == 3` just above, so the following check `flow->packet_counter >= 3` is always false.
*	fuzz: add a new fuzzer to test serialization/deserialization code (#1876)	Ivan Nardi	2023-01-27
\| \| \| \| \| \| \| \| \| \|	Autodetecting the needed buffer size is quite complex (especially with float/double values) so it is mandatory to properly check for `ndpi_snprintf` truncation. These issues have been undetected so far probably because the default buffer is big enough for all common cases. Add an example of usage of `ndpi_deserialize_clone_all()` (taken from `ntopng`)
*	Add some fuzzers to test other data structures. (#1870)	Ivan Nardi	2023-01-25
\| \| \| \| \| \| \|	Start using a dictionary for fuzzing (see: https://llvm.org/docs/LibFuzzer.html#dictionaries). Remove some dead code. Fuzzing with debug enabled is not usually a great idea (from performance POV). Keep the code since it might be useful while debugging.
*	fuzz: fix memory allocation failure logic (#1867)	Ivan Nardi	2023-01-20
\| \| \| \| \|	We do want to have some allocation errors. Fix some related bugs Fix: 29be01ef
*	Fuzz: fix compilation with nBPF (#1860)	Ivan Nardi	2023-01-18
\| \| \|	Close #1859
*	LRU caches: add a generic (optional and configurable) expiration logic (#1855)	Ivan Nardi	2023-01-18
\| \| \|	Two caches already implemented a similar mechanism: make it generic.
*	Add some fuzzers to test algorithms and data structures (#1852)	Ivan Nardi	2023-01-17
\| \| \|	Fix some issues found with these new fuzzers
*	fuzz: add fuzzer testing nDPI (initial) configurations (#1830)	Ivan Nardi	2022-12-23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The goal of this fuzzer is to test init and deinit of the library, with different configurations. In details: * random memory allocation failures, even during init phase * random `ndpi_init_prefs` parameter of `ndpi_init_detection_module()` * random LRU caches sizes * random bitmask of enabled protocols * random parameters of `ndpi_set_detection_preferences()` * random initialization of opportunistic TLS * random load/don't load of configuration files This new fuzzer is a C++ file, because it uses `FuzzedDataProvider` class (see https://github.com/google/fuzzing/blob/master/docs/split-inputs.md). Note that the (existing) fuzzers need to be linked with C++ compiler anyway, so this new fuzzer doesn't add any new requirements.
*	fuzz: some enhancements (#1827)	Ivan Nardi	2022-12-10
\| \| \| \| \| \| \| \| \| \|	Load some custom configuration (like in the unit tests) and factorize some (fuzzing) common code. There is no way to pass file paths to the fuzzers as parameters. The safe solution seems to be to load them from the process working dir. Anyway, missing file is not a blocking error. Remove some dead code (found looking at the coverage report)
*	fuzz: fix signed-integer-overflow (#1822)	Ivan Nardi	2022-12-10
\| \| \| \| \| \| \| \| \| \|	``` fuzz_ndpi_reader.c:33:29: runtime error: signed integer overflow: 214013 * 24360337 cannot be represented in type 'int' #0 0x4c1cf7 in fastrand ndpi/fuzz/fuzz_ndpi_reader.c:33:29 #1 0x4c1cf7 in malloc_wrapper ndpi/fuzz/fuzz_ndpi_reader.c:38:11 #2 0x523057 in ndpi_malloc ndpi/src/lib/ndpi_main.c:220:25 ``` Found by oss-fuzz See: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=54112
*	fuzz: add a new fuzzer testing memory allocation failures (#1818)	Ivan Nardi	2022-12-06
\| \| \| \| \| \| \| \| \|	Try to fuzz error paths triggered by allocation errors. Fix some errors already found by this new fuzzer. Basic idea taken from: https://github.com/harfbuzz/harfbuzz/pull/2566/files `FUZZING_BUILD_MODE_UNSAFE_FOR_PRODUCTION` is a standard define used to (not)compile specific code in fuzzing builds. See: https://llvm.org/docs/LibFuzzer.html
*	Add support for LTO and Gold linker (#1812)	Ivan Nardi	2022-12-05
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This commit add (optional) support for Link-Time-Optimization and Gold linker. This is the first, mandatory step needed to make nDPI compliant with "introspector" sanitizer requirements in OSS-Fuzz: see https://github.com/google/oss-fuzz/issues/8939 Gold linker is not supported by Windows and by macOS, so this feature is disabled by default. It has been enable in CI in two linux targets ("latest" gcc and clang). Fix some warnings triggered by LTO. The changes in `src/lib/ndpi_serializer.c` seams reasonable. However, the change in `tests/unit/unit.c` is due to the following warning, which seems to be a false positive. ``` unit.c: In function ‘serializerUnitTest’: ndpi_serializer.c:2258:13: error: ‘MEM[(struct ndpi_private_serializer )&deserializer].buffer.size’ may be used uninitialized in this function [-Werror=maybe-uninitialized] unit.c:67:31: note: ‘MEM[(struct ndpi_private_serializer )&deserializer].buffer.size’ was declared here 67 \| ndpi_serializer serializer, deserializer; \| ^ ndpi_serializer.c:2605:10: error: ‘MEM[(struct ndpi_private_serializer )&deserializer].status.buffer.size_used’ may be used uninitialized in this function [-Werror=maybe-uninitialized] unit.c:67:31: note: ‘MEM[(struct ndpi_private_serializer )&deserializer].status.buffer.size_used’ was declared here 67 \| ndpi_serializer serializer, deserializer; ``` Since this warning is triggered only with an old version of gcc and `tests/unit/unit.c` is used only during the tests, the easiest fix has been applied. Some (unknown to me) combinations of OS and compiler trigger the following warnings at linker time (with sanitizer and gold linker) ``` /usr/bin/ld.gold: warning: Cannot export local symbol '__asan_report_load1_asm' /usr/bin/ld.gold: warning: Cannot export local symbol '__asan_report_load2_asm' /usr/bin/ld.gold: warning: Cannot export local symbol '__asan_report_load4_asm' /usr/bin/ld.gold: warning: Cannot export local symbol '__asan_report_load8_asm' /usr/bin/ld.gold: warning: Cannot export local symbol '__asan_report_load16_asm' /usr/bin/ld.gold: warning: Cannot export local symbol '__asan_report_store1_asm' /usr/bin/ld.gold: warning: Cannot export local symbol '__asan_report_store2_asm' /usr/bin/ld.gold: warning: Cannot export local symbol '__asan_report_store4_asm' [..] ``` I have not found any references to this kind of message, with the only exception of https://sourceware.org/bugzilla/show_bug.cgi?id=25975 which seems to suggest that these messages can be safely ignored. In any case, the compilation results are sound. Fix `clean` target in the Makefile in the `example` directory. In OSS-Fuzz enviroments, `fuzz_ndpi_reader` reports a strange link error (as always, when the gold linker is involved...). It's come out that the culprit was the `tempnam` function: the code has been changed to use `tmpfile` instead. No sure why... :( Fuzzing target `fuzz_ndpi_reader.c` doesn't use `libndpiReader.a` anymore: this way we can use `--with-only-libndpi` flag on Oss-Fuzz builds as workaround for the "missing dependencies errors" described in https://github.com/google/oss-fuzz/issues/8939
*	Remove some useless fuzzing target (#1813)	Ivan Nardi	2022-12-01
\| \| \| \| \|	We don't need specific targets to reproduce fuzzing issues. After all, calling `./fuzz/fuzz_process_packet_with_main $ARTIFACT_FILE` is equivalento to `./fuzz/fuzz_process_packet $ARTIFACT_FILE`
*	Add a new fuzzer for QUIC (#1800)	Ivan Nardi	2022-12-01
\| \| \| \| \| \| \| \| \| \| \| \|	QUIC packets are encrypted/obfuscated; that means that we need to decrypt them before parsing the real (TLS) message. Fuzzing is not effective here, since a random buffer is hardly a valid encrypted QUIC packet. Add a new fuzzer, testing decrypted QUIC packets. Add a basic corpus. Fix a few bugs already found by this fuzzer.
*	Add basic profiling of memory allocations on data-path (#1789)	Ivan Nardi	2022-10-28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The goal is to have an idea of the memory allocation sizes performed in the library data-path, i.e. excluding init/deinit phases and all the allocations made by the application itself. In other words, how much memory is needed per-flow, by nDPI, other than `struct ndpi_flow_struct`? It works only on single-thread configurations. It is not enabled by default (in the unit tests) since different canfiguration options (example: `--enable-pcre`) lead to diffferent results. See: #1781
*	Fix compilation and sync utests results	Nardi Ivan	2022-10-04
\|
*	Improved nDPI JSON serialization. (#1689)	Toni	2022-08-02
\| \| \| \| \| \| \| \|	* fixed autoconf CFLAGS/LDFLAGS MSAN issue which could lead to build errors * introduced portable version of gmtime_r aka ndpi_gmtime_r * do as most as possible of the serialization work in ndpi_utils.c * use flow2json in ndpiReader Signed-off-by: lns <matzeton@googlemail.com>
*	Add support for flow client/server information (#1671)	Ivan Nardi	2022-07-24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In a lot of places in ndPI we use packet source/dest info (address/port/direction) when we are interested in flow client/server info, instead. Add basic logic to autodetect this kind of information. nDPI doesn't perform any "flow management" itself but this task is delegated to the external application. It is then likely that the application might provide more reliable hints about flow client/server direction and about the TCP handshake presence: in that case, these information might be (optionally) passed to the library, disabling the internal "autodetect" logic. These new fields have been used in some LRU caches and in the "guessing" algorithm. It is quite likely that some other code needs to be updated.
*	Do not interfere with CFLAGS/LDFLAGS env anymore. (#1659)	Toni	2022-07-13
\| \| \| \| \| \| \|	* CI fixes * some build systems do not like that (e.g. OpenWrt) * fixed some rrdtool related build warnings/errors Signed-off-by: Toni Uhlig <matzeton@googlemail.com>
*	Fix some debug messages (#1583)	Ivan Nardi	2022-06-05
\| \| \|	Increase max number of flows handled during fuzzing
*	Replaced ndpiReader's libjson-c support with libnDPI's internal ↵	Toni	2022-05-07
\| \| \| \| \| \| \| \| \| \| \|	serialization interface. (#1535) * Fixes #1528 * Serialization Interface should also fuzzed * libjson-c may only be used in the unit test to verify the internal serialization interface * Serialization Interface supports tlv(broken), csv and json * Unit test does work again and requires libjson-c Signed-off-by: lns <matzeton@googlemail.com>
*	fuzz: purge old sessions (#1451)	Ivan Nardi	2022-02-21
\| \| \| \| \| \| \| \|	At every fuzz iteration (i.e for every trace file): * keep the same ndpi context (`ndpi_init_detection_module` is very slow); * reset the flow table, otherwise it grows indefinitely. This change should fix the "out-of-memory" errors reported by oss-fuzz.
*	fuzz: make fuzz_ndpi_reader faster (#1446)	Ivan Nardi	2022-02-20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Initialize ndpi_workflow_init context only once. On a quite old notebook, before: ``` $ ./fuzz/fuzz_ndpi_reader -max_total_time="${MAX_TOTAL_TIME:-360}" -print_pcs=1 -workers="${FUZZY_WORKERS:-0}" -jobs="${FUZZY_JOBS:-0}" ./tests/pcap/ [...] Done 3256 runs in 361 second(s) ``` after: ``` Done 5032771 runs in 361 second(s) <----------- ~1400X ``` oss-fuzz will be happy!
*	Remove `struct ndpi_id_struct` (#1427)	Ivan Nardi	2022-01-30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Remove the last uses of `struct ndpi_id_struct`. That code is not really used and it has not been updated for a very long time: see #1279 for details. Correlation among flows is achieved via LRU caches. This change allows to further reduce memory consumption (see also 91bb77a8). At nDPI 4.0 (more precisly, at a6b10cf, because memory stats were wrong until that commit): ``` nDPI Memory statistics: nDPI Memory (once): 221.15 KB Flow Memory (per flow): 2.94 KB ``` Now: ``` nDPI Memory statistics: nDPI Memory (once): 235.27 KB Flow Memory (per flow): 688 B <-------- ``` i.e. memory usage per flow has been reduced by 77%. Close #1279
*	Fix two use-of-uninitialized-value errors (#1398)	Ivan Nardi	2022-01-12
\| \| \| \| \| \| \|	Found by oss-fuzz: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=40269 https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=41432 Fix fuzz compilation (follow-up of f5545a80)
*	Fix writes to `flow->protos` union fields (#1354)	Ivan Nardi	2021-11-15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We can write to `flow->protos` only after a proper classification. This issue has been found in Kerberos, DHCP, HTTP, STUN, IMO, FTP, SMTP, IMAP and POP code. There are two kinds of fixes: * write to `flow->protos` only if a final protocol has been detected * move protocol state out of `flow->protos` The hard part is to find, for each protocol, the right tradeoff between memory usage and code complexity. Handle Kerberos like DNS: if we find a request, we set the protocol and an extra callback to further parsing the reply. For all the other protocols, move the state out of `flow->protos`. This is an issue only for the FTP/MAIL stuff. Add DHCP Class Identification value to the output of ndpiReader and to the Jason serialization. Extend code coverage of fuzz tests. Close #1343 Close #1342
*	Fix broken fuzz_process_packet fuzzer by adding a call to ↵	Toni	2021-10-18
\| \| \| \| \| \| \| \| \| \| \| \|	ndpi_finalize_initialization(). (#1334) * fixed several memory errors (heap-overflow, unitialized memory, etc) * ability to build fuzz_process_packet with a main() allowing to replay crash data generated with fuzz_process_packet by LLVMs libfuzzer * temporarily disable fuzzing if `tests/do.sh` executed with env FUZZY_TESTING_ENABLED=1 Signed-off-by: Toni Uhlig <matzeton@googlemail.com>
*	Compile everything with "-W -Wall -Wno-unused-parameter" flags (#1276)	Ivan Nardi	2021-08-20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fix all the warnings. Getting rid of "-Wno-unused-parameter" is quite complex because some parameters usage depends on compilation variable (i.e. `--enable-debug-messages`). The "-Werror" flag has been added only in Travis builds to avoid breaking the builds to users using uncommon/untested OS/compiler/enviroment. Tested on: * x86_64; Ubuntu 20.04; gcc 7,8,9,10,11; clang 7,8,9,10,11,12 * x86_64; CentOS 7.7; gcc 4.8.5 (with "--disable-gcrypt" flag) * Raspberry 4; Debian 10.10; gcc 8.3.0
*	Fixed uninitialized memory use. (#1198)	Vitaly Lavrov	2021-06-07
\| \| \|	The pointer "header" must be initialized before first call pcap_next_ex().
*	Check datalink during fuzzing to prevent console / logfile spam. See #1175 ↵	Toni	2021-05-09
\| \| \| \| \|	for more information. (#1177) Signed-off-by: Toni Uhlig <matzeton@googlemail.com>
*	Compilation fix	Luca Deri	2021-04-26
\|