libndpi.git - Open Source Deep Packet Inspection Software Toolkit

	Commit message (Collapse)	Author	Age
*	Improved nDPI JSON serialization. (#1689)	Toni	2022-08-02
\| \| \| \| \| \| \| \|	* fixed autoconf CFLAGS/LDFLAGS MSAN issue which could lead to build errors * introduced portable version of gmtime_r aka ndpi_gmtime_r * do as most as possible of the serialization work in ndpi_utils.c * use flow2json in ndpiReader Signed-off-by: lns <matzeton@googlemail.com>
*	SMTPS, POPS, IMAPS: fix classification and extra dissection (#1685)	Ivan Nardi	2022-07-30
\| \| \| \| \| \|	The big change in TLS code is to allow "master" protocols other than TLS/DTLS, like SMTPS, POPS and IMAPS. This change will allow, in a future, a proper and complete TLS dissection for all these protocols with "STARTTLS"-like messages.
*	Add Softether dissector. (#1679)	Toni	2022-07-29
\| \| \|	Signed-off-by: lns <matzeton@googlemail.com>
*	Patricia tree, Ahocarasick automa, LRU cache: add statistics (#1683)	Ivan Nardi	2022-07-29
\| \| \| \| \| \| \| \| \| \|	Add (basic) internal stats to the main data structures used by the library; they might be usefull to check how effective these structures are. Add an option to `ndpiReader` to dump them; enabled by default in the unit tests. This new option enables/disables dumping of "num dissectors calls" values, too (see b4cb14ec).
*	Revert "Patricia tree, Ahocarasick automa, LRU cache: add statistics ↵	Ivan Nardi	2022-07-29
\| \| \| \| \|	(#1677)" (#1682) This reverts commit bb83899985c25097341b947c2c535f56254a075c.
*	Patricia tree, Ahocarasick automa, LRU cache: add statistics (#1677)	Ivan Nardi	2022-07-29
\| \| \| \| \| \| \| \|	Add (basic) internal stats to the main data structures used by the library; they might be usefull to check how effective these structures are. Add an option to `ndpiReader` to dump them; disabled by default to avoid too much fuss with the unit tests.
*	Keep track of how many dissectors calls we made for each flow (#1657)	Ivan Nardi	2022-07-11
\|
*	bins: add support for 64bit bins (#1626)	Ivan Nardi	2022-07-03
\|
*	Fix byte-order issue during ndpiReader tcp/udp src/dst port serialization. ↵	Toni	2022-07-03
\| \| \| \| \| \| \| \|	Fixes #1608. (#1614) * fixed possible memory leak caused by an invalid call to `node_proto_guess_walker()` during serialization * execute serialization code while running regression tests Signed-off-by: Toni Uhlig <matzeton@googlemail.com>
*	Replaced nDPI's internal hashmap with uthash. (#1602)	Toni	2022-06-17
\| \| \|	Signed-off-by: lns <matzeton@googlemail.com>
*	Add some statistics to ndpiReader (#1587)	claudio_burrafato	2022-06-15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Add some statistics to ndpiReader The purpose of this version of ndpiReader is too adding some other statistics printed by ndpiReader. In this simple version the domain names(in the flows) that are collected are: flow-> ssh_tls.server_info flow-> host_server_name and are placed in a UT_hash_table, ordering them by number of occurrences. * Update example/ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com> * Update example/ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com> * Update example/ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com> * Update example/ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com> * Update example/ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com> * Update example/ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com> * Update example/ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com> * Update example/ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com> * Update example/ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com> * Update example/ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com> * Update example/ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com> * Update example/ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com> * Update ndpiReader.c * Update ndpiReader.c * Update ndpiReader.c * Update ndpiReader.c Co-authored-by: Toni <matzeton@googlemail.com>
*	Improved DNS traffic analysis	Luca Deri	2022-06-13
\| \| \| \|	Added ability to identify application and network protocols
*	Added RiskInfo string	Luca Deri	2022-05-30
\|
*	Compilation fixes for old gcc compiler	Luca Deri	2022-05-28
\|
*	Replaced ndpiReader's libjson-c support with libnDPI's internal ↵	Toni	2022-05-07
\| \| \| \| \| \| \| \| \| \| \|	serialization interface. (#1535) * Fixes #1528 * Serialization Interface should also fuzzed * libjson-c may only be used in the unit test to verify the internal serialization interface * Serialization Interface supports tlv(broken), csv and json * Unit test does work again and requires libjson-c Signed-off-by: lns <matzeton@googlemail.com>
*	Added ability to store custom category file in patricia tree	Luca Deri	2022-04-26
\|
*	Reduce ndpiReader's `-h' spam. `-H' does this job now. (#1523)	Toni	2022-04-21
\| \| \|	Signed-off-by: lns <matzeton@googlemail.com>
*	DGA improvements	Luca Deri	2022-04-16
\|
*	Fixed msys2 build warnings and re-activated CI Mingw64 build.fix/windows-msys2	Toni Uhlig	2022-04-14
\| \| \| \| \| \| \|	* Removed Visual Studio leftovers. Maintaining an autotools project with VS integration requires some additional overhead. Signed-off-by: Toni Uhlig <matzeton@googlemail.com> Signed-off-by: lns <matzeton@googlemail.com>
*	Added ndpi_find_outliers() API call using Z-Score	Luca Deri	2022-04-04
\|
*	ndpiReader: fix compilation (#1510)	Ivan Nardi	2022-04-01
\| \| \| \|	Not sure why Windows started complaining... anyway, the fixes has been taken from https://github.com/ntop/nDPI/pull/1491: credits to @lnslbrty
*	Fixed incompatibilities due to https://github.com/ntop/nDPI/pull/1509	Luca Deri	2022-04-01
\|
*	DGA improvements	Luca Deri	2022-04-01
\|
*	Waring fixes	Luca Deri	2022-04-01
\|
*	ndpireader: add json output back. (#1509)	Vitaliy Ivanov	2022-04-01
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	- partial revert of: commit 51cfdfb0d80a7bbcc11bc3b95d1696d8dae900c2 Author: Luca Deri <deri@ntop.org> Date: Sun Nov 17 17:51:45 2019 +0100 Removed unused JSON-C code - Json option is changed from 'j' to 'k' as it's used in the new codebase. - use HAVE_LIBJSON_C instead of HAVE_JSON_C. - tabs vs spaces clean ups. Signed-off-by: Vitaliy Ivanov <vitaliyi@interfacemasters.com> Conflicts: example/ndpiReader.c
*	Improved DGA detection	Luca Deri	2022-03-31
\|
*	Improved DGA detection skipping names containign at least 3 consecutive ↵	Luca Deri	2022-03-26
\| \| \| \|	digits in the first word
*	Bug fixing. (#1459)	Vitaly Lavrov	2022-02-28
\| \| \| \| \|	The '--enable-debug-messages' option works again. Fixed warning in ahocorasick.c Fixed integer overflow in ndpiReader.c for 32bit systems.
*	fuzz: purge old sessions (#1451)	Ivan Nardi	2022-02-21
\| \| \| \| \| \| \| \|	At every fuzz iteration (i.e for every trace file): * keep the same ndpi context (`ndpi_init_detection_module` is very slow); * reset the flow table, otherwise it grows indefinitely. This change should fix the "out-of-memory" errors reported by oss-fuzz.
*	Added newflow risk NDPI_HTTP_CRAWLER_BOT	Luca Deri	2022-02-17
\|
*	Remove `struct ndpi_id_struct` (#1427)	Ivan Nardi	2022-01-30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Remove the last uses of `struct ndpi_id_struct`. That code is not really used and it has not been updated for a very long time: see #1279 for details. Correlation among flows is achieved via LRU caches. This change allows to further reduce memory consumption (see also 91bb77a8). At nDPI 4.0 (more precisly, at a6b10cf, because memory stats were wrong until that commit): ``` nDPI Memory statistics: nDPI Memory (once): 221.15 KB Flow Memory (per flow): 2.94 KB ``` Now: ``` nDPI Memory statistics: nDPI Memory (once): 235.27 KB Flow Memory (per flow): 688 B <-------- ``` i.e. memory usage per flow has been reduced by 77%. Close #1279
*	Minor cosmetic changes	Luca Deri	2022-01-16
\|
*	Added the ability to specify trusted issueDN often used in companies to ↵	Luca Deri	2022-01-13
\| \| \| \| \| \| \| \| \| \| \|	self-signed certificates This allows to avoid triggering alerts for trusted albeit private certificate issuers. Extended the example/protos.txt with the new syntax for specifying trusted issueDN. Example: trusted_issuer_dn:"CN=813845657003339838, O=Code42, OU=TEST, ST=MN, C=US"
*	Removed outdated comment	Luca Deri	2022-01-11
\|
*	Removed legacy code	Luca Deri	2022-01-11
\|
*	Add a "confidence" field about the reliability of the classification. (#1395)	Ivan Nardi	2022-01-11
\| \| \| \| \| \| \| \| \| \| \| \| \|	As a general rule, the higher the confidence value, the higher the "reliability/precision" of the classification. In other words, this new field provides an hint about "how" the flow classification has been obtained. For example, the application may want to ignore classification "by-port" (they are not real DPI classifications, after all) or give a second glance at flows classified via LRU caches (because of false positives). Setting only one value for the confidence field is a bit tricky: more work is probably needed in the next future to tweak/fix/improve the logic.
*	Update copyright	Alfredo Cardigliano	2022-01-03
\|
*	A final(?) effort to reduce memory usage per flow (#1389)	Ivan Nardi	2021-12-22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Remove some unused fields and re-organize other ones. In particular: * Update the parameters of `ndpi_ssl_version2str()` function * Zattoo, Thunder: these timestamps aren't really used. * Ftp/mail: these protocols are dissected only over TCP. * Attention must be paid to TLS.Bittorrent flows to avoid invalid read/write to `flow->protos.bittorrent.hash` field. This is the last(?) commit of a long series (see 22241a1d, 227e586e, 730c2360, a8ffcd8b) aiming to reduce library memory consumption. Before, at nDPI 4.0 (more precisly, at a6b10cf7, because memory stats were wrong until that commit): ``` nDPI Memory statistics: nDPI Memory (once): 221.15 KB Flow Memory (per flow): 2.94 KB ``` Now: ``` nDPI Memory statistics: nDPI Memory (once): 231.71 KB Flow Memory (per flow): 1008 B <--------- ``` i.e. memory usage per flow has been reduced by 66%, dropping below the psychological threshold of 1 KB. To further reduce this value, we probably need to look into #1279: let's fight this battle another day.
*	Fix configure script (after fb85dac9) (#1381)	Ivan Nardi	2021-12-04
\| \| \|	Fix/disable some LGTM warnings
*	Added example for finding similarities in RRDs using nDPI statistical APIs	Luca Deri	2021-12-04
\|
*	ndpiReader: slight simplificaton of the output (#1378)	Ivan Nardi	2021-11-27
\|
*	Rework how hostname/SNI info is saved (#1330)	Ivan Nardi	2021-11-24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Looking at `struct ndpi_flow_struct` the two bigger fields are `host_server_name[240]` (mainly for HTTP hostnames and DNS domains) and `protos.tls_quic.client_requested_server_name[256]` (for TLS/QUIC SNIs). This commit aims to reduce `struct ndpi_flow_struct` size, according to two simple observations: 1) maximum one of these two fields is used for each flow. So it seems safe to merge them; 2) even if hostnames/SNIs might be very long, in practice they are rarely longer than a fews tens of bytes. So, using a (single) large buffer is a waste of memory for all kinds of flows. If we need to truncate the name, we keep the last characters, easing domain matching. Analyzing some real traffic, it seems safe to assume that the vast majority of hostnames/SNIs is shorter than 80 bytes. Hostnames/SNIs are always converted to lowercase. Attention was given so as to be sure that unit-tests outputs are not affected by this change. Because of a bug, TLS/QUIC SNI were always truncated to 64 bytes (the first 64 ones): as a consequence, there were some "Suspicious DGA domain name" and "TLS Certificate Mismatch" false positives.
*	Fix writes to `flow->protos` union fields (#1354)	Ivan Nardi	2021-11-15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We can write to `flow->protos` only after a proper classification. This issue has been found in Kerberos, DHCP, HTTP, STUN, IMO, FTP, SMTP, IMAP and POP code. There are two kinds of fixes: * write to `flow->protos` only if a final protocol has been detected * move protocol state out of `flow->protos` The hard part is to find, for each protocol, the right tradeoff between memory usage and code complexity. Handle Kerberos like DNS: if we find a request, we set the protocol and an extra callback to further parsing the reply. For all the other protocols, move the state out of `flow->protos`. This is an issue only for the FTP/MAIL stuff. Add DHCP Class Identification value to the output of ndpiReader and to the Jason serialization. Extend code coverage of fuzz tests. Close #1343 Close #1342
*	Reduce memory used by `ndpiReader` (#1371)	Ivan Nardi	2021-11-11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	`ndpiReader` is only an example, aiming to show nDPI capabilities and integration, without any claim about performances. Nonetheless its memory usage per flow is huge, limiting the kinds of traces that we can test on a "normal" hardware (example: scan attacks). The key reason of that behaviour is that we preallocate all the memory needed for all the available features. Try to reduce memory usage simply allocating some structures only when they are really needed. Most significant example: JOY algorithms. This way we should use a lot less memory in the two most common user-cases: * `ndpiReader` invoked without any particular flag (i.e `ndpiReader -i $FILENAME_OR_IFACE`) * internal unit tests Before (on x86_64): ``` struct ndpi_flow_info { [...] /* size: 7320, cachelines: 115, members: 72 / ``` After: ``` struct ndpi_flow_info { [...] / size: 2128, cachelines: 34, members: 75 */ ```
*	Add detection of OCSP (#1370)	Ivan Nardi	2021-11-11
\| \| \| \| \| \| \| \| \| \|	This protocol is detected via HTTP Content-Type header. Until 89d548f9, nDPI had a dedicated automa (`content_automa`) to classify a HTTP flow according to this header. Since then, this automa has been useless because it is always empty. Re-enable it to match only a string seems overkilling. Remove all `content_automa` leftovers.
*	Implemented ndpi_ses_fitting() and ndpi_des_fitting()	Luca Deri	2021-10-12
\| \| \| \|	for comuting the best alpha/beta values for exponential smoothing
*	Fix compilation with clang-13 or if some debug macros are enabled (#1326)	Ivan Nardi	2021-10-06
\|
*	Improved DGA detection for skipping potential DGAs of known/popular domain names	Luca Deri	2021-10-05
\|
*	Added -a <num> to ndpiReader for generating OPNsense configuration	Luca Deri	2021-10-04
\| \| \| \|	See https://github.com/ntop/opnsense
*	Removed trace	Luca Deri	2021-10-03
\|