libndpi.git - Open Source Deep Packet Inspection Software Toolkit

	Commit message (Collapse)	Author	Age
*	Allow multiple `struct ndpi_detection_module_struct` to share some state (#2271)	Ivan Nardi	2024-02-01
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add the concept of "global context". Right now every instance of `struct ndpi_detection_module_struct` (we will call it "local context" in this description) is completely independent from each other. This provide optimal performances in multithreaded environment, where we pin each local context to a thread, and each thread to a specific CPU core: we don't have any data shared across the cores. Each local context has, internally, also some information correlating different flows; something like: ``` if flow1 (PeerA <-> Peer B) is PROTOCOL_X; then flow2 (PeerC <-> PeerD) will be PROTOCOL_Y ``` To get optimal classification results, both flow1 and flow2 must be processed by the same local context. This is not an issue at all in the far most common scenario where there is only one local context, but it might be impractical in some more complex scenarios. Create the concept of "global context": multiple local contexts can use the same global context and share some data (structures) using it. This way the data correlating multiple flows can be read/write from different local contexts. This is an optional feature, disabled by default. Obviously data structures shared in a global context must be thread safe. This PR updates the code of the LRU implementation to be, optionally, thread safe. Right now, only the LRU caches can be shared; the other main structures (trees and automas) are basically read-only: there is little sense in sharing them. Furthermore, these structures don't have any information correlating multiple flows. Every LRU cache can be shared, independently from the others, via `ndpi_set_config(ndpi_struct, NULL, "lru.$CACHE_NAME.scope", "1")`. It's up to the user to find the right trade-off between performances (i.e. without shared data) and classification results (i.e. with some shared data among the local contexts), depending on the specific traffic patterns and on the algorithms used to balance the flows across the threads/cores/local contexts. Add some basic examples of library initialization in `doc/library_initialization.md`. This code needs libpthread as external dependency. It shouldn't be a big issue; however a configure flag has been added to disable global context support. A new CI job has been added to test it. TODO: we should need to find a proper way to add some tests on multithreaded enviroment... not an easy task... * API changes * If you are not interested in this feature, simply add a NULL parameter to any `ndpi_init_detection_module()` calls.
*	Provide a u64 wrapper for `ndpi_set_config()` (#2292)	Toni	2024-01-30
\| \| \|	Signed-off-by: Toni Uhlig <matzeton@googlemail.com>
*	fuzz: extend fuzzing coverage (#2281)	Ivan Nardi	2024-01-24
\|
*	config: follow-up (#2268)	Ivan Nardi	2024-01-20
\| \| \| \| \| \|	Some changes in the parameters names. Add a fuzzer to fuzz the configuration file format. Add the infrastructure to configuratin callbacks. Add an helper to map LRU cache indexes to names.
*	config: allow configuration of guessing algorithms	Nardi Ivan	2024-01-18
\|
*	config: move debug/log configuration to the new API	Nardi Ivan	2024-01-18
\|
*	config: configure TLS certificate expiration with the new API	Nardi Ivan	2024-01-18
\|
*	config: remove `enum ndpi_prefs`	Nardi Ivan	2024-01-18
\|
*	config: remove `ndpi_set_detection_preferences()`	Nardi Ivan	2024-01-18
\|
*	config: move cfg of aggressiviness and opportunistic TLS to the new API	Nardi Ivan	2024-01-18
\|
*	config: move LRU cache configurations to the new API	Nardi Ivan	2024-01-18
\|
*	Make `ndpi_finalize_initialization()` returns an error code	Nardi Ivan	2024-01-18
\| \| \| \|	We should check if the initialization was fine or not
*	New API for library configuration	Nardi Ivan	2024-01-18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the first step into providing (more) configuration options in nDPI. The idea is to have a simple way to configure (most of) nDPI: only one function (`ndpi_set_config()`) to set any configuration parameters (in the present or on in the future) and we try to keep this function prototype as agnostic as possible. You can configure the library: * via API, using `ndpi_set_config()` * via a configuration file, in a text format This way, anytime we need to add a new configuration parameter: * we don't need to add two public functions (a getter and a setter) * we don't break API/ABI compatibility of the library; even changing the parameter type (from integer to a list of integer, for example) doesn't break the compatibility. The complete list of configuration options is provided in `doc/configuration_parameters.md`. As a first example, two configuration knobs are provided: * the ability to enable/disable the extraction of the sha1 fingerprint of the TLS certificates. * the upper limit on the number of packets per flow that will be subject to inspection
*	Added ndpi_get_host_domain() for returning the host domain	Luca	2024-01-16
\| \| \| \|	vs ndpi_get_host_domain_prefix() that instead returnd the host TLD
*	Added new API calls	Luca	2024-01-15
\| \| \| \| \| \| \| \| \| \|	- ndpi_load_domain_suffixes() - ndpi_get_host_domain_suffix() whose goal is to find the domain name of a hostname. Example: www.bbc.co.uk -> co.uk mail.apple.com -> com
*	Add an implementation of the BSD function `strtonum` (#2238)	Ivan Nardi	2024-01-04
\| \| \| \| \|	The main difference with the original function is that we allow to specify the base. Credit for the original idea and the first implementation to @0xA50C1A1
*	Add IEC62056 (DLMS/COSEM) protocol dissector (#2229)	Vladimir Gavrilov	2024-01-02
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Add IEC62056 (DLMS/COSEM) protocol dissector * Fix detection on big endian architectures * Update protocols.rst * Add ndpi_crc16_x25 to fuzz/fuzz_alg_crc32_md5.c * Update pcap sample * Remove empty .out file * iec62056: add some documentation --------- Co-authored-by: Nardi Ivan <nardi.ivan@gmail.com>
*	fuzz: improve fuzzing coverage (#2239)	Ivan Nardi	2024-01-02
\|
*	Implemented ndpi_is_outlier() for detecting outliers using z-score	Luca	2023-12-28
\|
*	Implements ndpi_pearson_correlation for measuring how correlated are two series	Luca Deri	2023-12-27
\|
*	New ndpi_sha256() nDPI API call (#2230)	Luca Deri	2023-12-23
\| \| \| \| \|	* Added ndpi_sha256.c to the Windows project * Added ndpi_sha256() nDPI API call
*	fuzz: extend fuzzing coverage (#2208)	Ivan Nardi	2023-12-11
\|
*	STUN: rework extra dissection (#2202)	Ivan Nardi	2023-12-11
\| \| \| \| \| \| \|	Keep looking for RTP packets but remove the monitoring concept. We will re-introduce a more general concept of "flow in monitoring state" later. The function was disabled by default. Some configuration knobs will be provided when/if #2190 is merged.
*	Add some fast CRC16 algorithms implementation (#2195)	Vladimir Gavrilov	2023-12-05
\| \| \| \| \| \| \| \| \|	* Add some fast CRC16 algorithms implementation * Update ndpi_crc.c * Move crc16 stuff to ndpi_analyze.c * IEEE C37.118: use new fast CRC-16/CCITT-FALSE implementation
*	Keep separating public and private API (#2157)	Ivan Nardi	2023-11-29
\| \| \|	See: b08c787fe
*	Fixed implicit u32 cast in `ndpi_data_min()` / `ndpi_data_max()`. (#2139)	Toni	2023-11-09
\| \| \|	Signed-off-by: Toni Uhlig <matzeton@googlemail.com>
*	Rename some functions with more useful/clear names (#2127)	Ivan Nardi	2023-10-29
\|
*	IPv6: add support for custom categories (#2126)	Ivan Nardi	2023-10-29
\|
*	ipv6: add support for ipv6 addresses lists (#2113)	Ivan Nardi	2023-10-26
\|
*	Added NDPI_MALWARE_HOST_CONTACTED flow risk	Luca Deri	2023-10-13
\|
*	QUIC: export QUIC version as metadata	Nardi Ivan	2023-10-11
\|
*	Added printf/fprintf replacement for some internal modules. (#1974)	Toni	2023-09-26
\| \| \| \| \| \|	* logging is instead redirected to `ndpi_debug_printf` Signed-off-by: lns <matzeton@googlemail.com> Signed-off-by: Toni Uhlig <matzeton@googlemail.com>
*	Fix some prototypes (#2085)	Ivan Nardi	2023-09-18
\| \| \| \| \|	``` error: function declaration isn’t a prototype [-Werror=strict-prototypes] ```
*	Add `ndpi_domain_classify_finalize()` function (#2084)	Ivan Nardi	2023-09-12
\| \| \| \| \| \| \| \| \|	The "domain classify" data structure is immutable, since it uses "bitmap64". Allow to finalize it before starting to process packets (i.e. before calling `ndpi_domain_classify_contains()`) to avoid, in the data-path, all the memory allocations due to compression. Calling `ndpi_domain_classify_finalize()` is optional.
*	fuzz: add fuzzers to test bitmap64 and domain_classify data structures (#2082)	Ivan Nardi	2023-09-10
\|
*	Cleanup	Luca	2023-09-07
\|
*	Improved classification further reducing memory used	Luca Deri	2023-09-05
\|
*	Added ndpi_bitmap64 support	Luca Deri	2023-09-05
\|
*	Added ndpi_murmur_hash to the nDPI API	Luca Deri	2023-09-04
\|
*	Reworked domain classification based on binary filters	Luca Deri	2023-09-02
\|
*	Code cleanup	Luca Deri	2023-09-01
\|
*	Added ndpi_binary_bitmap datastruture	Luca Deri	2023-08-31
\| \| \| \| \|	It is similar to ndpi_filter but based on binary search and with the ability to store a category per value (as ndpi_domain_classify)
*	Code cleanup	Luca Deri	2023-08-31
\|
*	Added comment	Luca Deri	2023-08-31
\|
*	Swap from Aho-Corasick to an experimental/home-grown algorithm that uses a ↵	Luca Deri	2023-08-29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	probabilistic approach for handling Internet domain names. For switching back to Aho-Corasick it is necessary to edit ndpi-typedefs.h and uncomment the line // #define USE_LEGACY_AHO_CORASICK [1] With Aho-Corasick $ ./example/ndpiReader -G ./lists/ -i tests/pcap/ookla.pcap \| grep Memory nDPI Memory statistics: nDPI Memory (once): 37.34 KB Flow Memory (per flow): 960 B Actual Memory: 33.09 MB Peak Memory: 33.09 MB [2] With the new algorithm $ ./example/ndpiReader -G ./lists/ -i tests/pcap/ookla.pcap \| grep Memory nDPI Memory statistics: nDPI Memory (once): 37.31 KB Flow Memory (per flow): 960 B Actual Memory: 7.42 MB Peak Memory: 7.42 MB In essence from ~33 MB to ~7 MB This new algorithm will enable larger lists to be loaded (e.g. top 1M domans https://s3-us-west-1.amazonaws.com/umbrella-static/index.html) In ./lists there are file names that are named as <category>_<string>.list With -G ndpiReader can load all of them at startup
*	Added ndpi_domain_classify_XXX(0 API	Luca Deri	2023-08-26
\|
*	added bimap and/or with allocation	Luca Deri	2023-08-24
\|
*	Minor improvements	Luca Deri	2023-08-23
\|
*	Added ndpi_bitmap_is_empty() and ndpi_bitmap_optimize() API calls	Luca	2023-08-23
\|
*	Added ndpi_bitmap_andnot API call	Luca	2023-08-21
\|