Until now the flow manager would walk the entire flow hash table on an
interval. It would thus touch all flows, leading to a lot of memory
and cache pressure. In scenarios where the number of tracked flows
runs into the hundreds of thousands, and the memory used can run into
many hundreds of megabytes or even gigabytes, this would lead to
serious performance degradation.
This patch introduces a new approach. A timestamp per flow bucket
(hash row) is maintained by the flow manager. It holds the timestamp
of the earliest possible timeout of a flow in the list. The hash walk
skips rows with timestamps beyond the current time.
As the timestamp depends on the flows in the hash row's list, and on
the 'state' of each flow in the list, adding a flow or changing a
flow's state invalidates the timestamp. The flow manager then has to
walk the list again to set a new timestamp.
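A minimal sketch of the walk, assuming the row timestamp lives in a
next_ts field that is read atomically; FlowManagerHashRowTimeout is a
hypothetical helper, not the literal implementation:

    /* Sketch: skip hash rows whose earliest possible flow timeout
     * still lies in the future. Simplified against the real code. */
    static void FlowManagerHashWalk(FlowBucket *hash, uint32_t hash_size,
                                    uint32_t now)
    {
        for (uint32_t idx = 0; idx < hash_size; idx++) {
            FlowBucket *fb = &hash[idx];

            /* row timestamp is read without taking the bucket lock */
            if ((uint32_t)SC_ATOMIC_GET(fb->next_ts) > now)
                continue;        /* nothing in this row can time out */

            FBLOCK_LOCK(fb);
            /* check each flow for timeout, track the earliest timeout
             * left in the row, store it as the new row timestamp */
            uint32_t min_ts = FlowManagerHashRowTimeout(fb, now);
            SC_ATOMIC_SET(fb->next_ts, min_ts);
            FBLOCK_UNLOCK(fb);
        }
    }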
A utility function FlowUpdateState is introduced to change Flow states,
taking care of the bucket timestamp invalidation while at it.
Empty flow buckets use a special value so that we don't have to take
the flow bucket lock to find out the bucket is empty.
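A sketch of the state-change helper and the invalidation, where the
reset value (0) and the FLOW_BUCKET_EMPTY_TS sentinel are assumptions
for illustration:

    #include <limits.h>

    #define FLOW_BUCKET_EMPTY_TS UINT_MAX /* assumed "row is empty"
                                           * marker, readable without
                                           * taking the bucket lock */

    /* Sketch: change a flow's state and invalidate the row timestamp
     * so the flow manager recomputes it on its next pass. */
    void FlowUpdateState(Flow *f, const enum FlowState s)
    {
        f->flow_state = s;
        if (f->fb != NULL)
            SC_ATOMIC_SET(f->fb->next_ts, 0); /* 0 == always re-check */
    }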
This patch also adds more performance counters:
flow_mgr.flows_checked | Total | 929
flow_mgr.flows_notimeout | Total | 391
flow_mgr.flows_timeout | Total | 538
flow_mgr.flows_removed | Total | 277
flow_mgr.flows_timeout_inuse | Total | 261
flow_mgr.rows_checked | Total | 1000000
flow_mgr.rows_skipped | Total | 998835
flow_mgr.rows_empty | Total | 290
flow_mgr.rows_maxlen | Total | 2
flow_mgr.flows_checked: number of flows checked for timeout in the
last pass
flow_mgr.flows_notimeout: number of flows out of flow_mgr.flows_checked
that didn't time out
flow_mgr.flows_timeout: number of flows out of flow_mgr.flows_checked
that did reach the timeout
flow_mgr.flows_removed: number of flows out of flow_mgr.flows_timeout
that were really removed
flow_mgr.flows_timeout_inuse: number of flows out of flow_mgr.flows_timeout
that were still in use or needed work
flow_mgr.rows_checked: hash table rows checked
flow_mgr.rows_skipped: hash table rows skipped because none of the
flows would time out anyway
The counters below relate only to rows that were not skipped.
flow_mgr.rows_empty: empty hash rows
flow_mgr.rows_maxlen: maximum number of flows in a hash row. Best
kept low; increase hash-size if needed.
flow_mgr.rows_busy: rows skipped because they were locked by another
thread
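For context, a sketch of how such counters could be registered and
updated, assuming the Stats API (StatsRegisterCounter/StatsIncr);
variable names are illustrative:

    /* thread init: register the new counters (sketch) */
    uint16_t cnt_flows_checked =
        StatsRegisterCounter("flow_mgr.flows_checked", tv);
    uint16_t cnt_rows_skipped =
        StatsRegisterCounter("flow_mgr.rows_skipped", tv);

    /* hash walk: update them per row / per flow */
    if ((uint32_t)SC_ATOMIC_GET(fb->next_ts) > now) {
        StatsIncr(tv, cnt_rows_skipped);
    } else {
        /* bucket lock held; timeout checks elided */
        for (Flow *f = fb->head; f != NULL; f = f->hnext)
            StatsIncr(tv, cnt_flows_checked);
    }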
Update Flow lookup functions to get a flow reference during lookup.
This reference is set under the FlowBucket lock.
This paves the way to not getting a flow lock during lookups.
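A condensed sketch of the lookup shape; FlowLookupInBucket is a
hypothetical stand-in for the real hash-row search:

    /* Sketch: bump the flow's use_cnt while the bucket lock is held,
     * so the caller holds a reference without taking the flow lock. */
    Flow *FlowGetFlowFromHash(const Packet *p)
    {
        FlowBucket *fb =
            &flow_hash[FlowGetHash(p) % flow_config.hash_size];
        FBLOCK_LOCK(fb);
        Flow *f = FlowLookupInBucket(fb, p); /* hypothetical helper */
        if (f != NULL)
            FlowIncrUsecnt(f); /* reference taken under bucket lock */
        FBLOCK_UNLOCK(fb);
        return f;              /* caller owns one use_cnt reference */
    }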
In the case of autofp (or more generally, when the flow and stream
engines run in different threads) the flow engine should not trigger
flow reuse, as this can lead to race conditions between the flow and
the stream engine.
In such cases, the flow engine can be far ahead of the stream engine
as packets are in a queue between the threads.
Observed:
Flow engine tags packet 10 as start of new flow. Flow is tagged as
'reused'.
Stream engine evaluates packet 5, which belongs to the old flow. It
rejects the flow as it's tagged 'reused' and attaches packet 5 to the
new flow, which is wrong.
Solution:
This patch ties the flow engine's handling of reuse cases to the
runmode. It hooks into the RunmodeSetFlowStreamAsync() call to notify
the flow engine that it shouldn't handle the reuse.
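A sketch of the hook; the flag and the FlowEngineHandlesReuse()
helper are assumptions, only RunmodeSetFlowStreamAsync() comes from
the change itself:

    /* Sketch: the runmode marks flow/stream as asynchronous, and the
     * flow engine consults that before handling reuse itself. */
    static int flow_stream_async = 0;

    void RunmodeSetFlowStreamAsync(void)
    {
        flow_stream_async = 1;
    }

    int FlowEngineHandlesReuse(void)
    {
        /* only handle reuse when the flow and stream engines share a
         * thread; otherwise leave it to the stream side */
        return (flow_stream_async == 0);
    }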
Most flows are marked for clean up by the flow manager, which then
passes them to the recycler. The recycler logs and cleans up. However,
under resource stress conditions the packet threads can recycle an
existing flow directly. Here the recycler has no role to play, as the
flow is reused immediately.
For this reason, the packet threads need to be able to invoke the
flow logger directly.
The flow logging thread ctx will be stored in the DecodeThreadVars
structure. Therefore, this patch makes the DecodeThreadVars an argument
to FlowHandlePacket.
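The resulting call shape, sketched; the emergency path details and
FlowGetUsedFlow's use of the dtv are simplified assumptions:

    /* Sketch: DecodeThreadVars carries the flow logger's thread ctx,
     * so a packet thread that recycles a flow in-line can log it. */
    void FlowHandlePacket(ThreadVars *tv, DecodeThreadVars *dtv,
                          Packet *p)
    {
        Flow *f = FlowGetFlowFromHash(p);
        if (f == NULL) {
            /* emergency: reuse an existing flow directly; the
             * recycler never sees it, so it is logged from here via
             * the logger ctx carried in dtv */
            f = FlowGetUsedFlow(tv, dtv);
        }
        /* attach flow to packet; reference handling elided */
        p->flow = f;
    }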
By moving FlowReference() out of FlowGetFlowFromHash() and into the one
function that calls it, all the flow functions take const Packet * instead
of Packet *.
Major redesign of the flow engine. Remove the flow queues that turned
out to be major choke points when using many threads. Flow manager now
walks the hash table directly. Simplify the way we get a new flow in
case of emergency.