Afpacket improvements/v13 #6551

victorjulien · 2021-10-31T22:09:14Z

Continuation of #6544 work.

https://redmine.openinfosecfoundation.org/issues/4785
https://redmine.openinfosecfoundation.org/issues/4796
https://redmine.openinfosecfoundation.org/issues/4800

Add tpacket v2 ring diag code that dumps the ring if we appear stuck for a long time.

The Suricata AF_PACKET code opens a socket per thread, then after some minor setup enters a loop where the socket is poll()'d with a timeout. When the poll() call returns a non zero positive value, the AF_PACKET ring will be processed. The ringbuffer processing logic has a pointer into the ring where we last checked the ring. From this position we will inspect each frame until we find a frame with tp_status == TP_STATUS_KERNEL (so essentially 0). This means the frame is currently owned by the kernel. There is a special case handling for starting the ring processing but finding a TP_STATUS_KERNEL immediately. This logic then skip to the next frame, rerun the check, etc until it either finds an initialized frame or the last frame of the ringbuffer. The problem was, however, that the initial uninitialized frame was possibly (likely?) still being initialized by the kernel. A data race between the notification through the socket (the poll()) and the updating of the `tp_status` field in the frame could lead to a valid frame getting skipped. Of note is that for example libpcap does not do frame scanning. Instead it simply exits it ring processing loop. Also interesting is that libpcap uses atomic loads and stores on the tp_status field. This skipping of frames had 2 bad side effects: 1. in most cases, the buffer would be full enough that the frame would be processed in the next pass of the ring, but now the frame would out of order. This might have lead to packets belong to the same flow getting processed in the wrong order. 2. more severe is the soft lockup case. The skipped frame sits at ring buffer index 0. The rest of the ring has been cleared, after the initial frame was skipped. As our pass of the ring stops at the end of the ring (ptv->frame_offset + 1 == ptv->req.v2.tp_frame_nr) the code exits the ring processing loop at goes back to poll(). However, poll() will not indicate that there is more data, as the stale frame in the ring blocks the kernel from populating more frames beyond it. This is now a dead lock, as the kernel waits for Suricata and Suricata never touches the ring until it hears from the kernel. The scan logic will scan the whole ring at most once, so it won't reconsider the stale frame either. This patch addresses the issues in several ways: 1. the startup "discard" logic was fixed to not skip over kernel frames. Doing so would get us in a bad state at start up. 2. Instead of scanning the ring, we now enter a busy wait loop when encountering a kernel frame where we didn't expect one. This means that if we got a > 0 poll() result, we'll busy wait until we get at least one frame. 3. Error handling is unified and cleaned up. Any frame error now returns the frame to the kernel and progresses the frame pointer. 4. If we find a frame that is owned by us (TP_STATUS_USER_BUSY) we yield to poll() immediately, as the next expected status of that frame is TP_STATUS_KERNEL. 5. the ring is no longer processed until the "end" of the ring (so highest index), but instead we process at most one full ring size per run. 6. Work with a copy of `tp_status` instead of accessing original touched also by the kernel. Bug: OISF#4785.

Avoid colision of TP_STATUS_USER_BUSY with TP_STATUS_TS_RAW_HARDWARE, both were using bit 31.

Flag was always set for tpacket v2 and v3, which meant the check for it in the packet setup paths were useless.

Ticket: OISF#4796. V2 (for IDS and IPS) and V3 (for IDS) are widely supported. V2 was introduced in 2008, so we can safely assume that all systems can run V2+.

Leave no runtime checks for bypass/ebpf/xdp if not compiled in.

victorjulien · 2021-10-31T22:14:40Z

src/source-af-packet.c

+ * hasn't been released by a worker thread.
+ *
+ * We use bits 29, 30, 31. 29 and 31 are software and hardware
+ * timestamps. 30 should not be used. Combined they should never


kernel says it should never set this

#define TP_STATUS_TS_SYS_HARDWARE (1 << 30) /* deprecated, never set */

victorjulien · 2021-10-31T22:19:10Z

Flow counter commit doesn't belong here.

First afpacket commit should start with af-packet:

codecov · 2021-10-31T22:26:53Z

Codecov Report

Merging #6551 (f4cb255) into master (286c510) will decrease coverage by 0.04%.
The diff coverage is 2.42%.

@@            Coverage Diff             @@
##           master    #6551      +/-   ##
==========================================
- Coverage   77.12%   77.07%   -0.05%     
==========================================
  Files         613      613              
  Lines      186750   186714      -36     
==========================================
- Hits       144028   143909     -119     
- Misses      42722    42805      +83

Flag	Coverage Δ
fuzzcorpus	`52.99% <1.61%> (-0.12%)`	⬇️
suricata-verify	`52.08% <2.55%> (-0.03%)`	⬇️
unittests	`63.12% <0.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

suricata-qa · 2021-11-01T02:00:05Z

WARNING:

field	test	baseline	%
	tlpr1_stats_chk
.flow.wrk.flows_evicted	5304431	4693221	113.02%
.flow.mgr.rows_maxlen	115	539	21.34%

Pipeline 4676

victorjulien · 2021-11-01T07:47:51Z

Replaced by #6553

Ticket: OISF#6551

Ticket: OISF#6551 (cherry picked from commit 180459e)

victorjulien added 9 commits October 30, 2021 14:27

flow/worker: track local evictions & injected flows

bd90241

afpacket: detect threads getting stuck

4a60f80

Add tpacket v2 ring diag code that dumps the ring if we appear stuck for a long time.

af-packet: avoid flag colision with kernel

eed99f1

Avoid colision of TP_STATUS_USER_BUSY with TP_STATUS_TS_RAW_HARDWARE, both were using bit 31.

af-packet: remove zero copy flag

9b85be8

Flag was always set for tpacket v2 and v3, which meant the check for it in the packet setup paths were useless.

af-packet: remove obsolete define

924234b

af-packet: remove tpacket-v1 support

a75b66c

Ticket: OISF#4796. V2 (for IDS and IPS) and V3 (for IDS) are widely supported. V2 was introduced in 2008, so we can safely assume that all systems can run V2+.

af-packet: minor code cleanups

f235efd

af-packet: hide all ebpf/bypass logic behind guards

f4cb255

Leave no runtime checks for bypass/ebpf/xdp if not compiled in.

victorjulien commented Oct 31, 2021

View reviewed changes

victorjulien mentioned this pull request Oct 31, 2021

Afpacket softlock testing/v6 #6544

Closed

victorjulien mentioned this pull request Nov 1, 2021

Afpacket improvements/v15 #6553

Closed

victorjulien closed this Nov 1, 2021

victorjulien deleted the afpacket-improvements/v13 branch April 14, 2022 13:27

regit added a commit to regit/suricata that referenced this pull request Nov 18, 2023

detect/stream_size: fix prefiltering registration

5ca0576

Ticket: OISF#6551

victorjulien pushed a commit to victorjulien/suricata that referenced this pull request Nov 19, 2023

detect/stream_size: fix prefiltering registration

180459e

Ticket: OISF#6551

catenacyber pushed a commit to catenacyber/suricata that referenced this pull request Jan 17, 2024

detect/stream_size: fix prefiltering registration

91a91e9

Ticket: OISF#6551 (cherry picked from commit 180459e)

victorjulien pushed a commit to victorjulien/suricata that referenced this pull request Jan 18, 2024

detect/stream_size: fix prefiltering registration

e1f2555

Ticket: OISF#6551 (cherry picked from commit 180459e)

jlucovsky pushed a commit to jlucovsky/suricata that referenced this pull request Jan 20, 2024

detect/stream_size: fix prefiltering registration

6e3a993

Ticket: OISF#6551 (cherry picked from commit 180459e)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Afpacket improvements/v13 #6551

Afpacket improvements/v13 #6551

victorjulien commented Oct 31, 2021

victorjulien Oct 31, 2021

victorjulien commented Oct 31, 2021

codecov bot commented Oct 31, 2021

suricata-qa commented Nov 1, 2021

victorjulien commented Nov 1, 2021

Afpacket improvements/v13 #6551

Afpacket improvements/v13 #6551

Conversation

victorjulien commented Oct 31, 2021

victorjulien Oct 31, 2021

Choose a reason for hiding this comment

victorjulien commented Oct 31, 2021

codecov bot commented Oct 31, 2021

Codecov Report

suricata-qa commented Nov 1, 2021

victorjulien commented Nov 1, 2021