Add TCP out of order or duplicate segments sampler via BPF #255

rittme · 2021-11-22T06:56:41Z

Problem

Add two new TCP probes:

A probe to measure duplicate TCP segments. This allows us to detect when ACKs are not being received.
A probe to measure out-of-order TCP segments. Out-of-order segments cause retransmissions and can affect latency.

Solution

Added two BPF kernel probes to count duplicate and out-of-order segment events.

Result

Two new metrics introduced to TCP sampler:

tcp/receive/duplicate
tcp/receive/out_of_order

WUMUXIAN · 2021-11-23T09:05:00Z

src/samplers/tcp/stat.rs

+            binary_path: None,
+            sub_system: None,
+        };
+        let tcp_ooo_probe = Probe {


This probe is identical to the one above. We can introduce a single probe called tcp_validate_incoming_probe and attach it to both stats:

Self::Duplicate => vec![tcp_validate_incoming_probe],
Self::OutOfOrder => vec![tcp_validate_incoming_probe],

Cool, I was wondering if I could do that :)

WUMUXIAN · 2021-11-23T09:38:48Z

src/samplers/tcp/bpf.c

+
+    // Segment sequence before the expected one
+    // which means this was a duplicated segment
+    if ((end_seq - tp->rcv_wup) < 0) {


I feel it is more straightforward to compare the starting sequence number, TCP_SKB_CB(skb)->seq, with the expected sequence number, tp->rcv_nxt.

If TCP_SKB_CB(skb)->seq is smaller than tp->rcv_nxt, the segment is confirmed to be a duplicate.

rcv_wup records the value of rcv_nxt at the time the last window update was sent, so comparing it to the end_seq of this segment does not seem correct to me.
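A minimal sketch of the comparison suggested here, written as plain userspace C rather than as the BPF program from this PR. The helper `seq_before` is modeled on the kernel's `before()` helper, which reinterprets the 32-bit difference as signed so that the comparison survives sequence-number wraparound. The names `seq_before`, `is_duplicate`, and `duplicate_demo`, as well as the sample values, are hypothetical and chosen only for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Wraparound-safe "a is before b", modeled on the kernel's before() helper:
 * the unsigned difference is reinterpreted as a signed 32-bit value. */
static bool seq_before(uint32_t a, uint32_t b)
{
    return (int32_t)(a - b) < 0;
}

/* Hypothetical helper: a segment whose starting sequence number is before
 * the next expected byte (rcv_nxt) carries data that was already received. */
static bool is_duplicate(uint32_t seq, uint32_t rcv_nxt)
{
    return seq_before(seq, rcv_nxt);
}

/* Illustrative checks with made-up sequence numbers. */
void duplicate_demo(void)
{
    assert(is_duplicate(1000, 2000));          /* starts before rcv_nxt */
    assert(!is_duplicate(2000, 2000));         /* exactly the expected segment */
    assert(!is_duplicate(3000, 2000));         /* ahead of rcv_nxt, not a duplicate */
    assert(is_duplicate(0xFFFFFF00u, 0x10u));  /* still "before" across wraparound */
}
```

The signed cast matters: subtracting two `uint32_t` values yields an unsigned result, so a bare `(a - b) < 0` test would never be true.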

I'm basing this on the function the kernel uses to validate segment sequence numbers:
https://elixir.bootlin.com/linux/v4.2/source/net/ipv4/tcp_input.c#L3902

Its comment states:

Also, controls (RST is main one) are accepted using RCV.WUP instead of RCV.NXT. Peer still did not advance his SND.UNA when we delayed ACK, so that hisSND.UNA<=ourRCV.WUP.

So for validation it seems better to use RCV.WUP. But maybe for probing duplicates we don't care about this and should use RCV.NXT anyway?

I think what

!before(end_seq, tp->rcv_wup) && !after(seq, tp->rcv_nxt + tcp_receive_window(tp))

ultimately checks is whether the sender's segment [seq, end_seq] overlaps the expected window at all.
The first condition tests whether end_seq falls before the rcv_nxt that was last sent; the second tests whether the starting seq falls after the largest acceptable sequence number (rcv_nxt + window size).

We simply want to check whether the segment is a duplicate, and checking end_seq - tp->rcv_wup does not help with that. For example, when (end_seq - tp->rcv_wup) < 0 we can confirm that it is a duplicate, but when (end_seq - tp->rcv_wup) > 0 it can still be a duplicate, as long as seq < tp->rcv_nxt.

So to check for duplicates, we should be comparing seq and tp->rcv_nxt. What do you think?

Yes, this makes sense. I made the changes to use RCV.NXT instead.
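To make the argument in this thread concrete, here is a small sketch in userspace C (not the BPF code from this PR) with made-up numbers. `in_window` reproduces the two-part acceptability check quoted above. `struct rx_state`, its field `window`, and the sample values are assumptions for illustration only. The sketch shows a segment for which the `end_seq` versus `rcv_wup` test stays silent, while comparing `seq` with `rcv_nxt` still flags it as a duplicate.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Wraparound-safe comparisons, modeled on the kernel's before()/after(). */
static bool seq_before(uint32_t a, uint32_t b) { return (int32_t)(a - b) < 0; }
static bool seq_after(uint32_t a, uint32_t b)  { return seq_before(b, a); }

/* Hypothetical stand-in for the receiver fields discussed in the thread. */
struct rx_state {
    uint32_t rcv_wup;  /* rcv_nxt at the time of the last window update */
    uint32_t rcv_nxt;  /* next sequence number expected */
    uint32_t window;   /* current receive window size (assumed value) */
};

/* Acceptability check: does [seq, end_seq] overlap the window at all? */
static bool in_window(const struct rx_state *rx, uint32_t seq, uint32_t end_seq)
{
    return !seq_before(end_seq, rx->rcv_wup) &&
           !seq_after(seq, rx->rcv_nxt + rx->window);
}

void overlap_demo(void)
{
    struct rx_state rx = { .rcv_wup = 1000, .rcv_nxt = 3000, .window = 4000 };
    uint32_t seq = 1500, end_seq = 2500;  /* lies entirely below rcv_nxt */

    assert(in_window(&rx, seq, end_seq));      /* passes the window check */
    assert(!seq_before(end_seq, rx.rcv_wup));  /* rcv_wup test does not fire */
    assert(seq_before(seq, rx.rcv_nxt));       /* rcv_nxt test detects the duplicate */
}
```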

WUMUXIAN · 2021-11-24T06:28:33Z

src/samplers/tcp/bpf.c

+    // Segment sequence after the expected receive window
+    // which means this segment was received out of order
+    u32 window_end = tp->rcv_nxt + tcp_receive_window(tp);
+    if ((window_end - seq) < 0) {


Similarly here, I think we can simply check whether seq > tp->rcv_nxt to see if the segment arrived out of order.
window_end < seq only tells us that the current segment lies entirely beyond the window; it cannot catch a segment that is ahead of rcv_nxt but still overlaps the window.
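The difference between the two conditions can be sketched as follows, in userspace C with hypothetical names (`beyond_window`, `is_out_of_order`, `out_of_order_demo`) and made-up numbers, not the probe code itself. One detail worth noting: if `window_end` and `seq` are both `u32`, the expression `(window_end - seq) < 0` is evaluated in unsigned arithmetic and can never be true, so the sketch casts the difference to a signed type, as the kernel's `before()` and `after()` helpers do.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Wraparound-safe "a is after b", modeled on the kernel's after() helper. */
static bool seq_after(uint32_t a, uint32_t b) { return (int32_t)(b - a) < 0; }

/* Condition from the original diff: the segment starts beyond the window. */
static bool beyond_window(uint32_t seq, uint32_t rcv_nxt, uint32_t window)
{
    return seq_after(seq, rcv_nxt + window);
}

/* Suggested condition: the segment starts after the next expected byte,
 * leaving a gap that an earlier segment still has to fill. */
static bool is_out_of_order(uint32_t seq, uint32_t rcv_nxt)
{
    return seq_after(seq, rcv_nxt);
}

void out_of_order_demo(void)
{
    uint32_t rcv_nxt = 3000, window = 4000;

    /* Inside the window but ahead of rcv_nxt: only the second check fires. */
    assert(!beyond_window(5000, rcv_nxt, window));
    assert(is_out_of_order(5000, rcv_nxt));

    /* In-order segment: the out-of-order check stays quiet. */
    assert(!is_out_of_order(3000, rcv_nxt));

    /* Plain unsigned subtraction wraps to a large positive value
     * instead of going negative. */
    uint32_t window_end = rcv_nxt + window;
    assert((uint32_t)(window_end - 8000u) > 0);
}
```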

brayniac · 2021-11-24T22:26:43Z

This LGTM from a Rust perspective. I think it can be marked as ready for review at this point?

@WUMUXIAN - please give a shipit for the BCC portion when you're happy with that and I'll give it another look before merging.

WUMUXIAN

Looks good to me. I think we can take it out of draft status and get it merged.

rittme · 2021-11-30T02:38:33Z

@brayniac feel free to merge and close this PR when possible. Thanks!

rittme added 5 commits November 22, 2021 13:27

Add TCP out of order or duplicate segments sampler via BPF

5ee3872

Merge branch 'master' into tcp_ooo_duplicate

964eda6

Add docs

3d26493

rename BPF map for duplicate

b0f61e9

fix format

7460390

WUMUXIAN reviewed Nov 23, 2021

use one probe for both stats

a0a16b5

WUMUXIAN reviewed Nov 24, 2021

use RCV.NXT instead of window

df0df93

WUMUXIAN approved these changes Nov 25, 2021

rittme marked this pull request as ready for review November 25, 2021 02:19

brayniac merged commit cb92b00 into twitter:master Nov 30, 2021
