Page MenuHomeFreeBSD

tcp: fix detection of bad RTOs
ClosedPublic

Authored by tuexen on Mar 19 2025, 10:04 PM.
Tags
None
Referenced Files
F133457251: D49414.id152467.diff
Sat, Oct 25, 10:50 PM
Unknown Object (File)
Fri, Oct 24, 12:15 AM
Unknown Object (File)
Sun, Oct 19, 1:01 AM
Unknown Object (File)
Sat, Oct 18, 2:11 AM
Unknown Object (File)
Thu, Oct 16, 5:23 PM
Unknown Object (File)
Wed, Oct 15, 9:40 PM
Unknown Object (File)
Wed, Oct 15, 3:50 AM
Unknown Object (File)
Sun, Oct 12, 6:46 PM

Details

Summary

If timestamps are enabled, the actions does by a retransmission timeout were rolled back,
when they should not. We need to make sure the incoming packet advances the SND.UNA.

To do this, remove the incorrect upfront check and extend the check in the fast path to
handle also the case of timestamps.

This addresses the recently observed panics with the message: sent too much.

Diff Detail

Repository
rG FreeBSD src repository
Lint
Lint Skipped
Unit
Tests Skipped

Event Timeline

sys/netinet/tcp_input.c
1805

Any specific reason to write the or statement as two separate if clauses?

I believe in other parts we do it ((true & x & y & z) | (false & a & b &c )) - as multiline checks;

I don't like them both get executed "always", so at the very least, an "else" just before the 2nd if would be good.

The logic looks ok to me.

sys/netinet/tcp_input.c
1805

I agree with Richard, better one complex if clause. The two clauses are mutually exclusive as one wants TOF_TS and other !TOF_TS, so cc_cong_signal() can be executed only once. My first reading was that it is intentionally written this way cause we may want two calls into cc_cong_singal(), so it was a misleading reading.

Oh, the upper level embracing if can also be merged into one big clause:

if ((tp->t_rxtshift == 1 &&
    tp->t_flags & TF_PREVVALID &&
    tp->t_badrxtwin != 0) &&
    (((to.to_flags & TOF_TS) != 0 &&
    (to.to_tsecr != 0) &&
    TSTMP_LT(to.to_tsecr, tp->t_badrxtwin)) ||
    ((to.to_flags & TOF_TS) == 0 &&
    TSTMP_LT(ticks, tp->t_badrxtwin))))
        cc_cong_signal(tp, th, CC_RTO_ERR);

This will also fix lines longer than 79 chars. IMHO, we may slightly violate style(9) and add single space indentation to highlight our three main large blocks (a && (b || c)).

Use one big condition and try to use indentation to structure the boolean expression.

tuexen added inline comments.
sys/netinet/tcp_input.c
1805

It was a try to break things up for improved readability by humans.

1805

Now using one and two spaces of indentation to make the expression more readable.

This revision is now accepted and ready to land.Mar 20 2025, 9:21 AM

This is for the fast path fix. Just want to confirm with your that a second patch may be around to improve both of the fast and slow path?

In D49414#1127111, @cc wrote:

This is for the fast path fix. Just want to confirm with your that a second patch may be around to improve both of the fast and slow path?

For the slow path no patch is needed. It already checks that SND.UNA is moved forward and only the timestamp condition is checked.

We can improve the detection of bad RTO later (for example also take D-SACKs into account), but I wanted to focus on fixing incorrect code. The code being incorrect is the one I am removing. Just removing it, would not consider the timestamp based criterium on the fast path, therefore I added it there.

This revision was automatically updated to reflect the committed changes.

thanks, can't reproduce this issue anymore after this patch

All go for 🗡 week.