Undo the increase in sequence number by 1 due to the FIN flag in case of a transient error.
Closed · Public

Authored by rscheff on Jul 1 2015, 5:46 PM.

Details

Reviewers

jch
lstewart
gnn
rrs
tuexen
glebius
hiren

Group Reviewers

transport

Commits

rGd730ffcd6ad3: tcp: Undo the consumption of sequence space by FIN in case of a transient error.
rG66605ff791b1: tcp: Undo the increase in sequence number by 1 due to the FIN flag in case of a…

Summary

If an error occurs while processing a TCP segment with some data and the FIN
flag, the back out of the sequence number advance does not take into account the
increase by 1 due to the FIN flag.
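
To make the summary concrete, here is a minimal sketch of the back-out it describes. It is a user-space model, not the kernel code: `struct mini_tcpcb`, `advance_for_segment()` and `rollback_after_error()` are hypothetical names introduced for illustration, and only `snd_nxt` and the FIN flag bit are modeled.

```c
#include <stdint.h>

#define TH_FIN 0x01	/* FIN bit in the TCP header flags */

/* Hypothetical, stripped-down stand-in for the kernel's struct tcpcb. */
struct mini_tcpcb {
	uint32_t snd_nxt;	/* next sequence number to send */
};

/* Advance snd_nxt the way a send of len data bytes (plus optional FIN) would. */
static void
advance_for_segment(struct mini_tcpcb *tp, uint32_t len, int flags)
{
	tp->snd_nxt += len;
	if (flags & TH_FIN)
		tp->snd_nxt++;	/* FIN consumes one sequence number */
}

/* Back out the advance after a transient send error. */
static void
rollback_after_error(struct mini_tcpcb *tp, uint32_t len, int flags)
{
	tp->snd_nxt -= len;
	if (flags & TH_FIN)
		tp->snd_nxt--;	/* the step the summary says was missing */
}
```

With the extra decrement, a failed send of 100 bytes plus FIN starting at sequence number 1000 leaves `snd_nxt` at 1000 again instead of 1001.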

Diff Detail

Repository

rG FreeBSD src repository

Lint

Lint Not Applicable

Unit

Tests Not Applicable

Event Timeline

hiren updated this revision to Diff 6627. · Jul 1 2015, 5:46 PM

hiren retitled this revision to "Undo the increase in sequence number by 1 due to the FIN flag in case of a transient error."

hiren updated this object.

hiren edited the test plan for this revision.

hiren added reviewers: lstewart, jch.

hiren added a subscriber: network.

Herald added a subscriber: imp. · Jul 1 2015, 5:46 PM

The original post, https://lists.freebsd.org/pipermail/freebsd-net/2015-June/042493.html, contains two patches. I prefer the second one because it deals with the FIN case specifically. If anyone feels otherwise, please comment.

hiren added a reviewer: gnn. · Jul 1 2015, 5:57 PM

hiren added a subscriber: fabient.

Can someone please look at this?

gnn accepted this revision. · Jul 8 2015, 12:07 AM

gnn edited edge metadata.

This revision is now accepted and ready to land. · Jul 8 2015, 12:07 AM

Thanks George.

I'll commit this next Friday (07/17) if I don't get any other feedback.

This change looks good to me.

As a side note, I really dislike the conflation of logical sequence space and data accounting used in many places in our stack. It's something that's fairly straightforward to address and I have some proof of concept patches I did a while ago which we should dust off at some point.

In D2970#71239, @lstewart wrote:

As a side note, I really dislike the conflation of logical sequence space and data accounting used in many places in our stack. It's something that's fairly straightforward to address and I have some proof of concept patches I did a while ago which we should dust off at some point.

Yes, we should. Let me know if/when you need help with testing/reviewing. I'll be happy to do it.

Are you okay with the patch proposed here?

This change seems inadequate given that we would have set TF_SENTFIN and updated snd_max. I haven't followed through all the implications of not reverting those changes, but if we're going to attempt a state rollback we'd better make sure we get it right. I'm also a bit unclear on some details in the original report given that an RTO would reset snd_nxt to snd_una and get us out of any permanent pickle. I'm not a fan of rollbacks in general as they're fragile. What's the use case where a rollback here matters?
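
The concern above can be illustrated with a small user-space model. This is only a sketch under stated assumptions: `struct mini_tcpcb`, `send_attempt()`, `rollback_snd_nxt_only()` and `rto_fires()` are hypothetical names, the flag value is illustrative, and the real send path does considerably more. It shows that backing out `snd_nxt` alone leaves `TF_SENTFIN` set and `snd_max` advanced, and that a retransmission timeout would move `snd_nxt` back to `snd_una`.

```c
#include <stdint.h>

#define TH_FIN		0x01	/* FIN bit in the TCP header flags */
#define TF_SENTFIN	0x10	/* "FIN has been sent" flag; value is illustrative */

/* Hypothetical, stripped-down stand-in for the kernel's struct tcpcb. */
struct mini_tcpcb {
	uint32_t snd_una;	/* oldest unacknowledged sequence number */
	uint32_t snd_nxt;	/* next sequence number to send */
	uint32_t snd_max;	/* highest sequence number sent so far */
	unsigned t_flags;
};

/* Model the state updates made before the segment is handed to the IP layer. */
static void
send_attempt(struct mini_tcpcb *tp, uint32_t len, int flags)
{
	tp->snd_nxt += len;
	if (flags & TH_FIN) {
		tp->snd_nxt++;
		tp->t_flags |= TF_SENTFIN;
	}
	/* Wraparound-safe "snd_nxt > snd_max" comparison. */
	if ((int32_t)(tp->snd_nxt - tp->snd_max) > 0)
		tp->snd_max = tp->snd_nxt;
}

/* Back out snd_nxt only; snd_max and TF_SENTFIN are deliberately left alone. */
static void
rollback_snd_nxt_only(struct mini_tcpcb *tp, uint32_t len, int flags)
{
	tp->snd_nxt -= len;
	if (flags & TH_FIN)
		tp->snd_nxt--;
}

/* Model the part of RTO handling mentioned in the comment. */
static void
rto_fires(struct mini_tcpcb *tp)
{
	tp->snd_nxt = tp->snd_una;
}
```

In this model, after one failed FIN+DATA attempt and the partial rollback, `snd_nxt` is back where it started while `snd_max` and `t_flags` still reflect the attempt, which is the residual state the comment asks about.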

In D2970#71255, @lstewart wrote:

This change seems inadequate given that we would have set TF_SENTFIN and updated snd_max. I haven't followed through all the implications of not reverting those changes, but if we're going to attempt a state rollback we'd better make sure we get it right. I'm also a bit unclear on some details in the original report given that an RTO would reset snd_nxt to snd_una and get us out of any permanent pickle. I'm not a fan of rollbacks in general as they're fragile. What's the use case where a rollback here matters?

Another proposed solution is to treat the error as if the packet had been lost (that is, ignore the error). I also think that a rollback is difficult to maintain all over the code. Regarding the problem we faced: a FIN+DATA segment was blocked by the local stack. In that case you lose 1 byte of the data each time the send is refused. The other effect is that it creates a packet storm (if some process keeps requesting the missing data).
This fix has been in testing for a month, and we have not seen the problem again.

Are there any more comments on how to fix it?
lstewart@, do you prefer no rollback in case of error (which would behave the same as packet loss)?
Either solution is better than keeping the code as it is.
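
The "1 byte lost per refused send" effect described above can be reproduced in a toy model. This is a sketch only: `bytes_skipped()` is a hypothetical helper, the starting sequence number is arbitrary, and each attempt is assumed to carry all remaining data plus FIN.

```c
#include <stdint.h>

/*
 * Return how far snd_nxt has drifted past the first unsent byte after a
 * number of refused FIN+DATA sends.  undo_fin selects whether the rollback
 * also undoes the sequence number consumed by the FIN.
 */
static uint32_t
bytes_skipped(uint32_t data_len, int failed_attempts, int undo_fin)
{
	const uint32_t start = 1000;		/* arbitrary initial snd_nxt */
	const uint32_t data_end = start + data_len;
	uint32_t snd_nxt = start;

	for (int i = 0; i < failed_attempts; i++) {
		/* Remaining data as seen from the (possibly shifted) left edge. */
		uint32_t len = (snd_nxt < data_end) ? data_end - snd_nxt : 0;

		snd_nxt += len + 1;		/* data plus FIN */
		/* ... the send is refused with a transient error ... */
		snd_nxt -= len;			/* back out the data */
		if (undo_fin)
			snd_nxt--;		/* back out the FIN as well */
	}
	return (snd_nxt - start);
}
```

With `undo_fin` set to 0 the function returns one skipped byte per failed attempt; with it set to 1 the left edge does not move.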

chris_cretaforce.gr added a subscriber: chris_cretaforce.gr. · Jun 29 2022, 9:28 PM

Herald added a reviewer: transport. · Jun 29 2022, 9:28 PM

Herald added subscribers: glebius, melifaro.

This revision now requires review to proceed. · Jun 29 2022, 9:28 PM

rscheff mentioned this in D35446: tcp: Check if we exceed socket send buffer instead of snd_fack. · Jun 30 2022, 6:31 AM

zlei added a subscriber: zlei. · Jun 30 2022, 10:02 AM

Taking over this Diff, as discussed in the last transport call, before rebasing and updating it.

assert that SACK rxmit does not send FIN bit
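
A rough sketch of what such an assertion could look like in an error back-out path, written as a user-space model. All names here (`struct mini_hole`, `struct mini_tcpcb`, `undo_send()`) are hypothetical, and plain `assert()` stands in for whatever kernel assertion the actual diff uses.

```c
#include <assert.h>
#include <stdint.h>

#define TH_FIN 0x01	/* FIN bit in the TCP header flags */

/* Hypothetical stand-in for a SACK hole being retransmitted. */
struct mini_hole {
	uint32_t rxmit;		/* next sequence number to retransmit in the hole */
};

/* Hypothetical, stripped-down stand-in for the kernel's struct tcpcb. */
struct mini_tcpcb {
	uint32_t snd_nxt;		/* next sequence number to send */
	int32_t sack_bytes_rexmit;	/* bytes retransmitted via SACK */
};

/* Undo the accounting for a segment whose send failed transiently. */
static void
undo_send(struct mini_tcpcb *tp, struct mini_hole *p, int sack_rxmit,
    uint32_t len, int flags)
{
	if (sack_rxmit) {
		p->rxmit -= len;
		tp->sack_bytes_rexmit -= (int32_t)len;
		assert(tp->sack_bytes_rexmit >= 0);
		/* A SACK retransmission is not expected to carry FIN. */
		assert((flags & TH_FIN) == 0);
	} else {
		tp->snd_nxt -= len;
		if (flags & TH_FIN)
			tp->snd_nxt--;
	}
}
```

The assertion documents the assumption that makes the FIN adjustment unnecessary in the SACK branch: if a SACK retransmission ever did carry FIN, the model would abort instead of silently mis-accounting.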

Harbormaster completed remote builds in B46275: Diff 107803. · Jul 6 2022, 6:52 AM

grahamperrin added a subscriber: grahamperrin. · Jul 12 2022, 4:32 AM

tuexen accepted this revision. · Jul 14 2022, 3:31 PM

This revision is now accepted and ready to land. · Jul 14 2022, 3:31 PM

Closed by commit rG66605ff791b1: tcp: Undo the increase in sequence number by 1 due to the FIN flag in case of a… (authored by rscheff). · Jul 15 2022, 4:36 PM

This revision was automatically updated to reflect the committed changes.

rscheff added a commit: rG66605ff791b1: tcp: Undo the increase in sequence number by 1 due to the FIN flag in case of a….

While this issue affects data segments with FIN (the left edge of the retransmitted packet shifts to the right by 1 byte per retransmission), that appears to be a different symptom than the recent panics on SACK rescue retransmission. In the transport call we agreed to commit this against HEAD, but not (yet) MFC this patch.

rscheff added a commit: rGd730ffcd6ad3: tcp: Undo the consumption of sequence space by FIN in case of a transient error. · Jan 11 2024, 12:41 AM

Revision Contents
Changeset List

Path                        Size
sys/netinet/tcp_output.c    7 lines

Diff 108205
