Details

Reviewers

gallatin
hiren
jch

Group Reviewers

transport

Commits

rS304218: This cleans up the timer code in TCP and also makes it so we do not

Summary

The TCP timer code as been fraught with churn in the effort to fix its
racy use of timers. We have moved to ASYNC drain which cleans up a lot
of this but still have some hanging flags and such around that should be cleaned
up. Also the use of two locks at once in the timer code is problematic since the
timer code only really assures that one lock can be dealt with properly in the
drain functions. Therefore we will move to a "switch" locks type method so that
a future patch can get rid of the async drain all the way and move the lock
under the callout system (where it belongs).

Test Plan

Beat the heck out of it on a NF workload and hopefully get Versign to give it a go as well.

Note I do not intend for this to go into 11 but to wait until after head forks at some point unless
I hear a lot of pushback that we need it sooner

Diff Detail

Repository

rS FreeBSD src repository - subversion

Lint

Lint Not Applicable

Unit

Tests Not Applicable

Event Timeline

rrs updated this revision to Diff 18186.Jul 6 2016, 11:12 AM

rrs retitled this revision from to TCP Timer cleanup.

rrs updated this object.

rrs edited the test plan for this revision. (Show Details)

rrs added a reviewer: transport.

jch added a reviewer: jch.Jul 6 2016, 11:26 AM

gallatin added a reviewer: gallatin.Jul 13 2016, 8:36 PM

In addition to helping the callout correctness, this seems to improve lock contention on tcbinfo, since this lock is now taken only when needed, not when entering every routine (thereby blocking writers, or blocking when a writer has the lock).

I've been running this over a day with ~100K connections at ~80Gb/s at Netflix (in a FreeBSD-11 context) with no issues. I've also tested it on a WITNESS + INVARIANTS kernel, and saw no LORs and no kassert failures (at 5Gb/s :)

This revision is now accepted and ready to land.Jul 13 2016, 8:41 PM

Fix it so we don't leak inp's.. the tcp_close() can return NULL in tp, this
would mean we would not do the reference count release. Instead use
the inp.

Also change the name to reflect what is actually being done (advice from Michael Tuexen :-D)

This revision now requires review to proceed.Jul 19 2016, 9:16 AM

Turns out when tp is returned NULL by the drop/close that means
it also nicely released the INP_WLOCK. We need to re-acqurire the lock
in that case so we can drop our inp reference.

I really need to learn how to type :-)

minor style 9 nit

gallatin accepted this revision.Jul 26 2016, 3:32 PM

gallatin edited edge metadata.

This revision is now accepted and ready to land.Jul 26 2016, 3:32 PM

Been using a slight variation of this change without any problems.

Closed by commit rS304218: This cleans up the timer code in TCP and also makes it so we do not (authored by rrs). · Explain WhyAug 16 2016, 12:41 PM

This revision was automatically updated to reflect the committed changes.

rrs mentioned this in rS304218: This cleans up the timer code in TCP and also makes it so we do not.

TCP Timer cleanup
ClosedPublic
Actions

Details

Diff Detail

Event Timeline

Revision Contents
Changeset List

Diff 19342

head/sys/netinet/tcp_timer.h

head/sys/netinet/tcp_timer.c

TCP Timer cleanupClosedPublicActions

Details

Diff Detail

Event Timeline

Revision ContentsChangeset List

Diff 19342

head/sys/netinet/tcp_timer.h

head/sys/netinet/tcp_timer.c

TCP Timer cleanup
ClosedPublic
Actions

Revision Contents
Changeset List