Page MenuHomeFreeBSD

if_epair: fix race condition on multi-core systems
ClosedPublic

Authored by grembo on Mar 15 2022, 5:48 PM.
Tags
None
Referenced Files
Unknown Object (File)
Fri, Jan 24, 5:27 PM
Unknown Object (File)
Fri, Jan 24, 4:48 PM
Unknown Object (File)
Fri, Jan 24, 2:11 AM
Unknown Object (File)
Sat, Jan 18, 9:16 PM
Unknown Object (File)
Thu, Jan 9, 6:39 PM
Unknown Object (File)
Thu, Jan 9, 5:33 PM
Unknown Object (File)
Mon, Jan 6, 6:00 AM
Unknown Object (File)
Mon, Jan 6, 6:00 AM

Details

Summary

As an unwanted side effect of the performance improvements in
24f0bfbad57b9, epair interfaces stop forwarding traffic on higher
load levels when running on multi-core systems.

This happens due to a race condition in the logic that decides when to
place work in the task queue(s) responsible for processing the content
of ring buffers.

In order to fix this, a field named state is added to the epair_queue
structure. This field is used by the affected functions to signal each
other that something happened in the underlying ring buffers that might
require work to be scheduled in task queue(s), replacing the existing
logic, which relies on checking if ring buffers are empty or not.

epair_menq() does:
  - set BIT_MBUF_QUEUED
  - queue mbuf
  - if testandset BIT_QUEUE_TASK:
      enqueue task

epair_tx_start_deferred() does:
  - swap ring buffers
  - process mbufs
  - clear BIT_QUEUE_TASK
  - if testandclear BIT_MBUF_QUEUED
      enqueue task

As 13.1-PRERELEASE is affected, MFCing is imperative.

PR: 262571
Reported by: Johan Hendriks <joh.hendriks@gmail.com>
MFC after: 3 days

Test Plan

Diff Detail

Repository
rS FreeBSD src repository - subversion
Lint
Lint Passed
Unit
No Test Coverage
Build Status
Buildable 44783
Build 41671: arc lint + arc unit

Event Timeline

This looks good to me. I'll give bz@ another day to comment and then I'll commit.

I've run my previous benchmarks, and I'm not seeing major performance effects of this patch.

This revision was not accepted when it landed; it landed in state Needs Review.Mar 16 2022, 11:30 PM
This revision was automatically updated to reflect the committed changes.