Make sure end of receive doesn't cause interrupt starvation in iflib
ClosedPublic
Actions

Authored by • hselasky on Jan 22 2020, 2:10 PM.

Details

Reviewers

mmacy
gallatin
erj
jhb
kib
shurd
marius
np

Group Reviewers

iflib

Commits

rS358272: MFC r357799:
rS358271: MFC r357799:
rS357799: Make sure the so-called end of receive interrupts don't starve in iflib.

Summary

When the receive ring cannot be filled with mbufs, due to lack of memory, no more interrupts will be generated to fill the receive ring later on. Make sure to have a watchdog, to try refilling the receive ring from time to time, hopefully when more mbufs are available.

Affects all clients of iflib.

Diff Detail

Repository

rS FreeBSD src repository - subversion

Lint

Lint Not Applicable

Unit

Tests Not Applicable

Event Timeline

• hselasky created this revision.Jan 22 2020, 2:10 PM

Herald added a reviewer: shurd. · View Herald TranscriptJan 22 2020, 2:10 PM

Herald added a reviewer: iflib. · View Herald Transcript

Herald added subscribers: ae, imp. · View Herald Transcript

Harbormaster completed remote builds in B28843: Diff 67143.Jan 22 2020, 2:10 PM

• hselasky added a reviewer: marius.Jan 22 2020, 2:28 PM

gallatin added inline comments.Jan 22 2020, 3:41 PM

sys/net/iflib.c
2095 ↗	(On Diff #67143)	I'm confused.. What if _iflib_fl_refill() actually refills? Then we are no longer empty but say we are? Sorry if I'm missing something, I have not looked at this code in a very long time.

• hselasky added inline comments.Jan 22 2020, 3:46 PM

sys/net/iflib.c
2095 ↗	(On Diff #67143)	I'll move the in-use check after the fl_refill() as an optimisation.

Handle comments from Drew.

Harbormaster completed remote builds in B28846: Diff 67146.Jan 22 2020, 3:51 PM

• hselasky marked an inline comment as done.Jan 22 2020, 3:52 PM

gallatin accepted this revision.Jan 22 2020, 8:06 PM

This revision is now accepted and ready to land.Jan 22 2020, 8:06 PM

Testing revealed some more patches were needed.

Test OK.

This revision now requires review to proceed.Jan 22 2020, 9:39 PM

Harbormaster completed remote builds in B28862: Diff 67164.Jan 22 2020, 9:39 PM

gallatin accepted this revision.Jan 22 2020, 9:53 PM

This revision is now accepted and ready to land.Jan 22 2020, 9:53 PM

• hselasky added a reviewer: np.Jan 22 2020, 9:59 PM

As we discussed in slack, I'm not a huge fan of fixing things with a callout. If the hardware was amenable, I'd much rather leave the ring partially stocked and drop new packets until we were able to allocate mbufs again. However, based on the findings you reported with igb (it wanting a full rx ring to generate interrupts), I'm afraid that might require too much work from hardware drivers. And I'd prefer a fix to a real problem, even if its not something I personally find appealing, to leaving a real bug unfixed and having machines become unreachable.

Can you describe a bit what this is trying to address? This seems like a hopefully temporary condition that should be opted in by broken drivers, not default for every iflib driver.

I think we didn't see this at LLNW because we ran with much larger number of descriptors (4096 for em, 8192 for igb). Those should probably be set as such for appropriate hardware, we just didn't have a secret codex to make better decisions for different NIC types and would have needed some assistance from Intel to look through docs and errata to do so.

I don't think any of the past corporate sponsors of this work are stepping up so the community is going to have to here. @marius you have the most recent experience looking deeply at the e1000 code, can you share an opinion on the issue and proposal?

In D23315#511436, @kbowling wrote:

Can you describe a bit what this is trying to address? This seems like a hopefully temporary condition that should be opted in by broken drivers, not default for every iflib driver.

I am addressing an issue where filling the RX descriptors stop and so all RX packet processing, because the hardware doesn't output any more RXEOF interrupts. This happens in conjunction with temporary out of mbufs situations.

--HPS

Is there some way to make the problem easy to reproduce like reducing descriptors?

Simply force a m_getcl() failure like this:

static int counter;

counter++;
if (counter >= 10000 && counter < 40000)

break;

--HPS

In D23315#511436, @kbowling wrote:

I don't think any of the past corporate sponsors of this work are stepping up so the community is going to have to here. @marius you have the most recent experience looking deeply at the e1000 code, can you share an opinion on the issue and proposal?

Well, this is just one of many regressions that came with the conversion of the e1000 drivers to iflib and as you imply, looking up all relevant details is tedious work, if such information is publicly available at all.
Based on the - somewhat sparse - description of the problem for the IGB class, my first thought would be to use interrupt moderation to force another RX interrupt after some time, though, i. e. effectively to implement a callout in hardware. This should require only a few lines of code in if_em.c, but might need interrupt auto-clearing to be disabled so another interrupt is actually triggered when the counter reaches 0 (auto-clearing certainly is a nice feature, but hardly a loss for 1 GbE). Actually, at a quick glance the re-arming of EITR previously done in igb_msix_que() on every MSI-X interrupt was lost with the conversion to iflib, so bringing back that code and also doing it in the INTx and MSI case already might be most of what's necessary for this approach.
IIRC, >= 10 GbE Intel MACs have two kinds of interrupt moderation timers, making forcing another RX interrupt even more straight forward.

Closed by commit rS357799: Make sure the so-called end of receive interrupts don't starve in iflib. (authored by • hselasky). · Explain WhyFeb 12 2020, 8:30 AM

This revision was automatically updated to reflect the committed changes.

• hselasky added a commit: rS357799: Make sure the so-called end of receive interrupts don't starve in iflib..

• hselasky added a commit: rS358271: MFC r357799:.Feb 24 2020, 9:39 AM

Herald added a subscriber: melifaro. · View Herald TranscriptFeb 24 2020, 9:39 AM

• hselasky added a commit: rS358272: MFC r357799:.Feb 24 2020, 9:50 AM