Adding a check of the MBZ (Must Be Zero) fields in the
incoming tx and rx completion descriptors in order to
identify corrupted descriptors.
Approved by: cperciva
MFC after: 2 weeks
Sponsored by: Amazon, Inc.
Differential D46337
ena: Add completion descriptor corruption check osamaabb_amazon.com on Aug 20 2024, 8:45 AM. Authored by Tags None Referenced Files
Subscribers
Details
Adding a check of the MBZ (Must Be Zero) fields in the Approved by: cperciva
Diff Detail
Event TimelineComment Actions Is returning an error the right response here? My initial reaction is that this should be a kernel panic, but maybe it's easier to track down such faults if the system keeps running? Comment Actions I feel like a kernel panic is a bit of an overkill here, for these cases we currently reset the driver with ENA_REGS_RESET_RX_DESCRIPTOR_MALFORMED reset reason Comment Actions Ok, your call. I think there's tension here between availability and integrity -- my general position is "if we detected that one thing is corrupted, who knows how much undetected corruption has happened, so we should panic and reboot into a clean state rather than trusting anything about the currently running system". But obviously it's a tradeoff and you know the context of the system better than I do. |