mca: accumulate statistics of MCA events and export as hw.mca.stats
Needs Review, Public

Authored by glebius on Jan 13 2022, 7:06 PM.
Tags
None
Details

Reviewers
mav

Diff Detail

Repository
rS FreeBSD src repository - subversion
Lint
Lint Passed
Unit
No Test Coverage
Build Status
Buildable 43906
Build 40794: arc lint + arc unit

Event Timeline

I doubt such statistics would be usable. The same memory error can be reported in different ways depending on how it was found (read, write, scrub, etc.). At the same time, an error reported by a cache may mean either a faulty cache or a memory error that the cache merely detected. Excessive simplification like this would tangle things even more. If you want a simple counter, I would differentiate only corrected, uncorrected recoverable, and uncorrected unrecoverable errors (the last should actually end in a panic now, but in some cases could just kill the process). If more detailed information is needed, a user-space tool like mcelog can get the raw data via the sysctl interface.
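
For illustration only, the three-way split suggested above could be sketched roughly as follows. This is not code from the diff under review. All `sk_`/`SK_` names are hypothetical; the bit positions follow the architectural MCi_STATUS layout (VAL = bit 63, UC = bit 61, PCC = bit 57). Using PCC alone to separate recoverable from unrecoverable errors is a simplifying assumption: a real implementation would likely also consult the S and AR bits where the CPU supports software error recovery, and vendor-specific details may differ.

```c
#include <stdint.h>

/* Architectural MCi_STATUS bits; local names, hypothetical for this sketch. */
#define SK_MC_STATUS_VAL  (1ULL << 63)  /* register contains a valid error */
#define SK_MC_STATUS_UC   (1ULL << 61)  /* error was not corrected */
#define SK_MC_STATUS_PCC  (1ULL << 57)  /* processor context corrupt */

enum sk_mca_class {
	SK_MCA_INVALID = 0,          /* VAL clear: nothing to count */
	SK_MCA_CORRECTED,            /* UC clear */
	SK_MCA_UC_RECOVERABLE,       /* UC set, PCC clear (simplified) */
	SK_MCA_UC_UNRECOVERABLE,     /* UC set, PCC set */
	SK_MCA_NCLASSES
};

/* One counter per class; the INVALID slot is kept only for uniform indexing. */
struct sk_mca_stats {
	uint64_t count[SK_MCA_NCLASSES];
};

/* Map a raw MCi_STATUS value onto one of the coarse classes. */
static inline enum sk_mca_class
sk_mca_classify(uint64_t status)
{
	if ((status & SK_MC_STATUS_VAL) == 0)
		return (SK_MCA_INVALID);
	if ((status & SK_MC_STATUS_UC) == 0)
		return (SK_MCA_CORRECTED);
	if ((status & SK_MC_STATUS_PCC) != 0)
		return (SK_MCA_UC_UNRECOVERABLE);
	return (SK_MCA_UC_RECOVERABLE);
}

/* Bump the counter for the class of this status value; skip invalid records. */
static inline void
sk_mca_account(struct sk_mca_stats *st, uint64_t status)
{
	enum sk_mca_class c = sk_mca_classify(status);

	if (c != SK_MCA_INVALID)
		st->count[c]++;
}
```

Counters of this shape stay independent of how the error was discovered (read, write, scrub) and of which bank reported it, which is the ambiguity raised above; anything finer-grained would still come from the raw records.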