Page MenuHomeFreeBSD

epoch: support non-preemptible epochs checking in_epoch()
ClosedPublic

Authored by kevans on Nov 5 2020, 4:35 AM.
Tags
None
Referenced Files
Unknown Object (File)
Wed, Dec 11, 11:04 AM
Unknown Object (File)
Sat, Dec 7, 10:56 PM
Unknown Object (File)
Nov 21 2024, 12:52 PM
Unknown Object (File)
Nov 9 2024, 9:32 PM
Unknown Object (File)
Sep 29 2024, 1:18 PM
Unknown Object (File)
Sep 28 2024, 1:20 PM
Unknown Object (File)
Sep 17 2024, 4:09 PM
Unknown Object (File)
Sep 17 2024, 11:15 AM
Subscribers

Details

Summary

Twice I've tried using epoch and managed to forget that in_epoch() only operates on preemptible epochs, so here's a first cut at rectifying this.

For default epochs, it's easy enough to verify that we're in the given epoch (AFAICT): if we're in a critical section and our record for the given epoch is active, then we're in it.

This patch also adds some additional INVARIANTS bookkeeping. Notably, we set and check the recorded thread in epoch_enter/epoch_exit to try and catch some edge-cases for the caller. It also checks upon freeing that none of the records had a thread in the the epoch, which may make it a little easier to diagnose some improper use if epoch_free() took place while some other thread was inside.

Diff Detail

Repository
rS FreeBSD src repository - subversion
Lint
Lint Not Applicable
Unit
Tests Not Applicable

Event Timeline

kevans requested review of this revision.Nov 5 2020, 4:35 AM
sys/kern/subr_epoch.c
82 ↗(On Diff #79196)

To be honest I think the comment is a bit verbose. /* Used to verify record ownership for non-preemptible epochs. */ would be sufficient IMO.

83 ↗(On Diff #79196)

Did you consider using er_tdlist instead of adding a new field?

468 ↗(On Diff #79196)

The ifdef is not needed.

897 ↗(On Diff #79196)

I don't think this printf is very useful as it is, the debugger will print the same information.

899 ↗(On Diff #79196)

Shouldn't it be MPASS(entered && er->er_td == td)?

sys/kern/subr_epoch.c
83 ↗(On Diff #79196)

I'm open to reconsidering it, but I had kinda ruled it out because it seemed a little clunky -- er_tdlist is a tailq of epoch_trackers, that we don't use for non-preemptible epochs. I hadn't really put any more thought into it then that, to be honest.

899 ↗(On Diff #79196)

I'll make it so after removing the useless printf above -- right now all exits go through here, so !entered was perfectly valid; if we're not printing anything on failure (I can't imagine what we would print) then the previous cases will just return immediately.

sys/kern/subr_epoch.c
83 ↗(On Diff #79196)

It would be somewhat nice to not change struct size depending on the kernel config.

sys/kern/subr_epoch.c
83 ↗(On Diff #79196)

Ah, I had it in my head that er_tdlist is a list of struct thread. I think it's ok then.

I was going to write the same thing about conditionally defined structure members, but in this case it's private to subr_epoch.c so not really a problem.

kevans marked 4 inline comments as done.

Addressed most of the commentary so far; er_td remains there for the time being, but notably epoch_record is internal and not part of KBI that would fluctuate with INVARIANTS.

This revision is now accepted and ready to land.Nov 5 2020, 7:16 PM
sys/kern/subr_epoch.c
869 ↗(On Diff #79224)

Is it useful to enter crit section and assert that er_td != curthread ?

sys/kern/subr_epoch.c
869 ↗(On Diff #79224)

I'm inclined to agree, yes.

sys/kern/subr_epoch.c
869 ↗(On Diff #79224)

On second thought, I guess we have to iterate over all of the cpus and check their records on this epoch to make sure that none of them have er_td == curthread since we weren't in a critical section, but that seems pretty reasonable. The current thread has reason to believe it might be in the epoch, we could catch here if something bogus happens.

Additional assert if we're not in a critical section; presumably we could have migrated, so check if any cpu has this thread erroneously recorded. We're perhaps also pretty likely to hit the epoch_exit() assertion that er->er_td == curthread or the follow-up critical_exit() assert if the caller bungled up critical section handling somewhere and migrated between enter/exit, but they could have also lucked out so it's perhaps best not to rely on that.

This revision now requires review to proceed.Nov 6 2020, 5:26 AM
This revision is now accepted and ready to land.Nov 6 2020, 1:44 PM
kib added inline comments.
sys/kern/subr_epoch.c
877 ↗(On Diff #79251)

BTW I would change MPASS to KASSERT which can provide more information in the message. For instance, the cpu index where the situation occurs is immediately useful.

sys/kern/subr_epoch.c
877 ↗(On Diff #79251)

Sure, I will do this pre-commit, maybe:

"exited critical section in epoch %s, cpu %d"