This ensures that updates to td_oncpu are synchronized by the thread
lock, as they were before r355784, and as updates to td_state are.
Details
I was running procstat -kka in a loop concurrently with a buildkernel loop. Without the patch, stack_save_td_running() generates an unclaimed NMI quite quickly. I will ask Peter to test as well.
Event Timeline
In fact, I remembered that there is at least one place that is affected.
Please look at x86/mp_x86.c:smp_after_idle_runnable(). The change might cause the code to decide that an AP's initial boot stack is unused while cpu_throw() has not yet switched to the idle thread's stack.
Instead of depending on scheduler internals, what do you think of calling smp_rendezvous()? Once it returns, we know that all APs have enabled interrupts, which they can only do after cpu_throw() switches to the idle thread and fork_exit() reenables interrupts.
I propose D23276. I do not like using rendezvous so early, especially when the target CPU might not yet be fully initialized. The wait for curthread does not depend on the scheduler at all.
sys/kern/sched_ule.c:2147 ↗ (On Diff #67104)
Do we want to also clear this under the scheduler lock?
sys/kern/sched_ule.c:2147 ↗ (On Diff #67104)
At this point td is still protected by blocked_lock, so I wouldn't think it's necessary. The locking annotation for td_oncpu just says that it is protected by the thread lock.
I reverted this change. There is a race triggered by sched_bind():
CPU 1:
td (curthread) binds to CPU 2. In sched_switch(), unlock CPU 1's thread queue, move td to CPU 2's runqueue, and relock CPU 1's thread queue to choose a new thread. td's lock is the blocked lock and will be changed to the CPU's tdq lock after the switch.
CPU 2:
The idle thread sees load on the CPU's run queue. It calls sched_switch(), locks CPU 2's runqueue, removes td, and prepares to switch to it. It sets td->td_oncpu = 2 and drops the thread queue lock.
CPU 1:
Set td->td_oncpu = NOCPU and call cpu_switch(). CPU 2's cpu_switch() won't actually switch to td until td is unblocked by CPU 1.
So now that we set td_oncpu before switching, we are left running with
td_oncpu == NOCPU. (I reverted the change this morning.) I guess this
means that td_oncpu has to be set only after returning from
cpu_switch(). In particular, td_oncpu is no longer protected by the
thread lock. I will fix some code in stack_machdep.c that was found to
depend on this synchronization, and will spend time auditing the rest
of the td_oncpu accesses.