Right now, vm_page_dequeue() may be called with or without the page lock
held. If the lock is not held, nothing prevents a different thread from
enqueuing the page before vm_page_dequeue() returns. In practice, this
does not happen because vm_page_dequeue() is called either with the lock
held or from the page allocation functions (so other threads cannot be
manipulating the page).
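
For context, the relevant shape of the current code is roughly the
following (a simplified sketch, not the actual source; the queue lookup
and locking helpers are used loosely here and the removal details are
elided):

	void
	vm_page_dequeue(vm_page_t m)
	{
		struct vm_pagequeue *pq;

		if (m->queue == PQ_NONE)
			return;	/* may race with a pending dequeue */
		pq = vm_page_pagequeue(m);
		vm_pagequeue_lock(pq);
		/* ... remove m from pq and clear its queue state ... */
		vm_pagequeue_unlock(pq);
	}
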
Suppose m is wired and on a page queue. Suppose the page daemon visits
m and schedules a deferred dequeue. This sets PGA_DEQUEUE and creates a
batch queue entry for the page. A bit later, the batch queue is
processed with only the page queue lock held. This processing will set
m->queue = PQ_NONE, issue a release fence, and clear PGA_DEQUEUE.
Suppose that a thread concurrently unwires the page. It will call
vm_page_dequeue(), which returns if it observes m->queue == PQ_NONE.
Thus, vm_page_dequeue() may return while PGA_DEQUEUE is still set, and
in particular, the thread processing the batch queue will clear all
queue state flags at some point after vm_page_dequeue() returns.
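
To illustrate, the interleaving looks roughly like this (a sketch that
mirrors the description above, not the actual source):

	/* Thread A: processes the batch queue, page queue lock held. */
	m->queue = PQ_NONE;
	atomic_thread_fence_rel();
	vm_page_aflag_clear(m, PGA_DEQUEUE);

	/*
	 * Thread B: unwires the page and may run between the first and
	 * last steps above.  It observes m->queue == PQ_NONE and returns
	 * from vm_page_dequeue() while PGA_DEQUEUE is still set.
	 */
	vm_page_dequeue(m);
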
To fix this, make vm_page_dequeue() return only once the concurrent
dequeue has completed. The concurrent dequeue will happen in a critical
section, so we should end up looping for only a short period.
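
One way to express the wait is sketched below, assuming PGA_DEQUEUE is
the only indicator of a pending deferred dequeue (the actual change may
structure this differently):

	void
	vm_page_dequeue(vm_page_t m)
	{
		struct vm_pagequeue *pq;

		for (;;) {
			if (m->queue == PQ_NONE) {
				/*
				 * A deferred dequeue may still be in
				 * flight; it runs in a critical section,
				 * so this loop should be short.
				 */
				if ((m->aflags & PGA_DEQUEUE) == 0)
					return;
				cpu_spinwait();
				continue;
			}
			pq = vm_page_pagequeue(m);
			vm_pagequeue_lock(pq);
			/*
			 * A complete version would also recheck that pq
			 * still matches m->queue.
			 */
			if (m->queue != PQ_NONE) {
				/* ... remove m from its page queue ... */
				vm_pagequeue_unlock(pq);
				return;
			}
			vm_pagequeue_unlock(pq);
		}
	}
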
While here, do some related cleanup: inline vm_page_dequeue_locked()
into its only caller and delete a prototype for the unimplemented
vm_page_requeue_locked(). I added a volatile qualifier to "queue" in
r333703; drop it and instead use atomic_load_*() only where it is
specifically needed.
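
As an illustration of the last point, an unlocked reader that wants a
snapshot of the queue index can make the load explicit (hypothetical
example, not taken from the patch):

	uint8_t queue;

	/* Explicit atomic load instead of a volatile-qualified field. */
	queue = atomic_load_8(&m->queue);
	if (queue == PQ_NONE)
		return;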