Fix object locking races in swapoff.
ClosedPublic
Actions

Authored by markj on Feb 13 2020, 5:23 PM.

Details

Reviewers

alc
dougm
kib
jeff

Commits

rS358024: Fix object locking races in swapoff(2).

Summary

swap_pager_swapoff_object()'s goal is to allocate pages for all valid
swap blocks belonging to the object, for which there is no resident
page. If the page corresponding to a block is already resident and
valid, the block can simply be discarded.

The existing implementation tries to minimize the number of I/Os used.
For each cluster of swap blocks, it finds maximal runs of valid swap
blocks not resident in memory, and valid resident pages. During this
processing, the object lock may be dropped in several places: when
calling getpages, or when blocking on a busy page in
vm_page_grab_pages(). While the lock is dropped, another thread may
free swap blocks, causing getpages to page in stale data.

Fix the problem following a suggestion from Jeff: use getpages'
readahead capability to perform clustering rather than doing it
ourselves. The simplies the code a bit without reintroducing the old
behaviour of performing one I/O per page.

Diff Detail

Lint

Lint Passed

Unit

No Test Coverage

Build Status

Buildable 29387
Build 27278: arc lint + arc unit

Event Timeline

markj created this revision.Feb 13 2020, 5:23 PM

Harbormaster completed remote builds in B29350: Diff 68256.Feb 13 2020, 5:23 PM

markj added reviewers: alc, dougm, kib, jeff.Feb 13 2020, 5:26 PM

jeff added inline comments.Feb 13 2020, 10:56 PM

sys/vm/swap_pager.c
1780–1803	vm_page_grab_valid() supports a prefetch count that is validated with has_pages. You could likely use it here and simplify this block. You also can test the valid bit with only the object lock and optimize this further by looping with vm_page_next(). I would do a lookup and iteration with vm_page_lookup() and vm_page_next() and only use vm_page_grab_valid() when no page was present. If you don't do this I will do so to fix my kstack problem. I would drop a comment documenting that the valid test depends on the object lock as I have done elsewhere. You will have to add SLEEPFAIL support to grab_valid but that is trivial. I just didn't have a use case before.

markj added inline comments.Feb 14 2020, 3:53 PM

sys/vm/swap_pager.c
1780–1803	After thinking it over some more, I'm not sure that we can avoid busying valid pages here. I think that is the only way we synchronize with a concurrent putpages. Once putpages has allocated swap blocks and submitted I/O to a device, swapoff should wait for that to finish before proceeding. GEOM does some tracking of in-flight requests and should wait for them to drain before destroying a swap device, so that may be sufficient after all, but I need to spend some time to verify that.