Details

Reviewers

kib
mckusick
chs

Commits

rGa8c732f4e52e: VFS: add retry limit and delay for failed recursive unmounts
rGe81e71b0e9cb: Use interruptible wait for blocking recursive unmounts

Summary

A forcible unmount attempt may fail due to a transient condition, but
it may also fail due to some issue in the filesystem implementation
that will indefinitely prevent successful unmount. In such a case,
the retry logic in the recursive unmount facility will cause the
deferred unmount taskqueue to execute constantly.

Avoid this scenario by imposing a retry limit, with a default value
of 10, beyond which the recursive unmount facility will emit a log
message and give up. Additionally, introduce a grace period, with
a default value of 1s, between successive unmount retries on the
same mount. These values can be tuned through the
vfs.deferred_unmount_retries and vfs.deferred_unmount_retry_delay_hz
sysctls, respectively.
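
As an illustration only, the retry policy described above can be modelled in userland C. This is a minimal sketch, not the code in sys/kern/vfs_mount.c: the names mount_model, try_unmount and deferred_unmount_model are hypothetical, the delay is accumulated in a counter instead of being slept, and the sketch assumes that "retry limit" means the number of retries allowed after the first failed attempt.

```c
#include <stdbool.h>

/* Hypothetical userland model of one mount; not a kernel structure. */
struct mount_model {
	int retries;	/* retries performed so far */
	int fail_until;	/* model knob: attempts 1..fail_until fail */
	int attempts;	/* total unmount attempts made */
};

enum outcome { UNMOUNTED, GAVE_UP };

/* Stand-in for a forced unmount attempt; returns true on success. */
static bool
try_unmount(struct mount_model *mp)
{
	mp->attempts++;
	return (mp->attempts > mp->fail_until);
}

/*
 * Retry a failed unmount at most retry_limit times, adding delay_ticks
 * to *waited_ticks before each retry. Assumption: the limit counts
 * retries after the first attempt. The real change emits a log message
 * where this model returns GAVE_UP.
 */
static enum outcome
deferred_unmount_model(struct mount_model *mp, int retry_limit,
    int delay_ticks, long *waited_ticks)
{
	*waited_ticks = 0;
	for (;;) {
		if (try_unmount(mp))
			return (UNMOUNTED);
		if (mp->retries >= retry_limit)
			return (GAVE_UP);
		mp->retries++;
		*waited_ticks += delay_ticks;
	}
}
```

In this model a mount that never unmounts costs a bounded number of attempts (the limit plus one) rather than keeping the taskqueue busy forever, and a transient failure still succeeds once the condition clears.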

Diff Detail

Repository

rG FreeBSD src repository

Event Timeline

jah created this revision. · Aug 7 2021, 6:34 AM

Herald added a subscriber: imp. · Aug 7 2021, 6:34 AM

jah requested review of this revision. · Aug 7 2021, 6:34 AM

Harbormaster completed remote builds in B40913: Diff 93364. · Aug 7 2021, 6:34 AM

kib accepted this revision. · Aug 7 2021, 12:03 PM

kib added inline comments.

sys/kern/vfs_mount.c
106	Did you consider adding a node vfs.deferred_unmount and putting retries and delay_hz under it? It might be interesting to put the total number of failed retries for the whole system lifetime there, as well.
2090	This is arguably a separate change.

This revision is now accepted and ready to land. · Aug 7 2021, 12:03 PM

jah added inline comments. · Aug 8 2021, 2:31 AM

sys/kern/vfs_mount.c
106	I didn't consider that, but I like the idea.
2090	I probably should have used PCATCH from the beginning, but it seems even more necessary to avoid an unkillable thread now that we can abandon a recursive unmount attempt, so I decided to do it as part of this change.

Add sysctl node for managing deferred unmount behavior

This revision now requires review to proceed. · Aug 8 2021, 2:31 AM

Harbormaster completed remote builds in B40921: Diff 93391. · Aug 8 2021, 2:32 AM

kib added inline comments. · Aug 8 2021, 3:37 AM

sys/kern/vfs_mount.c
2090	I mean that this should be a separate commit.

Split PCATCH into a separate commit

Harbormaster completed remote builds in B40922: Diff 93404. · Aug 8 2021, 1:35 PM

jah added inline comments. · Aug 8 2021, 1:40 PM

sys/kern/vfs_mount.c
2090	Yes, I know. I was explaining why I didn't make it a separate commit to begin with. I've split it into a separate commit locally now, which I guess doesn't show up in Phabricator. It seemed like a good opportunity to learn how to use 'git add -i'.

Clean up error handling logic, properly release the mount on error in the blocking case

Harbormaster completed remote builds in B40956: Diff 93474. · Aug 10 2021, 6:22 AM

kib accepted this revision. · Aug 10 2021, 10:40 AM

This revision is now accepted and ready to land. · Aug 10 2021, 10:40 AM

Overall looks good. One suggestion, one possible nit.

sys/kern/vfs_mount.c
109	The tick rate can vary between machines and is generally not known. I suggest expressing this in a time unit instead; seconds would be appropriate.
1968–1984	I think that this statement should be done inside the MNT_ILOCK(mp);

kib added inline comments. · Aug 12 2021, 1:46 AM

sys/kern/vfs_mount.c
109	The 'hz' expression guarantees a 1 second timeout for taskqueue_enqueue_timeout().

jah added inline comments. · Aug 12 2021, 2:58 PM

sys/kern/vfs_mount.c
1968–1984	The deferred_unmount thread is the only thread that will update these fields, so they won't need locking or atomics. Perhaps a comment to that effect would be better instead? (In the dounmount() code below, I do check the retry count while holding a mount interlock, but only because that makes the code to continue the loop slightly cleaner. It also wouldn't be the "right" interlock for synchronization purposes, since the lower mount's interlock is held but the upper mount's retry count is being checked.)

A couple more comments.

sys/kern/vfs_mount.c
109	This is a user-settable variable. If I want to change the default from one second to two seconds, I need to know the hz value to do so. The variable should be in seconds, and it should be multiplied by hz where it is used in deferred_unmount_enqueue (where timeout_ticks should be called just timeout or perhaps timeout_seconds).
1968–1984	I concur with your argument, though rather than adding an explanation of why it does not need to be under the lock, it might be simpler to just move it up one line so that it is under the lock (i.e., there is no extra cost since you already take/free the lock).

jah added inline comments. · Aug 13 2021, 8:55 PM

sys/kern/vfs_mount.c
109	IMO making the variable an integer number of seconds would be too coarse. I could express the timeout in milliseconds and convert to hz, or instead use taskqueue_enqueue_timeout_sbt(), but to be honest both of those seem like overkill. I think that if a user reaches the point of wanting to tweak this variable, it's probably very easy for them to figure out that they should check kern.hz. I can further ease that discovery by mentioning kern.hz in the sysctl description.
1968–1984	I don't think that would buy anything though. Immediately below this line is a similar non-atomic update of a global variable, and it wouldn't make any sense to move that under a per-mount lock. An explanation would still be useful for that line.
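
For illustration, the unit trade-off discussed for line 109 comes down to simple arithmetic on the tick rate. The sketch below is userland C under stated assumptions: seconds_to_ticks and ms_to_ticks are hypothetical helpers, not functions from the patch or the kernel, and the hz argument stands in for whatever kern.hz reports on a given machine.

```c
#include <stdint.h>

/*
 * Hypothetical helper: a delay given in whole seconds, converted to
 * ticks for a tick rate of hz ticks per second.
 */
static int64_t
seconds_to_ticks(int64_t seconds, int64_t hz)
{
	return (seconds * hz);
}

/*
 * Hypothetical helper: a delay given in milliseconds, converted to
 * ticks and rounded up so that a small nonzero delay never becomes
 * zero ticks.
 */
static int64_t
ms_to_ticks(int64_t ms, int64_t hz)
{
	return ((ms * hz + 999) / 1000);
}
```

With a tick-denominated sysctl, a 2 second delay is 200 on a machine where hz is 100 and 2000 where hz is 1000, which is why the administrator has to consult kern.hz first. A millisecond-denominated value would be machine-independent and finer than whole seconds, at the cost of the conversion shown here.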

Add comment on (lack of) synchronization for counters, clarify sysctl description

This revision now requires review to proceed. · Aug 15 2021, 1:07 AM

Harbormaster completed remote builds in B41045: Diff 93698. · Aug 15 2021, 1:08 AM

Sorry for the delay in responding, been vacationing this week. Looks good to go.

This revision is now accepted and ready to land. · Aug 20 2021, 12:38 AM

Closed by commit rGe81e71b0e9cb: Use interruptible wait for blocking recursive unmounts (authored by jah). · Aug 20 2021, 8:18 PM

This revision was automatically updated to reflect the committed changes.

jah added a commit: rGe81e71b0e9cb: Use interruptible wait for blocking recursive unmounts.

jah added a commit: rGa8c732f4e52e: VFS: add retry limit and delay for failed recursive unmounts.

VFS: add retry limit and delay for failed recursive unmounts
ClosedPublic

Revision Contents
Changeset List

Diff 93974

sys/kern/vfs_mount.c