Page MenuHomeFreeBSD

vfs: stop error checking vget(..., LK_RETRY, ...) in vfs_msync
AbandonedPublic

Authored by mjg on Dec 5 2019, 2:37 PM.
Tags
None
Referenced Files
Unknown Object (File)
Thu, Dec 26, 5:29 AM
Unknown Object (File)
Wed, Dec 11, 4:34 PM
Unknown Object (File)
Thu, Dec 5, 5:49 PM
Unknown Object (File)
Nov 22 2024, 12:30 PM
Unknown Object (File)
Nov 17 2024, 1:22 PM
Unknown Object (File)
Nov 12 2024, 6:50 AM
Unknown Object (File)
Sep 29 2024, 7:09 PM
Unknown Object (File)
Sep 24 2024, 6:54 PM
Subscribers

Details

Reviewers
jeff
kib
Summary

If the vnode is doomed the routine should have nothing to do in the loop to begin with. Nonetheless, keep it but remove the check for an always true condition.

Diff Detail

Lint
Lint Skipped
Unit
Tests Skipped

Event Timeline

In fact I am not sure about this revision, as well as with the VNODE_IS_DOOMED(). My main concern is that we allowed to drop the vnode lock in the course of vgone(l)(). With this in mind, I do not see anything wrong, and rather it is useful to clean pages of the vnode if writes are still working, i.e. as far as we did not called RECLAIM.

For VNODE_IS_DOOMED(), you changed the logic significantly if the vnode unlock is possible, before it code relied on the fact that started vgonel() is noted, with VNODE_IS_DOOMED() it would not notice until the vnode is completely dismantled up to the point of resetting v_ops vector.

If you are worried about catching code doing such an unlock, we can add an assertion to unlock that if VI_DOOMED is set, v_op is dead_vnodeops. If want such an unlock to remain legitimate, I can move v_iflag to the beginning of the struct. Right now there is a 4 byte hole in there on amd64 (after v_tag got moved). I did not want to do it because it adds an avoidable constraint on reshuffling of the layout for later.

That said, I'm fine with leaving LK_RETRY in place and removing the error check instead. However, in the current codebase I don't see a case where VI_DOOMED vnode would get returned and v_object would still be set.

Regardless of the above, the current approach to dooming is buggy. Consider a filesystem which uses custom locking routines not based around lockmgr.

  1. the thread which locked the vnode and called vgonel ends up calling lockmgr's unlock fetched from dead_vnodeops. meaning the actual lock may not get proper rollback
  2. a concurrently executing thread doing _vn_lock can get itself into the fs-specific locking routine. it either never gets released, thus the thread gets stuck or it gets released and the thread gets the wrong lock. if LK_RETRY was passed, it proceeds when it should not. if it is not set, it performs the mismatched unlock like the thread above

That being said, I think a dedicated VOP should be provided for filesystems to take care of this (and the rest of vgonel).

In the meantime we can just go with that v_iflag move as least contentious.

mjg retitled this revision from vfs: drop LK_RETRY from vget in vfs_msync to vfs: stop error checking vget(..., LK_RETRY, ...) in vfs_msync.
mjg edited the summary of this revision. (Show Details)
This revision is now accepted and ready to land.Dec 7 2019, 5:32 PM

Still, I think that proposed assert would be very useful and either removes FUD and unblocks some of your patches, or proves the point.

In D22689#497216, @kib wrote:

Still, I think that proposed assert would be very useful and either removes FUD and unblocks some of your patches, or proves the point.

I will add the assert later. Note I created a new review for the vn_lock problem which does not depend on the asserted property. D22715

sys/kern/vfs_subr.c
4355

We're misusing LK_RETRY all over the place IMO and it has lead to an explosion in conditions where we check doomed.

I prefer that we not use it and catch errors. There are many places that specify LK_RETRY and then immediately test DOOMED. This makes no sense. The lock is only supposed to fail if the vnode is doomed.

LK_RETRY was supposed to be a way to specify that the caller had to continue with some vnode even if it was doomed. Largely because we were deep in the stack in a place that expected to maintain a reference to a vnode.

sys/kern/vfs_subr.c
4355

My original patch was dropping LK_RETRY and effectively making the error check usable. I don't have strong opinion which way to go with this one, motivation for the patch was to clean up before other changes in the area.