Paths

Table of Contentst

proc0_post: Fix some locking issues
ClosedPublic
Actions

Authored by bdrewery on Jun 14 2018, 4:50 PM.

Details

Reviewers

kib
jhb
markj

Commits

rS335183: proc0_post: Fix some locking issues

Summary

Filter out PRS_NEW procs as rufetch() tries taking the thread lock which may not yet be initialized.
Hold PROC_LOCK to ensure stability of iterating the threads.
p_rux fields are protected by the process statlock as well.

More details

This bug

proc0_post iterates FOREACH_PROC_IN_SYSTEM and then calls rufetch(p) which does FOREACH_THREAD_IN_PROC before taking thread lock.

This page faults of the thread lock is not yet initialized for PRS_NEW procs that are in dofork(). In the case I hit it was calling fdcopy() long before the sched_fork() call to initialize the thread ptrs.

None of this code is holding the PROC_LOCK or PROC_SLOCK.

The typical pattern I've seen for dealing with this is to simply filter out PRS_NEW procs but my proposed patch feels very incomplete as rufetch() still has no care about whether the process is PRS_NEW.

rS275121 changed from using the proc slock around the rufetch() call in proc0_post to the proc statlock. It may be enough to use the slock again in rufetch and filter out PRS_NEW procs there but I haven't analyzed it deeply yet.

more proc0_post issues?

The code is still racy in that microuptime(&p2->p_stats->p_start); is called for new forking processes but proc0_post may come along and trash that with a new value, as well as clearing all of its other stats.

Diff Detail

Repository

rS FreeBSD src repository - subversion

Lint

Lint Not Applicable

Unit

Tests Not Applicable

Event Timeline

bdrewery created this revision.Jun 14 2018, 4:50 PM

Harbormaster completed remote builds in B17275: Diff 43771.Jun 14 2018, 4:50 PM

bdrewery edited the summary of this revision. (Show Details)Jun 14 2018, 4:55 PM

bdrewery added reviewers: kib, jhb, markj.

kib added inline comments.Jun 14 2018, 5:25 PM

sys/kern/init_main.c
626 ↗	(On Diff #43771)	I think the process lock scope must be extended to the end of the loop iteration. This would guarantee stability of the iteration on threads.
630 ↗	(On Diff #43771)	p_rux fields which are cleared below are annotated as protected by the proc stat lock, so it makes sense to move the unlock right before FOREACH_THREAD... . This is not to protect updating each rux_ field, but to make the whole change atomic WRT other p_rux observers.
636 ↗	(On Diff #43771)	Thread lock is useless there, so not taking it is fine.

What is the purpose of this code to begin with? It looks like it should just be removed. If it is needed (what for?), it probably has to run after all initial forking is finished.

The thread list traversal is not protected with any relevant locks.

Filter out PRS_NEW procs as rufetch() tries taking the thread lock which may not yet be initialized.
Hold PROC_LOCK to ensure stability of iterating the threads.
p_rux fields are protected by the process statlock as well.

Harbormaster completed remote builds in B17279: Diff 43778.Jun 14 2018, 6:35 PM

bdrewery retitled this revision from proc0_post: Filter out new forking procs. to proc0_post: Fix some locking issues.Jun 14 2018, 6:35 PM

bdrewery edited the summary of this revision. (Show Details)

In D15809#334292, @mjg wrote:

What is the purpose of this code to begin with? It looks like it should just be removed. If it is needed (what for?), it probably has to run after all initial forking is finished.

Code makes consistent early processes start time vs rusage.

kib accepted this revision.Jun 14 2018, 6:38 PM

This revision is now accepted and ready to land.Jun 14 2018, 6:38 PM

Closed by commit rS335183: proc0_post: Fix some locking issues (authored by bdrewery). · Explain WhyJun 15 2018, 12:36 AM

This revision was automatically updated to reflect the committed changes.

Herald added a subscriber: imp. · View Herald TranscriptJun 15 2018, 12:36 AM

Revision Contents
Changeset List

Path

Size

head/

sys/

kern/

init_main.c

8 lines

Diff 43794

View Options

proc0_post: Fix some locking issuesClosedPublicActions