Details

Reviewers

jhb
brooks
jrtc27
ngie
vangyzen
lwhsu

Commits

rS368055: Significantly speed up libthr/mutex_test and make more reliable

Summary

Instead of using a simple global++ as the data race, with this change we
performs the increment by loading the global, delaying for a bit and then
storing back the incremented value. If I move the increment outside of the
mutex protected range, I can now see the data race with only 100 iterations
on amd64 in almost all cases. Before such a broken test almost always
passed with < 100,000 iterations and only reliably failed with the current
limit of 10 million.

I noticed this poorly written test because the mutex:mutex{2,3} and
timedmutex:mutex{2,3} tests were always timing out on our CheriBSD Jenkins.
Writing good concurrency tests is hard so I won't attempt to do so, but
this change should make the test more likely to fail if pthread_mutex_lock
is not implemented correctly while also significantly reducing the time it
takes to run these four tests. It will also reduce the time it takes to
perform QEMU RISC-V testsuite runs by almost 40 minutes (out of currently
7 hours).

Diff Detail

Repository

rS FreeBSD src repository - subversion

Lint

Lint Not Applicable

Unit

Tests Not Applicable

Event Timeline

arichardson requested review of this revision.Sep 18 2020, 10:29 AM

arichardson created this revision.

Harbormaster completed remote builds in B33621: Diff 77170.Sep 18 2020, 10:29 AM

These are in contrib/netbsd-tests; will you be submitting this upstream? I note that our vendor copy is ancient; mutex6 was deleted upstream in December 2017, and the workaround for PR 44387 on PowerPC removed in March 2017.

Please add __FreeBSD__ around any test modifications; it makes it easier for folks to upstream the changes as we're 3 years out of date.

This is where the sources live upstream: http://cvsweb.netbsd.org/bsdweb.cgi/src/tests/lib/libpthread/t_mutex.c?only_with_tag=MAIN .

add missing static

Harbormaster completed remote builds in B33852: Diff 77592.Sep 28 2020, 10:20 AM

Submitted upstream as http://gnats.netbsd.org/cgi-bin/query-pr-single.pl?number=55677

Add ugly #ifdef FreeBSD

Harbormaster completed remote builds in B33853: Diff 77593.Sep 28 2020, 10:27 AM

brooks accepted this revision.Sep 29 2020, 10:11 PM

This revision is now accepted and ready to land.Sep 29 2020, 10:11 PM

ngie added inline comments.Sep 30 2020, 9:12 PM

contrib/netbsd-tests/lib/libpthread/t_mutex.c
206 ↗	(On Diff #77593)	This is misindented.
324 ↗	(On Diff #77593)	strto*l is preferred; also: what if the number of iterations was <= 0?
359–362 ↗	(On Diff #77593)	The messages could be a bit more complete. Ideally we shouldn't have to read source code to understand errors.

arichardson marked an inline comment as done.Sep 30 2020, 9:31 PM

arichardson added inline comments.

contrib/netbsd-tests/lib/libpthread/t_mutex.c
359–362 ↗	(On Diff #77593)	Yes that would be nice, but most failures in this test (and most other tests) do require looking at the source code. This error will print "count != -1: <value>" which is marginally better than the existing checks in this file such as `ATF_REQUIRE_EQ(x, 21);` -> "x != 21" `ATF_REQUIRE_EQ((int )joinval, 21);` -> "(int )joinval != 21"

fix indentation and avoid negative num_iterations

This revision now requires review to proceed.Sep 30 2020, 9:39 PM

Harbormaster completed remote builds in B33909: Diff 77705.Sep 30 2020, 9:39 PM

arichardson added inline comments.Oct 1 2020, 8:35 AM

contrib/netbsd-tests/lib/libpthread/t_mutex.c
324 ↗	(On Diff #77593)	I can also drop the getenv part of the patch? This was mostly to allow testing the duration and effectiveness of different numbers of iterations.

@ngie is this okay to commit now?

ping?

ping @ngie ?

Non-blocking thought: can this use sem_post/sem_(timed)?wait instead of spinning in busy-loops waiting for threads to start?

This revision is now accepted and ready to land.Nov 18 2020, 5:51 AM

ngie added inline comments.Nov 18 2020, 5:52 AM

contrib/netbsd-tests/lib/libpthread/t_mutex.c
359–362 ↗	(On Diff #77593)	Printing out the line number might be helpful in this case along with a more humanized description, TBH.

arichardson added inline comments.Nov 26 2020, 11:50 AM

contrib/netbsd-tests/lib/libpthread/t_mutex.c
359–362 ↗	(On Diff #77593)	We could modify ATF_REQUIRE_EQ_MSG to also print the location information?

In D26473#608811, @ngie wrote:

Non-blocking thought: can this use sem_post/sem_(timed)?wait instead of spinning in busy-loops waiting for threads to start?

That is probably nicer, but it adds a dependency on working sem_* implementations. I'd be tempted to keep the busy-wait.

Closed by commit rS368055: Significantly speed up libthr/mutex_test and make more reliable (authored by arichardson). · Explain WhyNov 26 2020, 1:32 PM

This revision was automatically updated to reflect the committed changes.

arichardson added a commit: rS368055: Significantly speed up libthr/mutex_test and make more reliable.

Herald added a subscriber: imp. · View Herald TranscriptNov 26 2020, 1:32 PM

Significantly speed up libthr/mutex_test and make more reliable
ClosedPublic
Actions

Details

Diff Detail

Event Timeline

Revision Contents
Changeset List

Diff 80017

head/contrib/netbsd-tests/lib/libpthread/t_mutex.c

Significantly speed up libthr/mutex_test and make more reliableClosedPublicActions

Details

Diff Detail

Event Timeline

Revision ContentsChangeset List

Diff 80017

head/contrib/netbsd-tests/lib/libpthread/t_mutex.c

Significantly speed up libthr/mutex_test and make more reliable
ClosedPublic
Actions

Revision Contents
Changeset List