rtld: actually resolve memcpy plt
ClosedPublic
Actions

Authored by rlibby on Jul 5 2024, 10:22 PM.

Details

Reviewers

Commits

rG39733922edc4: rtld: actually resolve memcpy plt

Summary

The call to memcpy() meant to cause plt resolution in _thr_rtld_init()
was getting optimized by the compiler. Tell the compiler not to use its
memcpy() builtin in thr_rtld.c.

Diff Detail

Repository

rG FreeBSD src repository

Lint

Lint Not Applicable

Unit

Tests Not Applicable

Event Timeline

rlibby created this revision.Jul 5 2024, 10:22 PM

Herald added a subscriber: imp. · View Herald TranscriptJul 5 2024, 10:22 PM

rlibby requested review of this revision.Jul 5 2024, 10:22 PM

Harbormaster completed remote builds in B58529: Diff 140603.Jul 5 2024, 10:22 PM

rlibby added a child revision: D45892: rtld: quiet gcc -Wrestrict.Jul 5 2024, 10:23 PM

Should we disable all built-ins for the file, for the same reasoning?

This revision is now accepted and ready to land.Jul 5 2024, 10:32 PM

In D45891#1046235, @kib wrote:

Should we disable all built-ins for the file, for the same reasoning?

That makes sense to me, for future insurance. I checked just now that -fno-builtin didn't reveal any additional cases. Do you prefer to make that change?

kib: just disable all builtins

This revision now requires review to proceed.Jul 6 2024, 1:10 AM

Harbormaster completed remote builds in B58533: Diff 140614.Jul 6 2024, 1:10 AM

kib accepted this revision.Jul 6 2024, 2:33 AM

kib added inline comments.

lib/libthr/Makefile
35	I would add a comment there, noting that this flag is there for functional purpose, instead of suppressing some warning.

This revision is now accepted and ready to land.Jul 6 2024, 2:33 AM

kib: comment the cflag

This revision now requires review to proceed.Jul 6 2024, 4:54 AM

Harbormaster completed remote builds in B58536: Diff 140617.Jul 6 2024, 4:54 AM

Thank you

This revision is now accepted and ready to land.Jul 6 2024, 7:52 AM

Closed by commit rG39733922edc4: rtld: actually resolve memcpy plt (authored by rlibby). · Explain WhyJul 7 2024, 11:47 PM

This revision was automatically updated to reflect the committed changes.

rlibby added a commit: rG39733922edc4: rtld: actually resolve memcpy plt.

No sure how much the builtins optimization matters here but if it does the other workaround would be:

void* memcpy_for_plt(void*, const void*, size_t) asm("memcpy")

And then call that.

In D45891#1046815, @arichardson wrote:
No sure how much the builtins optimization matters here but if it does the other workaround would be:
void* memcpy_for_plt(void*, const void*, size_t) asm("memcpy")
And then call that.

Thanks for the info. I think @kib could speak better to whether any code in this file could be important in terms of performance. In any case, I believe the one memcpy was the only site actually affected when I examined the codegen for amd64.

In D45891#1046821, @rlibby wrote:
In D45891#1046815, @arichardson wrote:
No sure how much the builtins optimization matters here but if it does the other workaround would be:
void* memcpy_for_plt(void*, const void*, size_t) asm("memcpy")
And then call that.
Thanks for the info. I think @kib could speak better to whether any code in this file could be important in terms of performance. In any case, I believe the one memcpy was the only site actually affected when I examined the codegen for amd64.

lock_acquire/release are the only functions which performance is critical for runtime.

That said, I do believe that correctness trumps speed.

Revision Contents
Changeset List

Path

Size

lib/

libthr/

Makefile

4 lines

Diff 140669

View Options

rtld: actually resolve memcpy pltClosedPublicActions