Page MenuHomeFreeBSD

Improve MD page fault handlers.
ClosedPublic

Authored by kib on Sep 8 2019, 4:27 PM.
Tags
None
Referenced Files
Unknown Object (File)
Sat, Jan 18, 4:35 AM
Unknown Object (File)
Fri, Jan 17, 11:56 AM
Unknown Object (File)
Fri, Jan 3, 7:55 PM
Unknown Object (File)
Thu, Jan 2, 12:03 PM
Unknown Object (File)
Sun, Dec 29, 9:02 AM
Unknown Object (File)
Dec 21 2024, 4:56 PM
Unknown Object (File)
Dec 13 2024, 8:53 PM
Unknown Object (File)
Dec 13 2024, 4:13 PM

Details

Summary

Centralize calculation of signal and ucode delivered on unhandled page fault.

Change the delivered signal for accesses to mapped area after the backing object was truncated.
According to POSIX description for mmap(2),

The system shall always zero-fill any partial page at the end of an object. Further, the system
shall never write out any modified portions of the last page of an object which are beyond its
end. References within the address range starting at pa and continuing for len bytes to whole
pages following the end of an object shall result in delivery of a SIGBUS signal.

An implementation may generate SIGBUS signals when a reference would cause an error in the
mapped object, such as out-of-space condition.

Adjust according to the description, keeping the existing compatibility code for SIGSEGV/SIGBUS on protection failures.

Use the opportunity and move the signal calculation to trap_pfault().

Not sure what better ucode to specify when SIGBUS is delivered for swap reservation failure on CoW.

PR: 211924

Diff Detail

Repository
rS FreeBSD src repository - subversion
Lint
Lint Skipped
Unit
Tests Skipped
Build Status
Buildable 26436

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes
kib edited the summary of this revision. (Show Details)
kib marked 2 inline comments as done.

Handle jilles' notes. Add changes for !x86 (not even compiled).

kib retitled this revision from Change the delivered signal for accesses to mapped area after the backing object was truncated. to Improve MD page fault handlers..Sep 11 2019, 8:18 PM
kib edited the summary of this revision. (Show Details)
sys/amd64/amd64/trap.c
114

The return value is never used.

sys/amd64/amd64/trap.c
348–353

I think that you actually want to use the return value from trap_pfault() here rather than signo.

sys/amd64/amd64/trap.c
800–804

This assignment no longer serves any purpose.

kib marked 3 inline comments as done.

Fix bugs pointed out by alc, for amd64 and i386.

sys/amd64/amd64/trap.c
191

Style: "pf" should be first.

195–196

As an aside, I question if any of these three assignments are needed, even by gcc 4.2.1. Arguably, these assignments are harmful in the respect that they guarantee that the compiler won't generate a warning in case a future change failed to assign a meaningful value to one of these variables and that value was needed, i.e., we were generating a signal.

407

Style: Eliminate the space after the cast.

701

Please consider adding KASSERT((signo != NULL && ucode != NULL) || !usermode, ...) as "documentation" for the new parameters.

749–750

The truncation here is pointless. In fact, the truncation is repeated in vm_fault_trap().

kib marked 5 inline comments as done.Sep 12 2019, 3:57 PM
kib added inline comments.
sys/amd64/amd64/trap.c
195–196

Yes, I believe signo/ucode/addr initializations can be eliminated. But dr6 cannot, because of T_BPTFLT for kernel.

gcc 4.2 should be irrelevant already, I think -Werror was turned off for it recently.

kib marked an inline comment as done.

More changes requested by alc.

sys/amd64/amd64/trap.c
678–689

Please add a comment here describing the meaning of the three possible return values.

sys/amd64/vmm/vmm.c
1414

I would suggest renaming vm_fault_hold() to vm_fault().

sys/arm/arm/trap-v4.c
357

Shouldn't this be error == KERN_SUCCESS

sys/arm64/arm64/trap.c
212–213

Like amd64, the truncation here is pointless.

kib marked 4 inline comments as done.

Rename vm_fault_hold->vm_fault.
Add comment for amd64 trap_pfault.
Minor changes for arm*.

  • Stop overloading KERN_INVALID_ADDRESS for out-of-object error. This breaks other situations, it seems that the best route is to add a new Mach-style error.
  • Switch to use switch in signal decoding in vm_fault_trap().
  • Do not assume that map == kernel_map implies signo != NULL. copyin faults on user address, but in kernel mode.

I imported the test from D21624 to ease testing.

sys/riscv/riscv/trap.c
220–242

I'm surprised to see this snippet brought back to life in the riscv code. We removed it from elsewhere in the source tree almost exactly 4 years ago (r287625).

I would suggest committing the removal of this snippet now, and thus separately from the rest.

sys/vm/vm_fault.c
618

1->0

In D21566#470372, @kib wrote:

... Note that vm_map_wire() use of vm_fault() caused ktrace recording the wired pages, which IMO is not right.

I agree.

kib marked 2 inline comments as done.Sep 13 2019, 7:24 PM

Rebase after riscv commit.

sys/amd64/amd64/trap.c
800–805

si_pkey contains the protection key (between 0 and 15) that caused the fault.

sys/amd64/amd64/trap.c
800–805

I cannot provide this information without race, where userspace might update pk for the faulting address whle handler did not yet fetched the pte.

I think FreeBSD would not provide si_pkey.

Build fixes: tb passes.

Any further comments on the change ?

sys/amd64/amd64/trap.c
341

"Emulator can handle this trap?"

347–348

This assignment could be moved down below the two "if" statements, just before the "break".

682

"If this fault was fatal, typically from kernel mode"

684

"If this fault was handled by updating either the user or kernel page table, execution ..."

686

"If this fault was from user mode and it was not handled, a synchronous signal

sys/i386/i386/trap.c
732–735

The amd64 comment could be used here with the addition of -2 to the list of return values.

sys/vm/vm_fault.c
1078

Can you double-check this case? Does jilles@ comment still apply here? I ask because KERN_INVALID_ADDRESS is going to generate a SIGSEGV, not a SIGBUS.

sys/vm/vm_param.h
116 ↗(On Diff #62097)

I would suggest "KERN_OUT_OF_BOUNDS". I have never seen this phrase used except with the plural "bounds".

kib marked 9 inline comments as done.Sep 22 2019, 7:30 PM
kib added inline comments.
sys/vm/vm_fault.c
1078

I believe that this line change was done before introduction of KERN_OUT_OF_BOUNDS, then the signal generation code was rewritten to accommodate new errors but this line was left behind.

kib marked an inline comment as done.

Latest round of fixes.

Alan, any more comments ? I want to close the review.

In D21566#475488, @kib wrote:

Alan, any more comments ? I want to close the review.

I should be able to look at this change again today.

sys/mips/mips/trap.c
672

Another redundant trunc_page()

709

Another redundant trunc_page()

sys/powerpc/powerpc/trap.c
463

Why pass &sig and &ucode in this case? They don't appear to be used after the call.

756–757

I don't see why we need to do this.

sys/vm/vm_extern.h
90–91

If we are trying to maintain the alphabetical ordering of the declarations in this contiguous block, then this one move down several places.

sys/vm/vm_fault.c
527

Feel free to ignore or defer action on the following comment: To me, the wording of this description suggests that I should set this sysctl to the signal number that I want delivered.

kib marked 6 inline comments as done.Sep 26 2019, 8:23 PM
kib added inline comments.
sys/vm/vm_fault.c
527

s/select/control/

kib marked an inline comment as done.

More alc' notes.

alc added inline comments.
sys/amd64/amd64/trap.c
341

Get rid of the space before the '?'.

sys/arm/arm/trap-v4.c
102

This is now redundant, and can be deleted.

sys/powerpc/powerpc/trap.c
90–92

Consider changing "user" into a "bool".

766

Another redundant trunc_page()

This revision is now accepted and ready to land.Sep 27 2019, 6:26 AM
kib marked 4 inline comments as done.

More notes by alc.

This revision now requires review to proceed.Sep 27 2019, 7:15 AM
This revision is now accepted and ready to land.Sep 27 2019, 2:08 PM
This revision was automatically updated to reflect the committed changes.