Page MenuHomeFreeBSD

LinuxKPI: make __kmalloc() play by the rules
ClosedPublic

Authored by bz on Sep 12 2024, 6:25 PM.
Tags
None
Referenced Files
Unknown Object (File)
Fri, Oct 24, 11:08 AM
Unknown Object (File)
Mon, Oct 20, 12:30 AM
Unknown Object (File)
Sat, Oct 11, 4:52 AM
Unknown Object (File)
Tue, Oct 7, 1:52 AM
Unknown Object (File)
Mon, Oct 6, 12:06 PM
Unknown Object (File)
Mon, Oct 6, 8:12 AM
Unknown Object (File)
Sun, Oct 5, 7:50 PM
Unknown Object (File)
Sun, Sep 28, 6:23 PM
Subscribers

Details

Summary

According to Documentation/core-api/dma-api.rst kmalloc() is supposd
to provide physically contiguous memory. [1]

In order to guarantee that allocations are contiguous even if using
larger than PAGE_SIZE check the size and use contigmalloc if needed.
This makes use of 9e6544dd6e02 (and following) allowing free(9) to
also work for contigmalloced memory.

Sponsored by: The FreeBSD Foundation
Pointed out by: jhb [1]
MFC after: 3 days

Diff Detail

Repository
rG FreeBSD src repository
Lint
Lint Passed
Unit
No Test Coverage
Build Status
Buildable 59463
Build 56350: arc lint + arc unit

Event Timeline

bz requested review of this revision.Sep 12 2024, 6:25 PM
bz edited reviewers, added: jhb; removed: jh.Sep 12 2024, 6:28 PM

Looks reasonable to me. Aside, it seems we don't document boundary=0 in contigmalloc(9).

Looks reasonable to me. Aside, it seems we don't document boundary=0 in contigmalloc(9).

Yes, though most drivers in sys/dev do not set it; vm_page_alloc_contig documents it as

"If the boundary
parameter is non-zero, the pages constituting the run will not cross a
physical address that is a multiple of the parameter value, which must be
a power of two."

I followed a very long call chain to arrive at vm_addr_bound_ok, which confirmed -- and the nice thing is boundary == 0 doesn't even need to be a special case:

/*
 * Do the first and last addresses of a range match in all bits except the ones
 * in -boundary (a power-of-two)?  For boundary == 0, all addresses match.
 */
static inline bool
vm_addr_bound_ok(vm_paddr_t pa, vm_paddr_t size, vm_paddr_t boundary)
{                               
        KASSERT(powerof2(boundary), ("%s: boundary is not a power of 2: %#jx",
            __func__, (uintmax_t)boundary));
        return (((pa ^ (pa + size - 1)) & -boundary) == 0);
}

If I have a chance I'll propose a contigmalloc(9) addition.

sys/compat/linuxkpi/common/src/linux_slab.c
221

is this needed? (either for contigmalloc, or to match Linux KPI?)

Looking, kmem_alloc_contig_domain does asize = round_page(size); so unless we need it to be explicit about Linux behaviour perhaps just passing _s through is sensible?

bz marked an inline comment as done.Sep 12 2024, 9:03 PM
bz added inline comments.
sys/compat/linuxkpi/common/src/linux_slab.c
221

[ and so does contigmalloc itself: malloc_type_allocated(type, round_page(size)); ]

I think this is no longer needed; I'll remove it.

bz marked an inline comment as done.

simplify as @emaste pointed out; we no longer need to round up to page size.

LGTM, we can wait for @jhb as well though

This revision is now accepted and ready to land.Sep 13 2024, 1:06 AM
sys/compat/linuxkpi/common/src/linux_slab.c
218–224

Given we have some traction on these reviews and I looked again; this should be <= as 0..PAGE_SIZE is the first page and a single page. @emaste @jhb

Note my comment in the other review. The kmalloc_node family of functions needs the same fix.

This revision was automatically updated to reflect the committed changes.

It looks like <= PAGE_SIZE would indeed be correct

It looks like <= PAGE_SIZE would indeed be correct

I'll go and adjust it.. but it shouldn't make a difference to David's problem.
I do not understand yet how this particular change can cause a regression. I kept staring if I got the contigmalloc arguments wrong and I wonder if something else before just worked by accident and now gets served from a different memory region. Swapping malloc/contigmalloc should not be anything noticeable to a drm driver per-se.

Still contemplating to back it out until the problem is understood...