Atomic updates to v_wire_count are a significant source of contention,
so help reduce this by decrementing v_wire_count only once when freeing
a set of page table pages.
Details
Diff Detail
- Lint
Lint Passed - Unit
No Test Coverage - Build Status
Buildable 10759 Build 11154: arc lint + arc unit
Event Timeline
sys/amd64/amd64/pmap.c | ||
---|---|---|
2215 | for (count = 0; ((m = SLIST_FIRST()) != NULL; count++) {} ? |
sys/amd64/amd64/pmap.c | ||
---|---|---|
2331 | Is sending an IPI a serializing operation? If so, then doesn't every possible method of TLB invalidation contain at least one serializing operation? If that's the case, then this seems pointless. I think that a comment at the head of the TLB invalidation functions about their serializing effects would be better. |
sys/amd64/amd64/pmap.c | ||
---|---|---|
2331 | What do you mean by pointless ? The rel fence above or the whole change ? rel is only a compiler barrier. That said, sending IPI (as opposed to act on IPI) is not obligatory serializing. I added mfence to x2APIC-mode IPI-send to be compatible with xAPIC mode, since there was some code depending on the effects before IPI visible to IPI handler. But this is not enforced by hardware and might be optimized to avoid fence if possible. |
sys/amd64/amd64/pmap.c | ||
---|---|---|
2331 |
Just the release fence. I'm questioning whether we shouldn't just remove this whole snippet. (There is also the inherent ordering of stores on the x86 to consider.) Otherwise, I like the change. |
sys/amd64/amd64/pmap.c | ||
---|---|---|
2331 | Just an observation ... This release fence is inherited from the i386 pmap, where it is immediately followed by a TLB invalidation on the recursive mapping. |
Address review feedback:
- Use a for-loop in pmap_free_zero_pages().
- Remove fences in _pmap_unwire_ptp(). While ipi_cpu() is not necessarily serializing, x86 has TSO.