This makes an optional cross-domain free bucket for memory freed to a different domain than it was allocated from. This gives us support for precise first-touch domains that won't mix memory. If you have a zone which does mix allocs and frees at a significant rate it can eventually bottleneck while freeing memory. I may address that in a follow-up patch if there is enough interest.
Combined with increased thread locality this can offer significant performance improvements for targeted workloads. It is likely not generally faster so it is hidden behind an option for now.