In a dual processor system (2*6 cores) during IPSec throughput tests, we see a lot of contention on the arc4 lock, used to generate the IV of the ESP output packets.
The idea of this patch is to split this mutex in order to reduce/kill the contention on this lock.
when ZFS grows the pool, it will add more metaslabs of this same size.
So we want to keep the number of slabs small, so that growing the pool to 100x the size doesn't result in 16,000 metaslabs.
The only downside is, any slack that doesn't fit into a metaslab will be unused, so if we just set a metaslab size of 4G, then we'd only have 8G usable from a 10G image.
Although without knowing how large they are going to grow it, it can be difficult to guess what number is right.