uma_zalloc_arg/zfree_arg have grown out of control.  I would like to add another new feature and it may require a custom trip through uma_zalloc and I didn't want to add a bunch of new flag checks.
I attempted to make more of a fastpath without a total hack.  This should dramatically reduce the amount of text involved in normal allocations.  I moved some of the zitem checks closer to the allocation so they are more encapsulated.  I still feel like a fair bit of this should be abstracted into inlines for clarity.
This diff should be functionally identical.  I don't think it's perfect but I think it's a step in the right direction.  I will be posting another for uma_zfree_arg() soon.