For the sake of completeness, here's what the consistent-endian implementation looks like -- it's significantly less intrusive, and generally looks cleaner.
The only thing left to check is the performance impact, and since 12.0 is in the beta stage now, I guess I have the time to do just that. Any hints on testing? I'm not really familiar with performance issues.