Add an nscache-specific inline function to calculate the misalignment
rather than adding and subtracting _ALIGN(p) and p, which can take the
buffer pointer far out of bounds (undefined behavior in C and
unsupported on CHERI).
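
A minimal sketch of the idea, not the committed code: the helper name
ex_misalignment() and the EX_ALIGNBYTES mask are hypothetical stand-ins
for the nscache helper and the platform's alignment mask. The padding
is computed from the integer address alone, so no intermediate pointer
outside the buffer is ever formed.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical alignment mask; assumes pointer-size alignment. */
#define EX_ALIGNBYTES	(sizeof(void *) - 1)

/*
 * Return the number of padding bytes needed to round p up to the next
 * aligned address.  Only integer arithmetic on the address is used;
 * the pointer itself is never moved.
 */
static inline size_t
ex_misalignment(const void *p)
{
	uintptr_t addr = (uintptr_t)p;
	uintptr_t mask = (uintptr_t)EX_ALIGNBYTES;
	uintptr_t aligned = (addr + mask) & ~mask;

	return ((size_t)(aligned - addr));
}
```

A caller would then advance its in-bounds cursor by the returned amount
(after checking the space remaining) instead of deriving the offset by
mixing _ALIGN(p) and p in pointer arithmetic.
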
Obtained from: CheriBSD
Effort: CHERI upstreaming
Sponsored by: DARPA