Add more hand annotations to several bcopy() variants in the "xscale" version of the ARMv7 memory copying code. Labelled as dubious because these functions don't follow the normal conventions for stack use and block structure, so some of the annotations may not behave as well as they should. Overall, though, stack traces through this code are markedly improved.
NB: Not for this change, but all of the memory copying code in and around ARMv[67] seems a bit broken. We should replace it all with the ARM-provided reference implementations, which are likely faster and better in every way.