For normal non-cacheable memory ACE supports 4x128 bit r/w WRAP
transfers or 1x128 bit r/w INCR transfers. By re-ordering the
stp's in memcpy / memmove we can accomodate this better without
impacting the existing code.
This fixes an issue seen on multiple Cortex-A72 SOCs when writing
directly to a PCIe memmapped frame-buffer, which resulted in
corruption.
Originally by Jon Nettleton <jon@solid-run.com>
https://gist.github.com/jnettlet/80f8d09d01c0dc0ffc0122f36ed78de6
https://gist.github.com/jnettlet/f6f8b49bb7c731255c46f541f875f436