To be used later in libthr. Note that malloc_aligned() in the rtld/xmalloc.c has additional requirements (offset) and cannot be that easily replaced by __crt_aligned_alloc().
```
rtld_malloc: remove outdated comments
The ovu_magic is not neccessary overlaps with low byte of the ov_next,
for the big endian machines.
There is no range checking in the allocator.
```
```
rtld: remove dup __crt_malloc prototypes
```
```
rtld_malloc: only include internal rtld headers when building for rtld
```
```
rtld_malloc: add cp2op() helper
converting user allocation address into overhead pointer
```
```
rtld_malloc: increase overhead index to uint16
Reorder it with magic, to keep alignment.
```
```
rtld_malloc: add __crt_aligned_alloc()
It is modelled after aligned_alloc(3). Most importantly, to free the
allocation, __crt_free() can be used.
```
```
thr_mutex.c: style
Reindend and re-fill the statement.
```
```
thr_malloc: add __thr_calloc_aligned_cacheline()
```
```
libthr: allocate mutexes aligned on the cache line
```