amd64 csu: micro-optimize startup
Reorder operations to reduce the number of register-to-register moves.
Use the LEA instruction to combine arithmetic with the moves.
Reviewed by: markj
Sponsored by: The FreeBSD Foundation
MFC after: 1 week
Differential revision: https://reviews.freebsd.org/D49206