arm/aarch64: Make the assembly indentation slightly more consistent

The assembly currently uses a mixture of different styles. Don't make all of it entirely consistent now, but at least try to make each function consistent within itself.
In particular, get rid of the convention of having braces hang outside of the alignment line.
Some functions have their whole content indented one character off compared to other functions; adjust those (but leave unchanged the functions that are self-consistent and match either of the common styles).
Also consistently use lowercase vector element specifiers on aarch64.
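
As a hypothetical illustration (made up for this message, not copied from the diff), aarch64 lines with a hanging brace and uppercase element specifiers such as

        ld1            {v0.8H}, [x0]
        add             v0.8H, v0.8H, v1.8H

would become

        ld1             {v0.8h}, [x0]
        add             v0.8h, v0.8h, v1.8h

The column widths shown here are an assumption; the actual alignment column differs between files.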