This cleans up the kernel executable: bootloaders no longer have to load the AP init code directly at 0xF000. Instead, the kernel moves it there when it performs AP initialization.
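For illustration, a minimal sketch of such a relocation step, assuming a C kernel. The names `ap_trampoline_relocate` and `ap_sipi_vector` are hypothetical and not taken from this change. The second helper reflects why a fixed, page-aligned low address like 0xF000 is needed on x86: a startup IPI carries an 8-bit vector, and the AP starts executing in real mode at physical address vector * 0x1000.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Physical address the AP init code has to run from (value from the text). */
#define AP_TRAMPOLINE_PHYS 0xF000u

/*
 * Hypothetical helper: copy the AP init blob from wherever the bootloader
 * placed it to its final location. `dest` is a parameter so the function
 * can be exercised against an ordinary buffer; a kernel would pass a
 * mapping of AP_TRAMPOLINE_PHYS here.
 * Returns the number of bytes copied.
 */
static size_t ap_trampoline_relocate(void *dest, const void *start,
                                     const void *end)
{
    size_t len = (size_t)((const uint8_t *)end - (const uint8_t *)start);
    memmove(dest, start, len);
    return len;
}

/*
 * Hypothetical helper: derive the startup IPI vector from the trampoline
 * address. The AP begins execution at vector * 0x1000, so the address
 * must be 4 KiB aligned and below 1 MiB.
 */
static uint8_t ap_sipi_vector(uint32_t phys)
{
    assert((phys & 0xFFFu) == 0);
    assert(phys < 0x100000u);
    return (uint8_t)(phys >> 12);
}
```

With the address above, `ap_sipi_vector(AP_TRAMPOLINE_PHYS)` yields 0x0F.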
The 32-bit code did not even support it, and the 64-bit code did not align the stack.
I don't really want to work with i386, since it doesn't support the compare-exchange (CMPXCHG) instruction.
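As a sketch of what depends on that instruction, here is a try-lock built on a C11 compare-exchange. The `spinlock_t` type and function names are hypothetical and not part of this change. Compilers targeting i486 or later typically lower the compare-exchange call to `lock cmpxchg`, which a plain i386 lacks.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical lock type: 0 means free, 1 means held. */
typedef struct {
    atomic_int locked;
} spinlock_t;

/*
 * Try to take the lock once. The compare-exchange atomically replaces
 * 0 with 1 and fails, leaving the value untouched, if the lock is held.
 */
static bool spin_trylock(spinlock_t *l)
{
    int expected = 0;
    return atomic_compare_exchange_strong_explicit(
        &l->locked, &expected, 1,
        memory_order_acquire, memory_order_relaxed);
}

/* Release the lock so that a later spin_trylock can succeed. */
static void spin_unlock(spinlock_t *l)
{
    atomic_store_explicit(&l->locked, 0, memory_order_release);
}
```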