crypto: serpent - add 8-way parallel x86_64/SSE2 assembler implementation