Another minor improvement: better pipelining and one fewer register used in vector...