Gigabeat S: Reduce stalling in the ARMv6 IDCT. Also save one instruction per loop...