Rearrange loop structure for approx. 35-50% faster calc_transform_coeffs_cpl()