[ARM] Improve csum_fold, cleanup csum_tcpudp_magic()