Blackfin arch: Faster Implementation of csum_tcpudp_nofold()