Fixed nbnxn_4xN performance regression