[ARM][2/4] Fix operand costing logic for SMUL[TB][TB]