target/arm: Implement bfloat16 dot product (indexed)