vbfdotq_f32
SIMD ISA | Return Type | Name | Arguments | Instruction Group |
---|---|---|---|---|
Neon | float32x4_t | vbfdotq_f32 | (float32x4_t r, bfloat16x8_t a, bfloat16x8_t b) | Vector arithmetic / Dot product |
Description
BFloat16 floating-point dot product (vector). This instruction delimits the source vectors into pairs of 16-bit BF16 elements. Within each pair, the elements in the first source vector are multiplied by the corresponding elements in the second source vector. The resulting single-precision products are then summed and added destructively to the single-precision element of the destination vector that aligns with the pair of BF16 values in the first source vector. The instruction ignores the FPCR and does not update the FPSR exception status.

Results
Vd.4S → result

This intrinsic compiles to the following instructions:
BFDOT

Argument Preparation
r → register: Vd.4S
a → register: Vn.8H
b → register: Vm.8H

Architectures
A32, A64

Operation
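The lane arithmetic described above can be sketched as a portable scalar model. This is an illustrative sketch only, not Arm's official pseudocode. The names `bf16_bits_to_f32` and `bfdot_reference` are hypothetical, BF16 values are passed as raw `uint16_t` bit patterns, and the intermediate rounding and denormal handling of the real BFDOT instruction are not modelled, so results may differ from hardware in the last bits.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: widen a raw BF16 bit pattern to a float.
 * BF16 occupies the upper 16 bits of an IEEE-754 binary32 value,
 * so widening is a 16-bit left shift of the bit pattern. */
static float bf16_bits_to_f32(uint16_t bits)
{
    uint32_t wide = (uint32_t)bits << 16;
    float out;
    memcpy(&out, &wide, sizeof out);  /* bit copy, avoids aliasing issues */
    return out;
}

/* Hypothetical scalar model of the lane layout (assumption: ordinary
 * C float arithmetic stands in for the instruction's own rounding).
 * Each 32-bit lane of r accumulates the sum of two BF16 products taken
 * from the pair of 16-bit elements aligned with that lane. */
static void bfdot_reference(float r[4], const uint16_t a[8], const uint16_t b[8])
{
    for (int lane = 0; lane < 4; ++lane) {
        float p0 = bf16_bits_to_f32(a[2 * lane])     * bf16_bits_to_f32(b[2 * lane]);
        float p1 = bf16_bits_to_f32(a[2 * lane + 1]) * bf16_bits_to_f32(b[2 * lane + 1]);
        r[lane] += p0 + p1;  /* destructive accumulate into the destination lane */
    }
}
```

On a target that implements the BF16 extension, the equivalent call is `vbfdotq_f32(r, a, b)` from `<arm_neon.h>`, with `r` as the accumulator and `a`, `b` as the two `bfloat16x8_t` sources. The scalar model is mainly useful for checking which input elements feed which output lane.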
Copyright © 1995-2025 Arm Limited (or its affiliates). All rights reserved.