SIMD ISAReturn TypeNameArgumentsInstruction Group
Neonfloat32x2_tvfmlal_low_f16(float32x2_t r, float16x4_t a, float16x4_t b)Vector arithmetic / Multiply / Fused multiply-accumulate
Description
Floating-point fused Multiply-Add Long to accumulator (vector). This instruction multiplies corresponding half-precision floating-point values in the vectors in the two source SIMD&FP registers, and accumulates the product to the corresponding vector element of the destination SIMD&FP register. The instruction does not round the result of the multiply before the accumulation.
Results
Vd.2S result
This intrinsic compiles to the following instructions:

FMLAL Vd.2S,Vn.2H,Vm.2H

Argument Preparation
r register: Vd.2Sa b
Architectures
A32, A64

Operation

CheckFPAdvSIMDEnabled64();
bits(datasize DIV 2) operand1 = Vpart[n,part];
bits(datasize DIV 2) operand2 = Vpart[m,part];
bits(datasize) operand3 = V[d];
bits(datasize) result;
bits(esize DIV 2) element1;
bits(esize DIV 2) element2;

for e = 0 to elements-1
    element1 = Elem[operand1, e, esize DIV 2];
    element2 = Elem[operand2, e, esize DIV 2];
    if sub_op then element1 = FPNeg(element1);
    Elem[result,e,esize] = FPMulAddH(Elem[operand3, e, esize], element1, element2, FPCR[]);
V[d] = result;