A.5.298 SQRTPS: Packed Single-Precision FP Square Root
SQRTPS xmm1,xmm2/m128 ; 0F 51 /r [KATMAI,SSE]
SQRTPS calculates the square root of the packed single-precision FP
value from the source operand, and stores the single-precision results
in the destination register.