SVML-Special Math Functions-XMM#

_mm_svml_ceil_ph#

Tech:: SVML
Category:: Special Math Functions
Header:: immintrin.h
Searchable:: SVML-Special Math Functions-XMM
Register:: XMM 128 bit
Return Type:: __m128h
Param Types:: __m128h a
Param ETypes:: FP16 a

__m128h _mm_svml_ceil_ph(__m128h a);

Intel Description

Round the packed half-precision (16-bit) floating-point elements in “a” up to an integer value, and store the results as packed half-precision floating-point elements in “dst”.

Intel Implementation Psudeo-Code

FOR j := 0 to 7
        i := j*16
        dst[i+15:i] := CEIL(a[i+15:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_floor_ph#

Tech:: SVML
Category:: Special Math Functions
Header:: immintrin.h
Searchable:: SVML-Special Math Functions-XMM
Register:: XMM 128 bit
Return Type:: __m128h
Param Types:: __m128h a
Param ETypes:: FP16 a

__m128h _mm_svml_floor_ph(__m128h a);

Intel Description

Round the packed half-precision (16-bit) floating-point elements in “a” down to an integer value, and store the results as packed half-precision floating-point elements in “dst”.

Intel Implementation Psudeo-Code

FOR j := 0 to 7
        i := j*16
        dst[i+15:i] := FLOOR(a[i+15:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_round_ph#

Tech:: SVML
Category:: Special Math Functions
Header:: immintrin.h
Searchable:: SVML-Special Math Functions-XMM
Register:: XMM 128 bit
Return Type:: __m128h
Param Types:: __m128h a
Param ETypes:: FP16 a

__m128h _mm_svml_round_ph(__m128h a);

Intel Description

Round the packed half-precision (16-bit) floating-point elements in “a” to the nearest integer value, and store the results as packed half-precision floating-point elements in “dst”.

Intel Implementation Psudeo-Code

FOR j := 0 to 7
        i := j*16
        dst[i+15:i] := ROUND(a[i+15:i])
ENDFOR
dst[MAX:128] := 0

_mm_trunc_ph#

Tech:: SVML
Category:: Special Math Functions
Header:: immintrin.h
Searchable:: SVML-Special Math Functions-XMM
Register:: XMM 128 bit
Return Type:: __m128h
Param Types:: __m128h a
Param ETypes:: FP16 a

__m128h _mm_trunc_ph(__m128h a);

Intel Description

Truncate the packed half-precision (16-bit) floating-point elements in “a”, and store the results as packed half-precision floating-point elements in “dst”.

Intel Implementation Psudeo-Code

FOR j := 0 to 7
        i := j*16
        dst[i+15:i] := TRUNCATE(a[i+15:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_ceil_pd#

Tech:: SVML
Category:: Special Math Functions
Header:: immintrin.h
Searchable:: SVML-Special Math Functions-XMM
Register:: XMM 128 bit
Return Type:: __m128d
Param Types:: __m128d a
Param ETypes:: FP64 a

__m128d _mm_svml_ceil_pd(__m128d a);

Intel Description

Round the packed double-precision (64-bit) floating-point elements in “a” up to an integer value, and store the results as packed double-precision floating-point elements in “dst”. This intrinsic may generate the “roundpd”/”vroundpd” instruction.

Intel Implementation Psudeo-Code

FOR j := 0 to 1
        i := j*64
        dst[i+63:i] := CEIL(a[i+63:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_ceil_ps#

Tech:: SVML
Category:: Special Math Functions
Header:: immintrin.h
Searchable:: SVML-Special Math Functions-XMM
Register:: XMM 128 bit
Return Type:: __m128
Param Types:: __m128 a
Param ETypes:: FP32 a

__m128 _mm_svml_ceil_ps(__m128 a);

Intel Description

Round the packed single-precision (32-bit) floating-point elements in “a” up to an integer value, and store the results as packed single-precision floating-point elements in “dst”. This intrinsic may generate the “roundps”/”vroundps” instruction.

Intel Implementation Psudeo-Code

FOR j := 0 to 3
        i := j*32
        dst[i+31:i] := CEIL(a[i+31:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_floor_pd#

Tech:: SVML
Category:: Special Math Functions
Header:: immintrin.h
Searchable:: SVML-Special Math Functions-XMM
Register:: XMM 128 bit
Return Type:: __m128d
Param Types:: __m128d a
Param ETypes:: FP64 a

__m128d _mm_svml_floor_pd(__m128d a);

Intel Description

Round the packed double-precision (64-bit) floating-point elements in “a” down to an integer value, and store the results as packed double-precision floating-point elements in “dst”. This intrinsic may generate the “roundpd”/”vroundpd” instruction.

Intel Implementation Psudeo-Code

FOR j := 0 to 1
        i := j*64
        dst[i+63:i] := FLOOR(a[i+63:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_floor_ps#

Tech:: SVML
Category:: Special Math Functions
Header:: immintrin.h
Searchable:: SVML-Special Math Functions-XMM
Register:: XMM 128 bit
Return Type:: __m128
Param Types:: __m128 a
Param ETypes:: FP32 a

__m128 _mm_svml_floor_ps(__m128 a);

Intel Description

Round the packed single-precision (32-bit) floating-point elements in “a” down to an integer value, and store the results as packed single-precision floating-point elements in “dst”. This intrinsic may generate the “roundps”/”vroundps” instruction.

Intel Implementation Psudeo-Code

FOR j := 0 to 3
        i := j*32
        dst[i+31:i] := FLOOR(a[i+31:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_round_pd#

Tech:: SVML
Category:: Special Math Functions
Header:: immintrin.h
Searchable:: SVML-Special Math Functions-XMM
Register:: XMM 128 bit
Return Type:: __m128d
Param Types:: __m128d a
Param ETypes:: FP64 a

__m128d _mm_svml_round_pd(__m128d a);

Intel Description

Round the packed double-precision (64-bit) floating-point elements in “a” to the nearest integer value, and store the results as packed double-precision floating-point elements in “dst”. This intrinsic may generate the “roundpd”/”vroundpd” instruction.

Intel Implementation Psudeo-Code

FOR j := 0 to 1
        i := j*64
        dst[i+63:i] := ROUND(a[i+63:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_round_ps#

Tech:: SVML
Category:: Special Math Functions
Header:: immintrin.h
Searchable:: SVML-Special Math Functions-XMM
Register:: XMM 128 bit
Return Type:: __m128
Param Types:: __m128 a
Param ETypes:: FP32 a

__m128 _mm_svml_round_ps(__m128 a);

Intel Description

Round the packed single-precision (32-bit) floating-point elements in “a” to the nearest integer value, and store the results as packed single-precision floating-point elements in “dst”. This intrinsic may generate the “roundps”/”vroundps” instruction.

Intel Implementation Psudeo-Code

FOR j := 0 to 3
        i := j*32
        dst[i+31:i] := ROUND(a[i+31:i])
ENDFOR
dst[MAX:128] := 0

SVML-Special Math Functions-XMM

Contents

SVML-Special Math Functions-XMM#

_mm_svml_ceil_ph#

_mm_svml_floor_ph#

_mm_svml_round_ph#

_mm_trunc_ph#

_mm_svml_ceil_pd#

_mm_svml_ceil_ps#

_mm_svml_floor_pd#

_mm_svml_floor_ps#

_mm_svml_round_pd#

_mm_svml_round_ps#