SVML-Special Math Functions-XMM#

_mm_svml_ceil_ph#

Tech:

SVML

Category:

Special Math Functions

Header:

immintrin.h

Searchable:

SVML-Special Math Functions-XMM

Register:

XMM 128 bit

Return Type:

__m128h

Param Types:

__m128h a

Param ETypes:

FP16 a

__m128h _mm_svml_ceil_ph(__m128h a);

Intel Description

Round the packed half-precision (16-bit) floating-point elements in “a” up to an integer value, and store the results as packed half-precision floating-point elements in “dst”.

Intel Implementation Psudeo-Code

FOR j := 0 to 7
        i := j*16
        dst[i+15:i] := CEIL(a[i+15:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_floor_ph#

Tech:

SVML

Category:

Special Math Functions

Header:

immintrin.h

Searchable:

SVML-Special Math Functions-XMM

Register:

XMM 128 bit

Return Type:

__m128h

Param Types:

__m128h a

Param ETypes:

FP16 a

__m128h _mm_svml_floor_ph(__m128h a);

Intel Description

Round the packed half-precision (16-bit) floating-point elements in “a” down to an integer value, and store the results as packed half-precision floating-point elements in “dst”.

Intel Implementation Psudeo-Code

FOR j := 0 to 7
        i := j*16
        dst[i+15:i] := FLOOR(a[i+15:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_round_ph#

Tech:

SVML

Category:

Special Math Functions

Header:

immintrin.h

Searchable:

SVML-Special Math Functions-XMM

Register:

XMM 128 bit

Return Type:

__m128h

Param Types:

__m128h a

Param ETypes:

FP16 a

__m128h _mm_svml_round_ph(__m128h a);

Intel Description

Round the packed half-precision (16-bit) floating-point elements in “a” to the nearest integer value, and store the results as packed half-precision floating-point elements in “dst”.

Intel Implementation Psudeo-Code

FOR j := 0 to 7
        i := j*16
        dst[i+15:i] := ROUND(a[i+15:i])
ENDFOR
dst[MAX:128] := 0

_mm_trunc_ph#

Tech:

SVML

Category:

Special Math Functions

Header:

immintrin.h

Searchable:

SVML-Special Math Functions-XMM

Register:

XMM 128 bit

Return Type:

__m128h

Param Types:

__m128h a

Param ETypes:

FP16 a

__m128h _mm_trunc_ph(__m128h a);

Intel Description

Truncate the packed half-precision (16-bit) floating-point elements in “a”, and store the results as packed half-precision floating-point elements in “dst”.

Intel Implementation Psudeo-Code

FOR j := 0 to 7
        i := j*16
        dst[i+15:i] := TRUNCATE(a[i+15:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_ceil_pd#

Tech:

SVML

Category:

Special Math Functions

Header:

immintrin.h

Searchable:

SVML-Special Math Functions-XMM

Register:

XMM 128 bit

Return Type:

__m128d

Param Types:

__m128d a

Param ETypes:

FP64 a

__m128d _mm_svml_ceil_pd(__m128d a);

Intel Description

Round the packed double-precision (64-bit) floating-point elements in “a” up to an integer value, and store the results as packed double-precision floating-point elements in “dst”. This intrinsic may generate the “roundpd”/”vroundpd” instruction.

Intel Implementation Psudeo-Code

FOR j := 0 to 1
        i := j*64
        dst[i+63:i] := CEIL(a[i+63:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_ceil_ps#

Tech:

SVML

Category:

Special Math Functions

Header:

immintrin.h

Searchable:

SVML-Special Math Functions-XMM

Register:

XMM 128 bit

Return Type:

__m128

Param Types:

__m128 a

Param ETypes:

FP32 a

__m128 _mm_svml_ceil_ps(__m128 a);

Intel Description

Round the packed single-precision (32-bit) floating-point elements in “a” up to an integer value, and store the results as packed single-precision floating-point elements in “dst”. This intrinsic may generate the “roundps”/”vroundps” instruction.

Intel Implementation Psudeo-Code

FOR j := 0 to 3
        i := j*32
        dst[i+31:i] := CEIL(a[i+31:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_floor_pd#

Tech:

SVML

Category:

Special Math Functions

Header:

immintrin.h

Searchable:

SVML-Special Math Functions-XMM

Register:

XMM 128 bit

Return Type:

__m128d

Param Types:

__m128d a

Param ETypes:

FP64 a

__m128d _mm_svml_floor_pd(__m128d a);

Intel Description

Round the packed double-precision (64-bit) floating-point elements in “a” down to an integer value, and store the results as packed double-precision floating-point elements in “dst”. This intrinsic may generate the “roundpd”/”vroundpd” instruction.

Intel Implementation Psudeo-Code

FOR j := 0 to 1
        i := j*64
        dst[i+63:i] := FLOOR(a[i+63:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_floor_ps#

Tech:

SVML

Category:

Special Math Functions

Header:

immintrin.h

Searchable:

SVML-Special Math Functions-XMM

Register:

XMM 128 bit

Return Type:

__m128

Param Types:

__m128 a

Param ETypes:

FP32 a

__m128 _mm_svml_floor_ps(__m128 a);

Intel Description

Round the packed single-precision (32-bit) floating-point elements in “a” down to an integer value, and store the results as packed single-precision floating-point elements in “dst”. This intrinsic may generate the “roundps”/”vroundps” instruction.

Intel Implementation Psudeo-Code

FOR j := 0 to 3
        i := j*32
        dst[i+31:i] := FLOOR(a[i+31:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_round_pd#

Tech:

SVML

Category:

Special Math Functions

Header:

immintrin.h

Searchable:

SVML-Special Math Functions-XMM

Register:

XMM 128 bit

Return Type:

__m128d

Param Types:

__m128d a

Param ETypes:

FP64 a

__m128d _mm_svml_round_pd(__m128d a);

Intel Description

Round the packed double-precision (64-bit) floating-point elements in “a” to the nearest integer value, and store the results as packed double-precision floating-point elements in “dst”. This intrinsic may generate the “roundpd”/”vroundpd” instruction.

Intel Implementation Psudeo-Code

FOR j := 0 to 1
        i := j*64
        dst[i+63:i] := ROUND(a[i+63:i])
ENDFOR
dst[MAX:128] := 0

_mm_svml_round_ps#

Tech:

SVML

Category:

Special Math Functions

Header:

immintrin.h

Searchable:

SVML-Special Math Functions-XMM

Register:

XMM 128 bit

Return Type:

__m128

Param Types:

__m128 a

Param ETypes:

FP32 a

__m128 _mm_svml_round_ps(__m128 a);

Intel Description

Round the packed single-precision (32-bit) floating-point elements in “a” to the nearest integer value, and store the results as packed single-precision floating-point elements in “dst”. This intrinsic may generate the “roundps”/”vroundps” instruction.

Intel Implementation Psudeo-Code

FOR j := 0 to 3
        i := j*32
        dst[i+31:i] := ROUND(a[i+31:i])
ENDFOR
dst[MAX:128] := 0