Collaboration diagram for SIMD intrinsics interface (simd):

Description

Provides an architecture-independent way of doing SIMD coding.

Overview of the SIMD implementation is provided in Single-instruction Multiple-data (SIMD) coding. The details are documented in gromacs/simd/simd.h and the reference implementation impl_reference.h.

Author: Erik Lindahl erik..nosp@m.lind.nosp@m.ahl@s.nosp@m.cili.nosp@m.felab.nosp@m..se

Namespaces
	gmx
	Generic GROMACS namespace.

SIMD implementation capability definitions
#define	GMX_SIMD 1
	1 if any SIMD support is present, otherwise 0.

#define	GMX_SIMD_HAVE_FLOAT 1
	1 when SIMD float support is present, otherwise 0 More...

#define	GMX_SIMD_HAVE_DOUBLE 1
	1 if SIMD double support is present, otherwise 0

#define	GMX_SIMD_HAVE_LOADU 1
	1 if the SIMD implementation supports unaligned loads, otherwise 0

#define	GMX_SIMD_HAVE_STOREU 1
	1 if the SIMD implementation supports unaligned stores, otherwise 0

#define	GMX_SIMD_HAVE_FMA 0
	1 if the SIMD implementation has fused-multiply add hardware More...

#define	GMX_SIMD_HAVE_LOGICAL 1
	1 if SIMD impl has logical operations on floating-point data, otherwise 0

#define	GMX_SIMD_HAVE_FINT32_EXTRACT 1
	Support for extracting integers from gmx::SimdFInt32 (1/0 for present/absent)

#define	GMX_SIMD_HAVE_FINT32_LOGICAL 1
	1 if SIMD logical ops are supported for gmx::SimdFInt32, otherwise 0

#define	GMX_SIMD_HAVE_FINT32_ARITHMETICS 1
	1 if SIMD arithmetic ops are supported for gmx::SimdFInt32, otherwise 0

#define	GMX_SIMD_HAVE_DINT32_EXTRACT 1
	Support for extracting integer from gmx::SimdDInt32 (1/0 for present/absent)

#define	GMX_SIMD_HAVE_DINT32_LOGICAL 1
	1 if logical operations are supported for gmx::SimdDInt32, otherwise 0

#define	GMX_SIMD_HAVE_DINT32_ARITHMETICS 1
	1 if SIMD arithmetic ops are supported for gmx::SimdDInt32, otherwise 0

#define	GMX_SIMD_HAVE_NATIVE_COPYSIGN_FLOAT 0
	1 if implementation provides single precision copysign() More...

#define	GMX_SIMD_HAVE_NATIVE_RSQRT_ITER_FLOAT 0
	1 if implementation provides single precision 1/sqrt(x) N-R iterations faster than simd_math.h More...

#define	GMX_SIMD_HAVE_NATIVE_RCP_ITER_FLOAT 0
	1 if implementation provides single precision 1/x N-R iterations faster than simd_math.h More...

#define	GMX_SIMD_HAVE_NATIVE_LOG_FLOAT 0
	1 if implementation provides single precision log() faster than simd_math.h More...

#define	GMX_SIMD_HAVE_NATIVE_EXP2_FLOAT 0
	1 if implementation provides single precision exp2() faster than simd_math.h More...

#define	GMX_SIMD_HAVE_NATIVE_EXP_FLOAT 0
	1 if implementation provides single precision exp() faster than simd_math.h More...

#define	GMX_SIMD_HAVE_NATIVE_COPYSIGN_DOUBLE 0
	1 if implementation provides double precision copysign() More...

#define	GMX_SIMD_HAVE_NATIVE_RSQRT_ITER_DOUBLE 0
	1 if implementation provides double precision 1/sqrt(x) N-R iterations faster than simd_math.h More...

#define	GMX_SIMD_HAVE_NATIVE_RCP_ITER_DOUBLE 0
	1 if implementation provides double precision 1/x N-R iterations faster than simd_math.h More...

#define	GMX_SIMD_HAVE_NATIVE_LOG_DOUBLE 0
	1 if implementation provides double precision log() faster than simd_math.h More...

#define	GMX_SIMD_HAVE_NATIVE_EXP2_DOUBLE 0
	1 if implementation provides double precision exp2() faster than simd_math.h More...

#define	GMX_SIMD_HAVE_NATIVE_EXP_DOUBLE 0
	1 if implementation provides double precision exp() faster than simd_math.h More...

#define	GMX_SIMD_HAVE_GATHER_LOADU_BYSIMDINT_TRANSPOSE_FLOAT 1
	1 if gmx::gatherLoadUBySimdIntTranspose is present, otherwise 0

#define	GMX_SIMD_HAVE_GATHER_LOADU_BYSIMDINT_TRANSPOSE_DOUBLE 1
	1 if gmx::gatherLoadUBySimdIntTranspose is present, otherwise 0

#define	GMX_SIMD_HAVE_HSIMD_UTIL_FLOAT 1
	1 if float half-register load/store/reduce utils present, otherwise 0

#define	GMX_SIMD_HAVE_HSIMD_UTIL_DOUBLE 1
	1 if double half-register load/store/reduce utils present, otherwise 0

#define	GMX_SIMD_FLOAT_WIDTH 4
	Width of the gmx::SimdFloat datatype.

#define	GMX_SIMD_DOUBLE_WIDTH 4
	Width of the gmx::SimdDouble datatype.

#define	GMX_SIMD_HAVE_4NSIMD_UTIL_FLOAT 1
	1 if float 4xN load utils present, otherwise 0

#define	GMX_SIMD_HAVE_4NSIMD_UTIL_DOUBLE 1
	1 if double 4xN load utils present, otherwise 0

#define	GMX_SIMD4_HAVE_FLOAT 1
	1 if implementation provides gmx::Simd4Float, otherwise 0.

#define	GMX_SIMD4_HAVE_DOUBLE 1
	1 if the implementation provides gmx::Simd4Double, otherwise 0.

#define	GMX_SIMD_FINT32_WIDTH GMX_SIMD_FLOAT_WIDTH
	Width of the gmx::SimdFInt32 datatype.

#define	GMX_SIMD_DINT32_WIDTH GMX_SIMD_DOUBLE_WIDTH
	Width of the gmx::SimdDInt32 datatype.

#define	GMX_SIMD4_WIDTH 4
	The SIMD4 type is always four units wide, but this makes code more explicit.

#define	GMX_SIMD_ALIGNMENT 8
	Required alignment in bytes for aligned load/store (always defined, even without SIMD)

#define	GMX_SIMD_RSQRT_BITS 23
	Accuracy of SIMD 1/sqrt(x) lookup. Used to determine number of iterations.

#define	GMX_SIMD_RCP_BITS 23
	Accuracy of SIMD 1/x lookup. Used to determine number of iterations.

Constant width-4 double precision SIMD types and instructions
static Simd4Double gmx_simdcall	gmx::load4 (const double *m)
	Load 4 double values from aligned memory into SIMD4 variable. More...

static void gmx_simdcall	gmx::store4 (double *m, Simd4Double a)
	Store the contents of SIMD4 double to aligned memory m. More...

static Simd4Double gmx_simdcall	gmx::load4U (const double *m)
	Load SIMD4 double from unaligned memory. More...

static void gmx_simdcall	gmx::store4U (double *m, Simd4Double a)
	Store SIMD4 double to unaligned memory. More...

static Simd4Double gmx_simdcall	gmx::simd4SetZeroD ()
	Set all SIMD4 double elements to 0. More...

static Simd4Double gmx_simdcall	gmx::operator& (Simd4Double a, Simd4Double b)
	Bitwise and for two SIMD4 double variables. More...

static Simd4Double gmx_simdcall	gmx::andNot (Simd4Double a, Simd4Double b)
	Bitwise andnot for two SIMD4 double variables. c=(~a) & b. More...

static Simd4Double gmx_simdcall	gmx::operator\| (Simd4Double a, Simd4Double b)
	Bitwise or for two SIMD4 doubles. More...

static Simd4Double gmx_simdcall	gmx::operator^ (Simd4Double a, Simd4Double b)
	Bitwise xor for two SIMD4 double variables. More...

static Simd4Double gmx_simdcall	gmx::operator+ (Simd4Double a, Simd4Double b)
	Add two double SIMD4 variables. More...

static Simd4Double gmx_simdcall	gmx::operator- (Simd4Double a, Simd4Double b)
	Subtract two SIMD4 variables. More...

static Simd4Double gmx_simdcall	gmx::operator- (Simd4Double a)
	SIMD4 floating-point negate. More...

static Simd4Double gmx_simdcall	gmx::operator* (Simd4Double a, Simd4Double b)
	Multiply two SIMD4 variables. More...

static Simd4Double gmx_simdcall	gmx::fma (Simd4Double a, Simd4Double b, Simd4Double c)
	SIMD4 Fused-multiply-add. Result is a*b+c. More...

static Simd4Double gmx_simdcall	gmx::fms (Simd4Double a, Simd4Double b, Simd4Double c)
	SIMD4 Fused-multiply-subtract. Result is a*b-c. More...

static Simd4Double gmx_simdcall	gmx::fnma (Simd4Double a, Simd4Double b, Simd4Double c)
	SIMD4 Fused-negated-multiply-add. Result is -a*b+c. More...

static Simd4Double gmx_simdcall	gmx::fnms (Simd4Double a, Simd4Double b, Simd4Double c)
	SIMD4 Fused-negated-multiply-subtract. Result is -a*b-c. More...

static Simd4Double gmx_simdcall	gmx::rsqrt (Simd4Double x)
	SIMD4 1.0/sqrt(x) lookup. More...

static Simd4Double gmx_simdcall	gmx::abs (Simd4Double a)
	SIMD4 Floating-point abs(). More...

static Simd4Double gmx_simdcall	gmx::max (Simd4Double a, Simd4Double b)
	Set each SIMD4 element to the largest from two variables. More...

static Simd4Double gmx_simdcall	gmx::min (Simd4Double a, Simd4Double b)
	Set each SIMD4 element to the largest from two variables. More...

static Simd4Double gmx_simdcall	gmx::round (Simd4Double a)
	SIMD4 Round to nearest integer value (in floating-point format). More...

static Simd4Double gmx_simdcall	gmx::trunc (Simd4Double a)
	Truncate SIMD4, i.e. round towards zero - common hardware instruction. More...

static double gmx_simdcall	gmx::dotProduct (Simd4Double a, Simd4Double b)
	Return dot product of two double precision SIMD4 variables. More...

static void gmx_simdcall	gmx::transpose (Simd4Double v0, Simd4Double v1, Simd4Double v2, Simd4Double v3)
	SIMD4 double transpose. More...

static Simd4DBool gmx_simdcall	gmx::operator== (Simd4Double a, Simd4Double b)
	a==b for SIMD4 double More...

static Simd4DBool gmx_simdcall	gmx::operator!= (Simd4Double a, Simd4Double b)
	a!=b for SIMD4 double More...

static Simd4DBool gmx_simdcall	gmx::operator< (Simd4Double a, Simd4Double b)
	a<b for SIMD4 double More...

static Simd4DBool gmx_simdcall	gmx::operator<= (Simd4Double a, Simd4Double b)
	a<=b for SIMD4 double. More...

static Simd4DBool gmx_simdcall	gmx::operator&& (Simd4DBool a, Simd4DBool b)
	Logical and on single precision SIMD4 booleans. More...

static Simd4DBool gmx_simdcall	gmx::operator\|\| (Simd4DBool a, Simd4DBool b)
	Logical or on single precision SIMD4 booleans. More...

static bool gmx_simdcall	gmx::anyTrue (Simd4DBool a)
	Returns non-zero if any of the boolean in SIMD4 a is True, otherwise 0. More...

static Simd4Double gmx_simdcall	gmx::selectByMask (Simd4Double a, Simd4DBool mask)
	Select from single precision SIMD4 variable where boolean is true. More...

static Simd4Double gmx_simdcall	gmx::selectByNotMask (Simd4Double a, Simd4DBool mask)
	Select from single precision SIMD4 variable where boolean is false. More...

static Simd4Double gmx_simdcall	gmx::blend (Simd4Double a, Simd4Double b, Simd4DBool sel)
	Vector-blend SIMD4 selection. More...

static double gmx_simdcall	gmx::reduce (Simd4Double a)
	Return sum of all elements in SIMD4 double variable. More...

Constant width-4 single precision SIMD types and instructions
static Simd4Float gmx_simdcall	gmx::load4 (const float *m)
	Load 4 float values from aligned memory into SIMD4 variable. More...

static void gmx_simdcall	gmx::store4 (float *m, Simd4Float a)
	Store the contents of SIMD4 float to aligned memory m. More...

static Simd4Float gmx_simdcall	gmx::load4U (const float *m)
	Load SIMD4 float from unaligned memory. More...

static void gmx_simdcall	gmx::store4U (float *m, Simd4Float a)
	Store SIMD4 float to unaligned memory. More...

static Simd4Float gmx_simdcall	gmx::simd4SetZeroF ()
	Set all SIMD4 float elements to 0. More...

static Simd4Float gmx_simdcall	gmx::operator& (Simd4Float a, Simd4Float b)
	Bitwise and for two SIMD4 float variables. More...

static Simd4Float gmx_simdcall	gmx::andNot (Simd4Float a, Simd4Float b)
	Bitwise andnot for two SIMD4 float variables. c=(~a) & b. More...

static Simd4Float gmx_simdcall	gmx::operator\| (Simd4Float a, Simd4Float b)
	Bitwise or for two SIMD4 floats. More...

static Simd4Float gmx_simdcall	gmx::operator^ (Simd4Float a, Simd4Float b)
	Bitwise xor for two SIMD4 float variables. More...

static Simd4Float gmx_simdcall	gmx::operator+ (Simd4Float a, Simd4Float b)
	Add two float SIMD4 variables. More...

static Simd4Float gmx_simdcall	gmx::operator- (Simd4Float a, Simd4Float b)
	Subtract two SIMD4 variables. More...

static Simd4Float gmx_simdcall	gmx::operator- (Simd4Float a)
	SIMD4 floating-point negate. More...

static Simd4Float gmx_simdcall	gmx::operator* (Simd4Float a, Simd4Float b)
	Multiply two SIMD4 variables. More...

static Simd4Float gmx_simdcall	gmx::fma (Simd4Float a, Simd4Float b, Simd4Float c)
	SIMD4 Fused-multiply-add. Result is a*b+c. More...

static Simd4Float gmx_simdcall	gmx::fms (Simd4Float a, Simd4Float b, Simd4Float c)
	SIMD4 Fused-multiply-subtract. Result is a*b-c. More...

static Simd4Float gmx_simdcall	gmx::fnma (Simd4Float a, Simd4Float b, Simd4Float c)
	SIMD4 Fused-negated-multiply-add. Result is -a*b+c. More...

static Simd4Float gmx_simdcall	gmx::fnms (Simd4Float a, Simd4Float b, Simd4Float c)
	SIMD4 Fused-negated-multiply-subtract. Result is -a*b-c. More...

static Simd4Float gmx_simdcall	gmx::rsqrt (Simd4Float x)
	SIMD4 1.0/sqrt(x) lookup. More...

static Simd4Float gmx_simdcall	gmx::abs (Simd4Float a)
	SIMD4 Floating-point fabs(). More...

static Simd4Float gmx_simdcall	gmx::max (Simd4Float a, Simd4Float b)
	Set each SIMD4 element to the largest from two variables. More...

static Simd4Float gmx_simdcall	gmx::min (Simd4Float a, Simd4Float b)
	Set each SIMD4 element to the largest from two variables. More...

static Simd4Float gmx_simdcall	gmx::round (Simd4Float a)
	SIMD4 Round to nearest integer value (in floating-point format). More...

static Simd4Float gmx_simdcall	gmx::trunc (Simd4Float a)
	Truncate SIMD4, i.e. round towards zero - common hardware instruction. More...

static float gmx_simdcall	gmx::dotProduct (Simd4Float a, Simd4Float b)
	Return dot product of two single precision SIMD4 variables. More...

static void gmx_simdcall	gmx::transpose (Simd4Float v0, Simd4Float v1, Simd4Float v2, Simd4Float v3)
	SIMD4 float transpose. More...

static Simd4FBool gmx_simdcall	gmx::operator== (Simd4Float a, Simd4Float b)
	a==b for SIMD4 float More...

static Simd4FBool gmx_simdcall	gmx::operator!= (Simd4Float a, Simd4Float b)
	a!=b for SIMD4 float More...

static Simd4FBool gmx_simdcall	gmx::operator< (Simd4Float a, Simd4Float b)
	a<b for SIMD4 float More...

static Simd4FBool gmx_simdcall	gmx::operator<= (Simd4Float a, Simd4Float b)
	a<=b for SIMD4 float. More...

static Simd4FBool gmx_simdcall	gmx::operator&& (Simd4FBool a, Simd4FBool b)
	Logical and on single precision SIMD4 booleans. More...

static Simd4FBool gmx_simdcall	gmx::operator\|\| (Simd4FBool a, Simd4FBool b)
	Logical or on single precision SIMD4 booleans. More...

static bool gmx_simdcall	gmx::anyTrue (Simd4FBool a)
	Returns non-zero if any of the boolean in SIMD4 a is True, otherwise 0. More...

static Simd4Float gmx_simdcall	gmx::selectByMask (Simd4Float a, Simd4FBool mask)
	Select from single precision SIMD4 variable where boolean is true. More...

static Simd4Float gmx_simdcall	gmx::selectByNotMask (Simd4Float a, Simd4FBool mask)
	Select from single precision SIMD4 variable where boolean is false. More...

static Simd4Float gmx_simdcall	gmx::blend (Simd4Float a, Simd4Float b, Simd4FBool sel)
	Vector-blend SIMD4 selection. More...

static float gmx_simdcall	gmx::reduce (Simd4Float a)
	Return sum of all elements in SIMD4 float variable. More...

SIMD predefined macros to describe high-level capabilities
These macros are used to describe the features available in default Gromacs real precision. They are set from the lower-level implementation files that have macros describing single and double precision individually, as well as the implementation details.
#define	GMX_SIMD_HAVE_REAL GMX_SIMD_HAVE_FLOAT
	1 if SimdReal is available, otherwise 0. More...

#define	GMX_SIMD_REAL_WIDTH GMX_SIMD_FLOAT_WIDTH
	Width of SimdReal. More...

#define	GMX_SIMD_HAVE_INT32_EXTRACT GMX_SIMD_HAVE_FINT32_EXTRACT
	1 if support is available for extracting elements from SimdInt32, otherwise 0 More...

#define	GMX_SIMD_HAVE_INT32_LOGICAL GMX_SIMD_HAVE_FINT32_LOGICAL
	1 if logical ops are supported on SimdInt32, otherwise 0. More...

#define	GMX_SIMD_HAVE_INT32_ARITHMETICS GMX_SIMD_HAVE_FINT32_ARITHMETICS
	1 if arithmetic ops are supported on SimdInt32, otherwise 0. More...

#define	GMX_SIMD_HAVE_GATHER_LOADU_BYSIMDINT_TRANSPOSE_REAL GMX_SIMD_HAVE_GATHER_LOADU_BYSIMDINT_TRANSPOSE_FLOAT
	1 if gmx::simdGatherLoadUBySimdIntTranspose is present, otherwise 0 More...

#define	GMX_SIMD_HAVE_HSIMD_UTIL_REAL GMX_SIMD_HAVE_HSIMD_UTIL_FLOAT
	1 if real half-register load/store/reduce utils present, otherwise 0 More...

#define	GMX_SIMD4_HAVE_REAL GMX_SIMD4_HAVE_FLOAT
	1 if Simd4Real is available, otherwise 0. More...

Single precision SIMD math functions
Note In most cases you should use the real-precision functions instead.
static SimdFloat gmx_simdcall	gmx::copysign (SimdFloat x, SimdFloat y)
	Composes floating point value with the magnitude of x and the sign of y. More...

static SimdFloat gmx_simdcall	gmx::rsqrtIter (SimdFloat lu, SimdFloat x)
	Perform one Newton-Raphson iteration to improve 1/sqrt(x) for SIMD float. More...

static SimdFloat gmx_simdcall	gmx::invsqrt (SimdFloat x)
	Calculate 1/sqrt(x) for SIMD float. More...

static void gmx_simdcall	gmx::invsqrtPair (SimdFloat x0, SimdFloat x1, SimdFloat out0, SimdFloat out1)
	Calculate 1/sqrt(x) for two SIMD floats. More...

static SimdFloat gmx_simdcall	gmx::rcpIter (SimdFloat lu, SimdFloat x)
	Perform one Newton-Raphson iteration to improve 1/x for SIMD float. More...

static SimdFloat gmx_simdcall	gmx::inv (SimdFloat x)
	Calculate 1/x for SIMD float. More...

static SimdFloat gmx_simdcall	gmx::operator/ (SimdFloat nom, SimdFloat denom)
	Division for SIMD floats. More...

static SimdFloat	gmx::maskzInvsqrt (SimdFloat x, SimdFBool m)
	Calculate 1/sqrt(x) for masked entries of SIMD float. More...

static SimdFloat gmx_simdcall	gmx::maskzInv (SimdFloat x, SimdFBool m)
	Calculate 1/x for SIMD float, masked version. More...

template<MathOptimization opt = MathOptimization::Safe>
static SimdFloat gmx_simdcall	gmx::sqrt (SimdFloat x)
	Calculate sqrt(x) for SIMD floats. More...

static SimdFloat gmx_simdcall	gmx::log (SimdFloat x)
	SIMD float log(x). This is the natural logarithm. More...

template<MathOptimization opt = MathOptimization::Safe>
static SimdFloat gmx_simdcall	gmx::exp2 (SimdFloat x)
	SIMD float 2^x. More...

template<MathOptimization opt = MathOptimization::Safe>
static SimdFloat gmx_simdcall	gmx::exp (SimdFloat x)
	SIMD float exp(x). More...

static SimdFloat gmx_simdcall	gmx::erf (SimdFloat x)
	SIMD float erf(x). More...

static SimdFloat gmx_simdcall	gmx::erfc (SimdFloat x)
	SIMD float erfc(x). More...

static void gmx_simdcall	gmx::sincos (SimdFloat x, SimdFloat sinval, SimdFloat cosval)
	SIMD float sin & cos. More...

static SimdFloat gmx_simdcall	gmx::sin (SimdFloat x)
	SIMD float sin(x). More...

static SimdFloat gmx_simdcall	gmx::cos (SimdFloat x)
	SIMD float cos(x). More...

static SimdFloat gmx_simdcall	gmx::tan (SimdFloat x)
	SIMD float tan(x). More...

static SimdFloat gmx_simdcall	gmx::asin (SimdFloat x)
	SIMD float asin(x). More...

static SimdFloat gmx_simdcall	gmx::acos (SimdFloat x)
	SIMD float acos(x). More...

static SimdFloat gmx_simdcall	gmx::atan (SimdFloat x)
	SIMD float asin(x). More...

static SimdFloat gmx_simdcall	gmx::atan2 (SimdFloat y, SimdFloat x)
	SIMD float atan2(y,x). More...

static SimdFloat gmx_simdcall	gmx::pmeForceCorrection (SimdFloat z2)
	Calculate the force correction due to PME analytically in SIMD float. More...

static SimdFloat gmx_simdcall	gmx::pmePotentialCorrection (SimdFloat z2)
	Calculate the potential correction due to PME analytically in SIMD float. More...

Double precision SIMD math functions
Note In most cases you should use the real-precision functions instead.
static SimdDouble gmx_simdcall	gmx::copysign (SimdDouble x, SimdDouble y)
	Composes floating point value with the magnitude of x and the sign of y. More...

static SimdDouble gmx_simdcall	gmx::rsqrtIter (SimdDouble lu, SimdDouble x)
	Perform one Newton-Raphson iteration to improve 1/sqrt(x) for SIMD double. More...

static SimdDouble gmx_simdcall	gmx::invsqrt (SimdDouble x)
	Calculate 1/sqrt(x) for SIMD double. More...

static void gmx_simdcall	gmx::invsqrtPair (SimdDouble x0, SimdDouble x1, SimdDouble out0, SimdDouble out1)
	Calculate 1/sqrt(x) for two SIMD doubles. More...

static SimdDouble gmx_simdcall	gmx::rcpIter (SimdDouble lu, SimdDouble x)
	Perform one Newton-Raphson iteration to improve 1/x for SIMD double. More...

static SimdDouble gmx_simdcall	gmx::inv (SimdDouble x)
	Calculate 1/x for SIMD double. More...

static SimdDouble gmx_simdcall	gmx::operator/ (SimdDouble nom, SimdDouble denom)
	Division for SIMD doubles. More...

static SimdDouble	gmx::maskzInvsqrt (SimdDouble x, SimdDBool m)
	Calculate 1/sqrt(x) for masked entries of SIMD double. More...

static SimdDouble gmx_simdcall	gmx::maskzInv (SimdDouble x, SimdDBool m)
	Calculate 1/x for SIMD double, masked version. More...

template<MathOptimization opt = MathOptimization::Safe>
static SimdDouble gmx_simdcall	gmx::sqrt (SimdDouble x)
	Calculate sqrt(x) for SIMD doubles. More...

static SimdDouble gmx_simdcall	gmx::log (SimdDouble x)
	SIMD double log(x). This is the natural logarithm. More...

template<MathOptimization opt = MathOptimization::Safe>
static SimdDouble gmx_simdcall	gmx::exp2 (SimdDouble x)
	SIMD double 2^x. More...

template<MathOptimization opt = MathOptimization::Safe>
static SimdDouble gmx_simdcall	gmx::exp (SimdDouble x)
	SIMD double exp(x). More...

static SimdDouble gmx_simdcall	gmx::erf (SimdDouble x)
	SIMD double erf(x). More...

static SimdDouble gmx_simdcall	gmx::erfc (SimdDouble x)
	SIMD double erfc(x). More...

static void gmx_simdcall	gmx::sincos (SimdDouble x, SimdDouble sinval, SimdDouble cosval)
	SIMD double sin & cos. More...

static SimdDouble gmx_simdcall	gmx::sin (SimdDouble x)
	SIMD double sin(x). More...

static SimdDouble gmx_simdcall	gmx::cos (SimdDouble x)
	SIMD double cos(x). More...

static SimdDouble gmx_simdcall	gmx::tan (SimdDouble x)
	SIMD double tan(x). More...

static SimdDouble gmx_simdcall	gmx::asin (SimdDouble x)
	SIMD double asin(x). More...

static SimdDouble gmx_simdcall	gmx::acos (SimdDouble x)
	SIMD double acos(x). More...

static SimdDouble gmx_simdcall	gmx::atan (SimdDouble x)
	SIMD double asin(x). More...

static SimdDouble gmx_simdcall	gmx::atan2 (SimdDouble y, SimdDouble x)
	SIMD double atan2(y,x). More...

static SimdDouble gmx_simdcall	gmx::pmeForceCorrection (SimdDouble z2)
	Calculate the force correction due to PME analytically in SIMD double. More...

static SimdDouble gmx_simdcall	gmx::pmePotentialCorrection (SimdDouble z2)
	Calculate the potential correction due to PME analytically in SIMD double. More...

SIMD math functions for double prec. data, single prec. accuracy
Note In some cases we do not need full double accuracy of individual SIMD math functions, although the data is stored in double precision SIMD registers. This might be the case for special algorithms, or if the architecture does not support single precision. Since the full double precision evaluation of math functions typically require much more expensive polynomial approximations these functions implement the algorithms used in the single precision SIMD math functions, but they operate on double precision SIMD variables.
static SimdDouble gmx_simdcall	gmx::invsqrtSingleAccuracy (SimdDouble x)
	Calculate 1/sqrt(x) for SIMD double, but in single accuracy. More...

static SimdDouble	gmx::maskzInvsqrtSingleAccuracy (SimdDouble x, SimdDBool m)
	1/sqrt(x) for masked-in entries of SIMD double, but in single accuracy. More...

static void gmx_simdcall	gmx::invsqrtPairSingleAccuracy (SimdDouble x0, SimdDouble x1, SimdDouble out0, SimdDouble out1)
	Calculate 1/sqrt(x) for two SIMD doubles, but single accuracy. More...

static SimdDouble gmx_simdcall	gmx::invSingleAccuracy (SimdDouble x)
	Calculate 1/x for SIMD double, but in single accuracy. More...

static SimdDouble gmx_simdcall	gmx::maskzInvSingleAccuracy (SimdDouble x, SimdDBool m)
	1/x for masked entries of SIMD double, single accuracy. More...

template<MathOptimization opt = MathOptimization::Safe>
static SimdDouble gmx_simdcall	gmx::sqrtSingleAccuracy (SimdDouble x)
	Calculate sqrt(x) (correct for 0.0) for SIMD double, with single accuracy. More...

static SimdDouble gmx_simdcall	gmx::logSingleAccuracy (SimdDouble x)
	SIMD log(x). Double precision SIMD data, single accuracy. More...

template<MathOptimization opt = MathOptimization::Safe>
static SimdDouble gmx_simdcall	gmx::exp2SingleAccuracy (SimdDouble x)
	SIMD 2^x. Double precision SIMD, single accuracy. More...

template<MathOptimization opt = MathOptimization::Safe>
static SimdDouble gmx_simdcall	gmx::expSingleAccuracy (SimdDouble x)
	SIMD exp(x). Double precision SIMD, single accuracy. More...

static SimdDouble gmx_simdcall	gmx::erfSingleAccuracy (SimdDouble x)
	SIMD erf(x). Double precision SIMD data, single accuracy. More...

static SimdDouble gmx_simdcall	gmx::erfcSingleAccuracy (SimdDouble x)
	SIMD erfc(x). Double precision SIMD data, single accuracy. More...

static void gmx_simdcall	gmx::sinCosSingleAccuracy (SimdDouble x, SimdDouble sinval, SimdDouble cosval)
	SIMD sin & cos. Double precision SIMD data, single accuracy. More...

static SimdDouble gmx_simdcall	gmx::sinSingleAccuracy (SimdDouble x)
	SIMD sin(x). Double precision SIMD data, single accuracy. More...

static SimdDouble gmx_simdcall	gmx::cosSingleAccuracy (SimdDouble x)
	SIMD cos(x). Double precision SIMD data, single accuracy. More...

static SimdDouble gmx_simdcall	gmx::tanSingleAccuracy (SimdDouble x)
	SIMD tan(x). Double precision SIMD data, single accuracy. More...

static SimdDouble gmx_simdcall	gmx::asinSingleAccuracy (SimdDouble x)
	SIMD asin(x). Double precision SIMD data, single accuracy. More...

static SimdDouble gmx_simdcall	gmx::acosSingleAccuracy (SimdDouble x)
	SIMD acos(x). Double precision SIMD data, single accuracy. More...

static SimdDouble gmx_simdcall	gmx::atanSingleAccuracy (SimdDouble x)
	SIMD asin(x). Double precision SIMD data, single accuracy. More...

static SimdDouble gmx_simdcall	gmx::atan2SingleAccuracy (SimdDouble y, SimdDouble x)
	SIMD atan2(y,x). Double precision SIMD data, single accuracy. More...

static SimdDouble gmx_simdcall	gmx::pmeForceCorrectionSingleAccuracy (SimdDouble z2)
	Analytical PME force correction, double SIMD data, single accuracy. More...

static SimdDouble gmx_simdcall	gmx::pmePotentialCorrectionSingleAccuracy (SimdDouble z2)
	Analytical PME potential correction, double SIMD data, single accuracy. More...

SIMD4 math functions
Note Only a subset of the math functions are implemented for SIMD4.
static Simd4Float gmx_simdcall	gmx::rsqrtIter (Simd4Float lu, Simd4Float x)
	Perform one Newton-Raphson iteration to improve 1/sqrt(x) for SIMD4 float. More...

static Simd4Float gmx_simdcall	gmx::invsqrt (Simd4Float x)
	Calculate 1/sqrt(x) for SIMD4 float. More...

static Simd4Double gmx_simdcall	gmx::rsqrtIter (Simd4Double lu, Simd4Double x)
	Perform one Newton-Raphson iteration to improve 1/sqrt(x) for SIMD4 double. More...

static Simd4Double gmx_simdcall	gmx::invsqrt (Simd4Double x)
	Calculate 1/sqrt(x) for SIMD4 double. More...

static Simd4Double gmx_simdcall	gmx::invsqrtSingleAccuracy (Simd4Double x)
	Calculate 1/sqrt(x) for SIMD4 double, but in single accuracy. More...

Classes
class	gmx::Simd4Double
	SIMD4 double type. More...

class	gmx::Simd4DBool
	SIMD4 variable type to use for logical comparisons on doubles. More...

class	gmx::Simd4Float
	SIMD4 float type. More...

class	gmx::Simd4FBool
	SIMD4 variable type to use for logical comparisons on floats. More...

class	gmx::SimdDouble
	Double SIMD variable. Available if GMX_SIMD_HAVE_DOUBLE is 1. More...

class	gmx::SimdDInt32
	Integer SIMD variable type to use for conversions to/from double. More...

class	gmx::SimdDBool
	Boolean type for double SIMD data. More...

class	gmx::SimdDIBool
	Boolean type for integer datatypes corresponding to double SIMD. More...

class	gmx::SimdFloat
	Float SIMD variable. Available if GMX_SIMD_HAVE_FLOAT is 1. More...

class	gmx::SimdFInt32
	Integer SIMD variable type to use for conversions to/from float. More...

class	gmx::SimdFBool
	Boolean type for float SIMD data. More...

class	gmx::SimdFIBool
	Boolean type for integer datatypes corresponding to float SIMD. More...

Functions
template<int align>
static void gmx_simdcall	gmx::gatherLoadTranspose (const double base, const std::int32_t offset[], SimdDouble v0, SimdDouble v1, SimdDouble v2, SimdDouble *v3)
	Load 4 consecutive double from each of GMX_SIMD_DOUBLE_WIDTH offsets, and transpose into 4 SIMD double variables. More...

template<int align>
static void gmx_simdcall	gmx::gatherLoadTranspose (const double base, const std::int32_t offset[], SimdDouble v0, SimdDouble *v1)
	Load 2 consecutive double from each of GMX_SIMD_DOUBLE_WIDTH offsets, and transpose into 2 SIMD double variables. More...

template<int align>
static void gmx_simdcall	gmx::gatherLoadUTranspose (const double base, const std::int32_t offset[], SimdDouble v0, SimdDouble v1, SimdDouble v2)
	Load 3 consecutive doubles from each of GMX_SIMD_DOUBLE_WIDTH offsets, and transpose into 3 SIMD double variables. More...

template<int align>
static void gmx_simdcall	gmx::transposeScatterStoreU (double *base, const std::int32_t offset[], SimdDouble v0, SimdDouble v1, SimdDouble v2)
	Transpose and store 3 SIMD doubles to 3 consecutive addresses at GMX_SIMD_DOUBLE_WIDTH offsets. More...

template<int align>
static void gmx_simdcall	gmx::transposeScatterIncrU (double *base, const std::int32_t offset[], SimdDouble v0, SimdDouble v1, SimdDouble v2)
	Transpose and add 3 SIMD doubles to 3 consecutive addresses at GMX_SIMD_DOUBLE_WIDTH offsets. More...

template<int align>
static void gmx_simdcall	gmx::transposeScatterDecrU (double *base, const std::int32_t offset[], SimdDouble v0, SimdDouble v1, SimdDouble v2)
	Transpose and subtract 3 SIMD doubles to 3 consecutive addresses at GMX_SIMD_DOUBLE_WIDTH offsets. More...

static void gmx_simdcall	gmx::expandScalarsToTriplets (SimdDouble scalar, SimdDouble triplets0, SimdDouble triplets1, SimdDouble *triplets2)
	Expand each element of double SIMD variable into three identical consecutive elements in three SIMD outputs. More...

template<int align>
static void gmx_simdcall	gmx::gatherLoadBySimdIntTranspose (const double base, SimdDInt32 offset, SimdDouble v0, SimdDouble v1, SimdDouble v2, SimdDouble *v3)
	Load 4 consecutive doubles from each of GMX_SIMD_DOUBLE_WIDTH offsets specified by a SIMD integer, transpose into 4 SIMD double variables. More...

template<int align>
static void gmx_simdcall	gmx::gatherLoadUBySimdIntTranspose (const double base, SimdDInt32 offset, SimdDouble v0, SimdDouble *v1)
	Load 2 consecutive doubles from each of GMX_SIMD_DOUBLE_WIDTH offsets (unaligned) specified by SIMD integer, transpose into 2 SIMD doubles. More...

template<int align>
static void gmx_simdcall	gmx::gatherLoadBySimdIntTranspose (const double base, SimdDInt32 offset, SimdDouble v0, SimdDouble *v1)
	Load 2 consecutive doubles from each of GMX_SIMD_DOUBLE_WIDTH offsets specified by a SIMD integer, transpose into 2 SIMD double variables. More...

static double gmx_simdcall	gmx::reduceIncr4ReturnSum (double *m, SimdDouble v0, SimdDouble v1, SimdDouble v2, SimdDouble v3)
	Reduce each of four SIMD doubles, add those values to four consecutive doubles in memory, return sum. More...

template<int align>
static void gmx_simdcall	gmx::gatherLoadTranspose (const float base, const std::int32_t offset[], SimdFloat v0, SimdFloat v1, SimdFloat v2, SimdFloat *v3)
	Load 4 consecutive floats from each of GMX_SIMD_FLOAT_WIDTH offsets, and transpose into 4 SIMD float variables. More...

template<int align>
static void gmx_simdcall	gmx::gatherLoadTranspose (const float base, const std::int32_t offset[], SimdFloat v0, SimdFloat *v1)
	Load 2 consecutive floats from each of GMX_SIMD_FLOAT_WIDTH offsets, and transpose into 2 SIMD float variables. More...

template<int align>
static void gmx_simdcall	gmx::gatherLoadUTranspose (const float base, const std::int32_t offset[], SimdFloat v0, SimdFloat v1, SimdFloat v2)
	Load 3 consecutive floats from each of GMX_SIMD_FLOAT_WIDTH offsets, and transpose into 3 SIMD float variables. More...

template<int align>
static void gmx_simdcall	gmx::transposeScatterStoreU (float *base, const std::int32_t offset[], SimdFloat v0, SimdFloat v1, SimdFloat v2)
	Transpose and store 3 SIMD floats to 3 consecutive addresses at GMX_SIMD_FLOAT_WIDTH offsets. More...

template<int align>
static void gmx_simdcall	gmx::transposeScatterIncrU (float *base, const std::int32_t offset[], SimdFloat v0, SimdFloat v1, SimdFloat v2)
	Transpose and add 3 SIMD floats to 3 consecutive addresses at GMX_SIMD_FLOAT_WIDTH offsets. More...

template<int align>
static void gmx_simdcall	gmx::transposeScatterDecrU (float *base, const std::int32_t offset[], SimdFloat v0, SimdFloat v1, SimdFloat v2)
	Transpose and subtract 3 SIMD floats to 3 consecutive addresses at GMX_SIMD_FLOAT_WIDTH offsets. More...

static void gmx_simdcall	gmx::expandScalarsToTriplets (SimdFloat scalar, SimdFloat triplets0, SimdFloat triplets1, SimdFloat *triplets2)
	Expand each element of float SIMD variable into three identical consecutive elements in three SIMD outputs. More...

template<int align>
static void gmx_simdcall	gmx::gatherLoadBySimdIntTranspose (const float base, SimdFInt32 offset, SimdFloat v0, SimdFloat v1, SimdFloat v2, SimdFloat *v3)
	Load 4 consecutive floats from each of GMX_SIMD_FLOAT_WIDTH offsets specified by a SIMD integer, transpose into 4 SIMD float variables. More...

template<int align>
static void gmx_simdcall	gmx::gatherLoadUBySimdIntTranspose (const float base, SimdFInt32 offset, SimdFloat v0, SimdFloat *v1)
	Load 2 consecutive floats from each of GMX_SIMD_FLOAT_WIDTH offsets (unaligned) specified by SIMD integer, transpose into 2 SIMD floats. More...

template<int align>
static void gmx_simdcall	gmx::gatherLoadBySimdIntTranspose (const float base, SimdFInt32 offset, SimdFloat v0, SimdFloat *v1)
	Load 2 consecutive floats from each of GMX_SIMD_FLOAT_WIDTH offsets specified by a SIMD integer, transpose into 2 SIMD float variables. More...

static float gmx_simdcall	gmx::reduceIncr4ReturnSum (float *m, SimdFloat v0, SimdFloat v1, SimdFloat v2, SimdFloat v3)
	Reduce each of four SIMD floats, add those values to four consecutive floats in memory, return sum. More...

static SimdFloat gmx_simdcall	gmx::invsqrtSingleAccuracy (SimdFloat x)
	Calculate 1/sqrt(x) for SIMD float, only targeting single accuracy. More...

static SimdFloat	gmx::maskzInvsqrtSingleAccuracy (SimdFloat x, SimdFBool m)
	Calculate 1/sqrt(x) for masked SIMD floats, only targeting single accuracy. More...

static void gmx_simdcall	gmx::invsqrtPairSingleAccuracy (SimdFloat x0, SimdFloat x1, SimdFloat out0, SimdFloat out1)
	Calculate 1/sqrt(x) for two SIMD floats, only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::invSingleAccuracy (SimdFloat x)
	Calculate 1/x for SIMD float, only targeting single accuracy. More...

static SimdFloat	gmx::maskzInvSingleAccuracy (SimdFloat x, SimdFBool m)
	Calculate 1/x for masked SIMD floats, only targeting single accuracy. More...

template<MathOptimization opt = MathOptimization::Safe>
static SimdFloat gmx_simdcall	gmx::sqrtSingleAccuracy (SimdFloat x)
	Calculate sqrt(x) for SIMD float, always targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::logSingleAccuracy (SimdFloat x)
	SIMD float log(x), only targeting single accuracy. This is the natural logarithm. More...

template<MathOptimization opt = MathOptimization::Safe>
static SimdFloat gmx_simdcall	gmx::exp2SingleAccuracy (SimdFloat x)
	SIMD float 2^x, only targeting single accuracy. More...

template<MathOptimization opt = MathOptimization::Safe>
static SimdFloat gmx_simdcall	gmx::expSingleAccuracy (SimdFloat x)
	SIMD float e^x, only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::erfSingleAccuracy (SimdFloat x)
	SIMD float erf(x), only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::erfcSingleAccuracy (SimdFloat x)
	SIMD float erfc(x), only targeting single accuracy. More...

static void gmx_simdcall	gmx::sinCosSingleAccuracy (SimdFloat x, SimdFloat sinval, SimdFloat cosval)
	SIMD float sin & cos, only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::sinSingleAccuracy (SimdFloat x)
	SIMD float sin(x), only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::cosSingleAccuracy (SimdFloat x)
	SIMD float cos(x), only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::tanSingleAccuracy (SimdFloat x)
	SIMD float tan(x), only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::asinSingleAccuracy (SimdFloat x)
	SIMD float asin(x), only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::acosSingleAccuracy (SimdFloat x)
	SIMD float acos(x), only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::atanSingleAccuracy (SimdFloat x)
	SIMD float atan(x), only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::atan2SingleAccuracy (SimdFloat y, SimdFloat x)
	SIMD float atan2(y,x), only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::pmeForceCorrectionSingleAccuracy (SimdFloat z2)
	SIMD Analytic PME force correction, only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::pmePotentialCorrectionSingleAccuracy (SimdFloat z2)
	SIMD Analytic PME potential correction, only targeting single accuracy. More...

static Simd4Float gmx_simdcall	gmx::invsqrtSingleAccuracy (Simd4Float x)
	Calculate 1/sqrt(x) for SIMD4 float, only targeting single accuracy. More...

static SimdFloat gmx_simdcall	gmx::iprod (SimdFloat ax, SimdFloat ay, SimdFloat az, SimdFloat bx, SimdFloat by, SimdFloat bz)
	SIMD float inner product of multiple float vectors. More...

static SimdFloat gmx_simdcall	gmx::norm2 (SimdFloat ax, SimdFloat ay, SimdFloat az)
	SIMD float norm squared of multiple vectors. More...

static void gmx_simdcall	gmx::cprod (SimdFloat ax, SimdFloat ay, SimdFloat az, SimdFloat bx, SimdFloat by, SimdFloat bz, SimdFloat cx, SimdFloat cy, SimdFloat *cz)
	SIMD float cross-product of multiple vectors. More...

static SimdDouble gmx_simdcall	gmx::iprod (SimdDouble ax, SimdDouble ay, SimdDouble az, SimdDouble bx, SimdDouble by, SimdDouble bz)
	SIMD double inner product of multiple double vectors. More...

static SimdDouble gmx_simdcall	gmx::norm2 (SimdDouble ax, SimdDouble ay, SimdDouble az)
	SIMD double norm squared of multiple vectors. More...

static void gmx_simdcall	gmx::cprod (SimdDouble ax, SimdDouble ay, SimdDouble az, SimdDouble bx, SimdDouble by, SimdDouble bz, SimdDouble cx, SimdDouble cy, SimdDouble *cz)
	SIMD double cross-product of multiple vectors. More...

static Simd4Float gmx_simdcall	gmx::norm2 (Simd4Float ax, Simd4Float ay, Simd4Float az)
	SIMD4 float norm squared of multiple vectors. More...

static Simd4Double gmx_simdcall	gmx::norm2 (Simd4Double ax, Simd4Double ay, Simd4Double az)
	SIMD4 double norm squared of multiple vectors. More...

Variables
static const int	gmx::c_simdBestPairAlignmentDouble = 2
	Best alignment to use for aligned pairs of double data. More...

static const int	gmx::c_simdBestPairAlignmentFloat = 2
	Best alignment to use for aligned pairs of float data. More...

Directories
directory	simd
	SIMD intrinsics interface (simd)

directory	tests
	Unit tests for SIMD intrinsics interface (simd).

Files
file	impl_reference.h
	Reference SIMD implementation, including SIMD documentation.

file	impl_reference_definitions.h
	Reference SIMD implementation, including SIMD documentation.

file	impl_reference_general.h
	Reference SIMD implementation, general utility functions.

file	impl_reference_simd4_double.h
	Reference implementation, SIMD4 single precision.

file	impl_reference_simd4_float.h
	Reference implementation, SIMD4 single precision.

file	impl_reference_simd_double.h
	Reference implementation, SIMD double precision.

file	impl_reference_simd_float.h
	Reference implementation, SIMD single precision.

file	impl_reference_util_double.h
	Reference impl., higher-level double prec. SIMD utility functions.

file	impl_reference_util_float.h
	Reference impl., higher-level single prec. SIMD utility functions.

file	scalar.h
	Scalar float functions corresponding to GROMACS SIMD functions.

file	scalar_math.h
	Scalar math functions mimicking GROMACS SIMD math functions.

file	scalar_util.h
	Scalar utility functions mimicking GROMACS SIMD utility functions.

file	simd.h
	Definitions, capabilities, and wrappers for SIMD module.

file	simd_math.h
	Math functions for SIMD datatypes.

file	simd_memory.h
	Declares SimdArrayRef.

file	support.h
	Functions to query compiled and supported SIMD architectures.

file	vector_operations.h
	SIMD operations corresponding to Gromacs rvec datatypes.

Macro Definition Documentation

#define GMX_SIMD4_HAVE_REAL GMX_SIMD4_HAVE_FLOAT

1 if Simd4Real is available, otherwise 0.

GMX_SIMD4_HAVE_DOUBLE if GMX_DOUBLE is 1, otherwise GMX_SIMD4_HAVE_FLOAT.

#define GMX_SIMD_HAVE_FLOAT 1

1 when SIMD float support is present, otherwise 0

You should only use this to specifically check for single precision SIMD, support, even when the rest of Gromacs uses double precision.

#define GMX_SIMD_HAVE_FMA 0

1 if the SIMD implementation has fused-multiply add hardware

Note: All the fused multiply-add functions are always available and can be used in any code (by executing separate multiply and add ops), but in a few very tight loops you might be able to save a few instructions with a separate non-FMA code path.

#define GMX_SIMD_HAVE_GATHER_LOADU_BYSIMDINT_TRANSPOSE_REAL GMX_SIMD_HAVE_GATHER_LOADU_BYSIMDINT_TRANSPOSE_FLOAT

1 if gmx::simdGatherLoadUBySimdIntTranspose is present, otherwise 0

GMX_SIMD_HAVE_GATHER_LOADU_BYSIMDINT_TRANSPOSE_DOUBLE if GMX_DOUBLE is 1, otherwise GMX_SIMD_HAVE_GATHER_LOADU_BYSIMDINT_TRANSPOSE_FLOAT.

#define GMX_SIMD_HAVE_HSIMD_UTIL_REAL GMX_SIMD_HAVE_HSIMD_UTIL_FLOAT

1 if real half-register load/store/reduce utils present, otherwise 0

GMX_SIMD_HAVE_HSIMD_UTIL_DOUBLE if GMX_DOUBLE is 1, otherwise GMX_SIMD_HAVE_HSIMD_UTIL_FLOAT.

#define GMX_SIMD_HAVE_INT32_ARITHMETICS GMX_SIMD_HAVE_FINT32_ARITHMETICS

1 if arithmetic ops are supported on SimdInt32, otherwise 0.

GMX_SIMD_HAVE_DINT32_ARITHMETICS if GMX_DOUBLE is 1, otherwise GMX_SIMD_HAVE_FINT32_ARITHMETICS.

#define GMX_SIMD_HAVE_INT32_EXTRACT GMX_SIMD_HAVE_FINT32_EXTRACT

1 if support is available for extracting elements from SimdInt32, otherwise 0

GMX_SIMD_HAVE_DINT32_EXTRACT if GMX_DOUBLE is 1, otherwise GMX_SIMD_HAVE_FINT32_EXTRACT.

#define GMX_SIMD_HAVE_INT32_LOGICAL GMX_SIMD_HAVE_FINT32_LOGICAL

1 if logical ops are supported on SimdInt32, otherwise 0.

GMX_SIMD_HAVE_DINT32_LOGICAL if GMX_DOUBLE is 1, otherwise GMX_SIMD_HAVE_FINT32_LOGICAL.

#define GMX_SIMD_HAVE_NATIVE_COPYSIGN_DOUBLE 0

1 if implementation provides double precision copysign()