Gromacs  2021-beta2-UNCHECKED
 All Classes Namespaces Files Functions Variables Typedefs Enumerations Enumerator Friends Macros Groups Pages
Functions
device_management.h File Reference
#include <memory>
#include <string>
#include <vector>
#include "gromacs/utility/basedefinitions.h"
#include "gromacs/utility/iserializer.h"
+ Include dependency graph for device_management.h:
+ This graph shows which files directly or indirectly include this file:

Description

Declares functions to manage GPU resources.

This has several implementations: one for each supported GPU platform, and a stub implementation if the build does not support GPUs.

Author
Anca Hamuraru anca@.nosp@m.stre.nosp@m.amcom.nosp@m.puti.nosp@m.ng.eu
Dimitrios Karkoulis dimit.nosp@m.ris..nosp@m.karko.nosp@m.ulis.nosp@m.@gmai.nosp@m.l.co.nosp@m.m
Teemu Virolainen teemu.nosp@m.@str.nosp@m.eamco.nosp@m.mput.nosp@m.ing.e.nosp@m.u
Mark Abraham mark..nosp@m.j.ab.nosp@m.raham.nosp@m.@gma.nosp@m.il.co.nosp@m.m
Szilárd Páll pall..nosp@m.szil.nosp@m.ard@g.nosp@m.mail.nosp@m..com
Artem Zhmurov zhmur.nosp@m.ov@g.nosp@m.mail..nosp@m.com

Functions

enum DeviceVendor int bool canPerformDeviceDetection (std::string *errorMessage)
 Return whether GPUs can be detected. More...
 
bool isDeviceDetectionEnabled ()
 Return whether GPU detection is enabled. More...
 
bool isDeviceDetectionFunctional (std::string *errorMessage)
 Return whether GPU detection is functioning correctly. More...
 
DeviceVendor getDeviceVendor (const char *vendorName)
 Returns an DeviceVendor value corresponding to the input OpenCL vendor name. More...
 
std::vector< std::unique_ptr
< DeviceInformation > > 
findDevices ()
 Find all GPUs in the system. More...
 
std::vector
< std::reference_wrapper
< DeviceInformation > > 
getCompatibleDevices (const std::vector< std::unique_ptr< DeviceInformation >> &deviceInfoList)
 Return a container of device-information handles that are compatible. More...
 
std::vector< int > getCompatibleDeviceIds (const std::vector< std::unique_ptr< DeviceInformation >> &deviceInfoList)
 Return a container of the IDs of the compatible GPU ids. More...
 
bool deviceIdIsCompatible (const std::vector< std::unique_ptr< DeviceInformation >> &deviceInfoList, int deviceId)
 Return whether deviceId is found in deviceInfoList and is compatible. More...
 
void setActiveDevice (const DeviceInformation &deviceInfo)
 Set the active GPU. More...
 
void releaseDevice (DeviceInformation *deviceInfo)
 Releases the GPU device used by the active context at the time of calling (CUDA only). More...
 
std::string getDeviceInformationString (const DeviceInformation &deviceInfo)
 Formats and returns a device information string for a given GPU. More...
 
std::string getDeviceCompatibilityDescription (const std::vector< std::unique_ptr< DeviceInformation >> &deviceInfoList, int deviceId)
 Return a string describing how compatible the GPU with given deviceId is. More...
 
void serializeDeviceInformations (const std::vector< std::unique_ptr< DeviceInformation >> &deviceInfoList, gmx::ISerializer *serializer)
 Serialization of information on devices for MPI broadcasting. More...
 
std::vector< std::unique_ptr
< DeviceInformation > > 
deserializeDeviceInformations (gmx::ISerializer *serializer)
 Deserialization of information on devices after MPI broadcasting. More...
 

Function Documentation

enum DeviceVendor int bool canPerformDeviceDetection ( std::string *  errorMessage)
strong

Return whether GPUs can be detected.

Returns true when this is a build of GROMACS configured to support GPU usage, GPU detection is not disabled by GMX_DISABLE_GPU_DETECTION environment variable and a valid device driver, ICD, and/or runtime was detected. Does not throw.

Parameters
[out]errorMessageWhen returning false on a build configured with GPU support and non-nullptr was passed, the string contains a descriptive message about why GPUs cannot be detected.
std::vector<std::unique_ptr<DeviceInformation> > deserializeDeviceInformations ( gmx::ISerializer serializer)

Deserialization of information on devices after MPI broadcasting.

Parameters
[in]serializerSerializing object.
Returns
deviceInfoList Deserialized vector with device informations.
bool deviceIdIsCompatible ( const std::vector< std::unique_ptr< DeviceInformation >> &  deviceInfoList,
int  deviceId 
)

Return whether deviceId is found in deviceInfoList and is compatible.

This function filters the result of the detection for compatible GPUs, based on the previously run compatibility tests.

Parameters
[in]deviceInfoListAn information on available devices.
[in]deviceIdThe device ID to find in the list.
Exceptions
RangeErrorIf deviceId does not match the id of any device in deviceInfoList
Returns
Whether deviceId is compatible.
std::vector<std::unique_ptr<DeviceInformation> > findDevices ( )

Find all GPUs in the system.

Will detect every GPU supported by the device driver in use. Must only be called if canPerformDeviceDetection() has returned true. This routine also checks for the compatibility of each device and fill the deviceInfo array with the required information on each device: ID, device properties, status.

Note that this function leaves the GPU runtime API error state clean; this is implemented ATM in the CUDA flavor. This invalidates any existing CUDA streams, allocated memory on GPU, etc.

Todo:
: Check if errors do propagate in OpenCL as they do in CUDA and whether there is a mechanism to "clear" them.
Returns
Standard vector with the list of devices found
Exceptions
InternalErrorif a GPU API returns an unexpected failure (because the call to canDetectGpus() should always prevent this occuring)
std::vector<int> getCompatibleDeviceIds ( const std::vector< std::unique_ptr< DeviceInformation >> &  deviceInfoList)

Return a container of the IDs of the compatible GPU ids.

This function filters the result of the detection for compatible GPUs, based on the previously run compatibility tests.

Parameters
[in]deviceInfoListAn information on available devices.
Returns
Vector of compatible GPU ids.
std::vector<std::reference_wrapper<DeviceInformation> > getCompatibleDevices ( const std::vector< std::unique_ptr< DeviceInformation >> &  deviceInfoList)

Return a container of device-information handles that are compatible.

This function filters the result of the detection for compatible GPUs, based on the previously run compatibility tests.

Parameters
[in]deviceInfoListAn information on available devices.
Returns
Vector of DeviceInformations on GPUs recorded as compatible
std::string getDeviceCompatibilityDescription ( const std::vector< std::unique_ptr< DeviceInformation >> &  deviceInfoList,
int  deviceId 
)

Return a string describing how compatible the GPU with given deviceId is.

Parameters
[in]deviceInfoListAn information on available devices.
[in]deviceIdAn index of the device to check
Returns
A string describing the compatibility status, useful for error messages.
std::string getDeviceInformationString ( const DeviceInformation deviceInfo)

Formats and returns a device information string for a given GPU.

Given an index directly into the array of available GPUs, returns a formatted info string for the respective GPU which includes ID, name, compute capability, and detection status.

Parameters
[in]deviceInfoAn information on device that is to be set.
Returns
A string describing the device.
DeviceVendor getDeviceVendor ( const char *  vendorName)

Returns an DeviceVendor value corresponding to the input OpenCL vendor name.

Returns
DeviceVendor value for the input vendor name
bool isDeviceDetectionEnabled ( )

Return whether GPU detection is enabled.

Returns true when this is a build of GROMACS configured to support GPU usage and GPU detection is not disabled by GMX_DISABLE_GPU_DETECTION environment variable.

Does not throw.

bool isDeviceDetectionFunctional ( std::string *  errorMessage)

Return whether GPU detection is functioning correctly.

Returns true when this is a build of GROMACS configured to support GPU usage, and a valid device driver, ICD, and/or runtime was detected.

This function is not intended to be called from build configurations that do not support GPUs, and there will be no descriptive message in that case.

Parameters
[out]errorMessageWhen returning false on a build configured with GPU support and non-nullptr was passed, the string contains a descriptive message about why GPUs cannot be detected.

Does not throw.

void releaseDevice ( DeviceInformation deviceInfo)

Releases the GPU device used by the active context at the time of calling (CUDA only).

If deviceInfo is nullptr, then it is understood that no device was selected so no context is active to be freed. Otherwise, the context is explicitly destroyed and therefore all data uploaded to the GPU is lost. This must only be called when none of this data is required anymore, because subsequent attempts to free memory associated with the context will otherwise fail.

Calls gmx_warning upon errors.

Todo:
This should go through all the devices, not only the one currently active. Reseting only one device will not work, e.g. in CUDA tests.
Parameters
[in]deviceInfoInformation on the device to be released.
void serializeDeviceInformations ( const std::vector< std::unique_ptr< DeviceInformation >> &  deviceInfoList,
gmx::ISerializer serializer 
)

Serialization of information on devices for MPI broadcasting.

Parameters
[in]deviceInfoListThe vector with device informations to serialize.
[in]serializerSerializing object.
void setActiveDevice ( const DeviceInformation deviceInfo)

Set the active GPU.

This sets the device for which the device information is passed active. Essential in CUDA, where the device buffers and kernel launches are not connected to the device context. In OpenCL, checks the device vendor and makes vendor-specific performance adjustments.

Parameters
[in]deviceInfoInformation on the device to be set.

Issues a fatal error for any critical errors that occur during initialization.