|
HIP: Heterogenous-computing Interface for Portability
|
Functions | |
| hipError_t | hipHccGetAccelerator (int deviceId, hc::accelerator *acc) |
| Return hc::accelerator associated with the specified deviceId. More... | |
| hipError_t | hipHccGetAcceleratorView (hipStream_t stream, hc::accelerator_view **av) |
| Return hc::accelerator_view associated with the specified stream. More... | |
| hipError_t | hipExtModuleLaunchKernel (hipFunction_t f, uint32_t globalWorkSizeX, uint32_t globalWorkSizeY, uint32_t globalWorkSizeZ, uint32_t localWorkSizeX, uint32_t localWorkSizeY, uint32_t localWorkSizeZ, size_t sharedMemBytes, hipStream_t hStream, void **kernelParams, void **extra, hipEvent_t startEvent=nullptr, hipEvent_t stopEvent=nullptr, uint32_t flags=0) |
| launches kernel f with launch parameters and shared memory on stream with arguments passed to kernelparams or extra More... | |
| hipError_t | hipHccModuleLaunchKernel (hipFunction_t f, uint32_t globalWorkSizeX, uint32_t globalWorkSizeY, uint32_t globalWorkSizeZ, uint32_t localWorkSizeX, uint32_t localWorkSizeY, uint32_t localWorkSizeZ, size_t sharedMemBytes, hipStream_t hStream, void **kernelParams, void **extra, hipEvent_t startEvent=nullptr, hipEvent_t stopEvent=nullptr) __attribute__((deprecated("use hipExtModuleLaunchKernel instead"))) |
| hipError_t hipExtModuleLaunchKernel | ( | hipFunction_t | f, |
| uint32_t | globalWorkSizeX, | ||
| uint32_t | globalWorkSizeY, | ||
| uint32_t | globalWorkSizeZ, | ||
| uint32_t | localWorkSizeX, | ||
| uint32_t | localWorkSizeY, | ||
| uint32_t | localWorkSizeZ, | ||
| size_t | sharedMemBytes, | ||
| hipStream_t | hStream, | ||
| void ** | kernelParams, | ||
| void ** | extra, | ||
| hipEvent_t | startEvent = nullptr, |
||
| hipEvent_t | stopEvent = nullptr, |
||
| uint32_t | flags = 0 |
||
| ) |
launches kernel f with launch parameters and shared memory on stream with arguments passed to kernelparams or extra
| [in[ | f Kernel to launch. | |
| [in] | gridDimX | X grid dimension specified in work-items |
| [in] | gridDimY | Y grid dimension specified in work-items |
| [in] | gridDimZ | Z grid dimension specified in work-items |
| [in] | blockDimX | X block dimensions specified in work-items |
| [in] | blockDimY | Y grid dimension specified in work-items |
| [in] | blockDimZ | Z grid dimension specified in work-items |
| [in] | sharedMemBytes | Amount of dynamic shared memory to allocate for this kernel. The kernel can access this with HIP_DYNAMIC_SHARED. |
| [in] | stream | Stream where the kernel should be dispatched. May be 0, in which case th default stream is used with associated synchronization rules. |
| [in] | kernelParams | |
| [in] | extra | Pointer to kernel arguments. These are passed directly to the kernel and must be in the memory layout and alignment expected by the kernel. |
| [in] | startEvent | If non-null, specified event will be updated to track the start time of the kernel launch. The event must be created before calling this API. |
| [in] | stopEvent | If non-null, specified event will be updated to track the stop time of the kernel launch. The event must be created before calling this API. |
HIP/ROCm actually updates the start event when the associated kernel completes.
| hipError_t hipHccGetAccelerator | ( | int | deviceId, |
| hc::accelerator * | acc | ||
| ) |
Return hc::accelerator associated with the specified deviceId.
| hipError_t hipHccGetAcceleratorView | ( | hipStream_t | stream, |
| hc::accelerator_view ** | av | ||
| ) |
Return hc::accelerator_view associated with the specified stream.
If stream is 0, the accelerator_view for the default stream is returned.
1.8.11