public class JCudaKernels extends Object
The utility org.apache.sysml.runtime.instructions.gpu.context.JCudaKernels simplifies the launching of the kernels.
For example: to launch a kernel
copyUpperToLowerTriangleDense<<1,1,32,32>>(jcudaDenseMatrixPtr, dim, dim*dim)
, the user has to call:
kernels.launchKernel("copyUpperToLowerTriangleDense", new ExecutionConfig(1,1,32,32), jcudaDenseMatrixPtr, dim, dim*dim)
Modifier and Type | Method and Description |
---|---|
static void |
checkResult(int cuResult) |
void |
launchKernel(String name,
ExecutionConfig config,
Object... arguments)
Setups the kernel parameters and launches the kernel using cuLaunchKernel API.
|
public void launchKernel(String name, ExecutionConfig config, Object... arguments)
name
- name of the kernelconfig
- execution configurationarguments
- can be of type Pointer, long, double, float and intpublic static void checkResult(int cuResult)
Copyright © 2018 The Apache Software Foundation. All rights reserved.