public class LibMatrixCuMatMult extends LibMatrixCUDA
cudaSupportFunctions, CUDNN_DATA_TYPE, customKernelSuffix, maxNumElementsOfCuDNNTensor, sizeOfDataType
Constructor and Description |
---|
LibMatrixCuMatMult() |
Modifier and Type | Method and Description |
---|---|
static org.apache.sysml.runtime.controlprogram.caching.MatrixObject |
matmult(org.apache.sysml.runtime.controlprogram.context.ExecutionContext ec,
GPUContext gCtx,
String instName,
org.apache.sysml.runtime.controlprogram.caching.MatrixObject left,
org.apache.sysml.runtime.controlprogram.caching.MatrixObject right,
String outputName,
boolean isLeftTransposed,
boolean isRightTransposed)
Matrix multiply on GPU Examines sparsity and shapes and routes call to
appropriate method from cuBLAS or cuSparse C = op(A) x op(B)
The user is expected to call
ec.releaseMatrixOutputForGPUInstruction(outputName);
|
abs, acos, asin, atan, axpy, biasAdd, biasMultiply, cbind, ceil, channelSums, computeNNZ, cos, cosh, dataTypePointerTo, denseTranspose, deviceCopy, double2float, exp, float2double, floor, getCublasHandle, getCudaKernels, getCusparseHandle, getDenseMatrixOutputForGPUInstruction, getDensePointer, getNnz, getSparsePointer, isInSparseFormat, log, matmultTSMM, matrixMatrixArithmetic, matrixMatrixRelational, matrixScalarArithmetic, matrixScalarOp, matrixScalarRelational, one, rbind, reluBackward, resetFloatingPointPrecision, round, sigmoid, sign, sin, sinh, sliceDenseDense, sliceOperations, sliceSparseDense, solve, sqrt, tan, tanh, toInt, transpose, unaryAggregate, zero
public static org.apache.sysml.runtime.controlprogram.caching.MatrixObject matmult(org.apache.sysml.runtime.controlprogram.context.ExecutionContext ec, GPUContext gCtx, String instName, org.apache.sysml.runtime.controlprogram.caching.MatrixObject left, org.apache.sysml.runtime.controlprogram.caching.MatrixObject right, String outputName, boolean isLeftTransposed, boolean isRightTransposed)
ec
- Current ExecutionContext
instancegCtx
- a valid GPUContext
instName
- name of the invoking instruction to recordStatistics
.left
- Matrix Aright
- Matrix BoutputName
- Name of the output matrix C (in code generated after LOP
layer)isLeftTransposed
- op for A, transposed or notisRightTransposed
- op for B, tranposed or notCopyright © 2018 The Apache Software Foundation. All rights reserved.