Class IdentityDictionarySlice
- java.lang.Object
-
- org.apache.sysds.runtime.compress.colgroup.dictionary.ADictionary
-
- org.apache.sysds.runtime.compress.colgroup.dictionary.IdentityDictionary
-
- org.apache.sysds.runtime.compress.colgroup.dictionary.IdentityDictionarySlice
-
- All Implemented Interfaces:
Serializable,IDictionary
public class IdentityDictionarySlice extends IdentityDictionary
- See Also:
- Serialized Form
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from interface org.apache.sysds.runtime.compress.colgroup.dictionary.IDictionary
IDictionary.DictType
-
-
Field Summary
-
Fields inherited from interface org.apache.sysds.runtime.compress.colgroup.dictionary.IDictionary
LOG
-
-
Constructor Summary
Constructors Constructor Description IdentityDictionarySlice(int nRowCol, boolean withEmpty, int l, int u)Create a Identity matrix dictionary slice.
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description voidaddToEntry(double[] v, int fr, int to, int nCol, int rep)Adds the dictionary entry from this dictionary to the v dictionary rep times.voidaddToEntryVectorized(double[] v, int f1, int f2, int f3, int f4, int f5, int f6, int f7, int f8, int t1, int t2, int t3, int t4, int t5, int t6, int t7, int t8, int nCol)Vectorized add to entry, this call helps with a bit of locality for the cache.voidaggregateCols(double[] c, Builtin fn, IColIndex colIndexes)Aggregates the columns into the target double array provided.double[]aggregateRows(Builtin fn, int nCol)Aggregate all entries in the rows.IDictionaryclone()Returns a deep clone of the dictionary.voidcolProduct(double[] res, int[] counts, IColIndex colIndexes)Calculate the column product of the dictionary weighted by counts.voidcolSum(double[] c, int[] counts, IColIndex colIndexes)Get the column sum of the values contained in the dictionaryvoidcolSumSq(double[] c, int[] counts, IColIndex colIndexes)Get the column sum of the values contained in the dictionarybooleancontainsValue(double pattern)Detect if the dictionary contains a specific value.booleanequals(IDictionary o)Indicate if the other dictionary is equal to this.IDictionary.DictTypegetDictType()Get the dictionary type this dictionary is.longgetExactSizeOnDisk()Calculate the space consumption if the dictionary is stored on disk.longgetInMemorySize()Returns the memory usage of the dictionary.static longgetInMemorySize(int numberColumns)MatrixBlockDictionarygetMBDict(int nCol)Get this dictionary as a MatrixBlock dictionary.longgetNumberNonZeros(int[] counts, int nCol)Calculate the number of non zeros in the dictionary.doublegetSparsity()Get the sparsity of the dictionary.StringgetString(int colIndexes)Get a string representation of the dictionary, that considers the layout of the data.doublegetValue(int i)Get Specific value contained in the dictionary at index.doublegetValue(int r, int c, int nCol)Get Specific value contain in dictionary at index.double[]getValues()Get all the values contained in the dictionary as a linearized double array.IDictionarypreaggValuesFromDense(int numVals, IColIndex colIndexes, IColIndex aggregateColumns, double[] b, int cut)Pre Aggregate values for Right Matrix Multiplication.double[]productAllRowsToDouble(int nCol)Method to product all rows to a column vector.double[]productAllRowsToDoubleWithDefault(double[] defaultTuple)Method to product all rows to a column vector with a default value added in the end.static IdentityDictionaryread(DataInput in)IDictionaryreplace(double pattern, double replace, int nCol)Make a copy of the values, and replace all values that match pattern with replacement value.IDictionaryreplaceWithReference(double pattern, double replace, double[] reference)Make a copy of the values, and replace all values that match pattern with replacement value.IDictionaryscaleTuples(int[] scaling, int nCol)Scale all tuples contained in the dictionary by the scaling factor given in the int list.IDictionarysliceOutColumnRange(int idxStart, int idxEnd, int previousNumberOfColumns)Modify the dictionary by removing columns not within the index range.doublesum(int[] counts, int ncol)Get the sum of the values contained in the dictionarydouble[]sumAllRowsToDouble(int nrColumns)Method used as a pre-aggregate of each tuple in the dictionary, to single double values.double[]sumAllRowsToDoubleSq(int nrColumns)Method used as a pre-aggregate of each tuple in the dictionary, to single double values.double[]sumAllRowsToDoubleWithDefault(double[] defaultTuple)Do exactly the same as the sumAllRowsToDouble but also sum the array given to a extra index in the end of the array.double[]sumAllRowsToDoubleWithReference(double[] reference)Method used as a pre-aggregate of each tuple in the dictionary, to single double values with a reference.doublesumSq(int[] counts, int ncol)Get the square sum of the values contained in the dictionaryStringtoString()voidwrite(DataOutput out)Write the dictionary to a DataOutput.-
Methods inherited from class org.apache.sysds.runtime.compress.colgroup.dictionary.IdentityDictionary
addToEntry, aggregate, aggregateColsWithReference, aggregateRowsWithDefault, aggregateRowsWithReference, aggregateWithReference, applyScalarOp, applyScalarOpAndAppend, applyScalarOpWithReference, applyUnaryOp, applyUnaryOpAndAppend, applyUnaryOpWithReference, binOpLeft, binOpLeftAndAppend, binOpLeftWithReference, binOpRight, binOpRight, binOpRightAndAppend, binOpRightWithReference, cbind, centralMoment, centralMomentWithDefault, centralMomentWithReference, colProductWithReference, colSumSqWithReference, containsValueWithReference, getMBDict, getNumberNonZerosWithReference, getNumberOfValues, MMDict, MMDictDense, MMDictScaling, MMDictScalingDense, MMDictScalingSparse, MMDictSparse, multiplyScalar, product, productAllRowsToDoubleWithReference, productWithDefault, productWithReference, reorder, rexpandCols, rexpandColsWithReference, subtractTuple, sumAllRowsToDoubleSqWithDefault, sumAllRowsToDoubleSqWithReference, sumSqWithReference, TSMMToUpperTriangle, TSMMToUpperTriangleDense, TSMMToUpperTriangleDenseScaling, TSMMToUpperTriangleScaling, TSMMToUpperTriangleSparse, TSMMToUpperTriangleSparseScaling, TSMMWithScaling
-
Methods inherited from class org.apache.sysds.runtime.compress.colgroup.dictionary.ADictionary
centralMoment, centralMomentWithDefault, centralMomentWithReference, correctNan, doubleToString, equals, equals
-
-
-
-
Constructor Detail
-
IdentityDictionarySlice
public IdentityDictionarySlice(int nRowCol, boolean withEmpty, int l, int u)Create a Identity matrix dictionary slice. It behaves as if allocated a Sparse Matrix block but exploits that the structure is known to have certain properties.- Parameters:
nRowCol- the number of rows and columns in this identity matrix.withEmpty- If the matrix should contain an empty row in the end.l- the index lower to start atu- the index upper to end at (not inclusive)
-
-
Method Detail
-
getValues
public double[] getValues()
Description copied from interface:IDictionaryGet all the values contained in the dictionary as a linearized double array.- Specified by:
getValuesin interfaceIDictionary- Overrides:
getValuesin classIdentityDictionary- Returns:
- linearized double array
-
getValue
public double getValue(int i)
Description copied from interface:IDictionaryGet Specific value contained in the dictionary at index.- Specified by:
getValuein interfaceIDictionary- Overrides:
getValuein classIdentityDictionary- Parameters:
i- The index to extract the value from- Returns:
- The value contained at the index
-
getValue
public final double getValue(int r, int c, int nCol)Description copied from interface:IDictionaryGet Specific value contain in dictionary at index.- Specified by:
getValuein interfaceIDictionary- Overrides:
getValuein classIdentityDictionary- Parameters:
r- Row targetc- Col targetnCol- nCol in dictionary- Returns:
- value
-
getInMemorySize
public long getInMemorySize()
Description copied from interface:IDictionaryReturns the memory usage of the dictionary.- Specified by:
getInMemorySizein interfaceIDictionary- Overrides:
getInMemorySizein classIdentityDictionary- Returns:
- a long value in number of bytes for the dictionary.
-
getInMemorySize
public static long getInMemorySize(int numberColumns)
-
aggregateRows
public double[] aggregateRows(Builtin fn, int nCol)
Description copied from interface:IDictionaryAggregate all entries in the rows.- Specified by:
aggregateRowsin interfaceIDictionary- Overrides:
aggregateRowsin classIdentityDictionary- Parameters:
fn- The aggregate functionnCol- The number of columns contained in the dictionary.- Returns:
- Aggregates for this dictionary tuples.
-
aggregateCols
public void aggregateCols(double[] c, Builtin fn, IColIndex colIndexes)Description copied from interface:IDictionaryAggregates the columns into the target double array provided.- Specified by:
aggregateColsin interfaceIDictionary- Overrides:
aggregateColsin classIdentityDictionary- Parameters:
c- The target double array, this contains the full number of columns, therefore the colIndexes for this specific dictionary is needed.fn- The function to apply to individual columnscolIndexes- The mapping to the target columns from the individual columns
-
clone
public IDictionary clone()
Description copied from interface:IDictionaryReturns a deep clone of the dictionary.- Specified by:
clonein interfaceIDictionary- Overrides:
clonein classIdentityDictionary- Returns:
- A deep clone
-
getDictType
public IDictionary.DictType getDictType()
Description copied from interface:IDictionaryGet the dictionary type this dictionary is.- Specified by:
getDictTypein interfaceIDictionary- Overrides:
getDictTypein classIdentityDictionary- Returns:
- The Dictionary type this is.
-
sumAllRowsToDouble
public double[] sumAllRowsToDouble(int nrColumns)
Description copied from interface:IDictionaryMethod used as a pre-aggregate of each tuple in the dictionary, to single double values. Note if the number of columns is one the actual dictionaries values are simply returned.- Specified by:
sumAllRowsToDoublein interfaceIDictionary- Overrides:
sumAllRowsToDoublein classIdentityDictionary- Parameters:
nrColumns- The number of columns in the ColGroup to know how to get the values from the dictionary.- Returns:
- a double array containing the row sums from this dictionary.
-
sumAllRowsToDoubleWithDefault
public double[] sumAllRowsToDoubleWithDefault(double[] defaultTuple)
Description copied from interface:IDictionaryDo exactly the same as the sumAllRowsToDouble but also sum the array given to a extra index in the end of the array.- Specified by:
sumAllRowsToDoubleWithDefaultin interfaceIDictionary- Overrides:
sumAllRowsToDoubleWithDefaultin classIdentityDictionary- Parameters:
defaultTuple- The default row to sum in the end index returned.- Returns:
- a double array containing the row sums from this dictionary.
-
sumAllRowsToDoubleWithReference
public double[] sumAllRowsToDoubleWithReference(double[] reference)
Description copied from interface:IDictionaryMethod used as a pre-aggregate of each tuple in the dictionary, to single double values with a reference.- Specified by:
sumAllRowsToDoubleWithReferencein interfaceIDictionary- Overrides:
sumAllRowsToDoubleWithReferencein classIdentityDictionary- Parameters:
reference- The reference values to add to each cell.- Returns:
- a double array containing the row sums from this dictionary.
-
sumAllRowsToDoubleSq
public double[] sumAllRowsToDoubleSq(int nrColumns)
Description copied from interface:IDictionaryMethod used as a pre-aggregate of each tuple in the dictionary, to single double values. Note if the number of columns is one the actual dictionaries values are simply returned.- Specified by:
sumAllRowsToDoubleSqin interfaceIDictionary- Overrides:
sumAllRowsToDoubleSqin classIdentityDictionary- Parameters:
nrColumns- The number of columns in the ColGroup to know how to get the values from the dictionary.- Returns:
- a double array containing the row sums from this dictionary.
-
productAllRowsToDouble
public double[] productAllRowsToDouble(int nCol)
Description copied from interface:IDictionaryMethod to product all rows to a column vector.- Specified by:
productAllRowsToDoublein interfaceIDictionary- Overrides:
productAllRowsToDoublein classIdentityDictionary- Parameters:
nCol- The number of columns in the ColGroup to know how to get the values from the dictionary.- Returns:
- A row product
-
productAllRowsToDoubleWithDefault
public double[] productAllRowsToDoubleWithDefault(double[] defaultTuple)
Description copied from interface:IDictionaryMethod to product all rows to a column vector with a default value added in the end.- Specified by:
productAllRowsToDoubleWithDefaultin interfaceIDictionary- Overrides:
productAllRowsToDoubleWithDefaultin classIdentityDictionary- Parameters:
defaultTuple- The default row that aggregate to last cell- Returns:
- A row product
-
colSum
public void colSum(double[] c, int[] counts, IColIndex colIndexes)Description copied from interface:IDictionaryGet the column sum of the values contained in the dictionary- Specified by:
colSumin interfaceIDictionary- Overrides:
colSumin classIdentityDictionary- Parameters:
c- The output array allocated to contain all column groups output.counts- The counts of the individual tuples.colIndexes- The columns indexes of the parent column group, this indicate where to put the column sum into the c output.
-
colSumSq
public void colSumSq(double[] c, int[] counts, IColIndex colIndexes)Description copied from interface:IDictionaryGet the column sum of the values contained in the dictionary- Specified by:
colSumSqin interfaceIDictionary- Overrides:
colSumSqin classIdentityDictionary- Parameters:
c- The output array allocated to contain all column groups output.counts- The counts of the individual tuples.colIndexes- The columns indexes of the parent column group, this indicate where to put the column sum into the c output.
-
colProduct
public void colProduct(double[] res, int[] counts, IColIndex colIndexes)Description copied from interface:IDictionaryCalculate the column product of the dictionary weighted by counts.- Specified by:
colProductin interfaceIDictionary- Overrides:
colProductin classIdentityDictionary- Parameters:
res- The result vector to put the result intocounts- The weighted count of individual tuplescolIndexes- The column indexes.
-
sum
public double sum(int[] counts, int ncol)Description copied from interface:IDictionaryGet the sum of the values contained in the dictionary- Specified by:
sumin interfaceIDictionary- Overrides:
sumin classIdentityDictionary- Parameters:
counts- The counts of the individual tuplesncol- The number of columns contained- Returns:
- The sum scaled by the counts provided.
-
sumSq
public double sumSq(int[] counts, int ncol)Description copied from interface:IDictionaryGet the square sum of the values contained in the dictionary- Specified by:
sumSqin interfaceIDictionary- Overrides:
sumSqin classIdentityDictionary- Parameters:
counts- The counts of the individual tuplesncol- The number of columns contained- Returns:
- The square sum scaled by the counts provided.
-
sliceOutColumnRange
public IDictionary sliceOutColumnRange(int idxStart, int idxEnd, int previousNumberOfColumns)
Description copied from interface:IDictionaryModify the dictionary by removing columns not within the index range.- Specified by:
sliceOutColumnRangein interfaceIDictionary- Overrides:
sliceOutColumnRangein classIdentityDictionary- Parameters:
idxStart- The column index to start at.idxEnd- The column index to end at (not inclusive)previousNumberOfColumns- The number of columns contained in the dictionary.- Returns:
- A dictionary containing the sliced out columns values only.
-
containsValue
public boolean containsValue(double pattern)
Description copied from interface:IDictionaryDetect if the dictionary contains a specific value.- Specified by:
containsValuein interfaceIDictionary- Overrides:
containsValuein classIdentityDictionary- Parameters:
pattern- The value to search for- Returns:
- true if the value is contained else false.
-
getNumberNonZeros
public long getNumberNonZeros(int[] counts, int nCol)Description copied from interface:IDictionaryCalculate the number of non zeros in the dictionary. The number of non zeros should be scaled with the counts given. This gives the exact number of non zero values in the parent column group.- Specified by:
getNumberNonZerosin interfaceIDictionary- Overrides:
getNumberNonZerosin classIdentityDictionary- Parameters:
counts- The counts of each dictionary entrynCol- The number of columns in this dictionary- Returns:
- The nonZero count
-
getMBDict
public MatrixBlockDictionary getMBDict(int nCol)
Description copied from interface:IDictionaryGet this dictionary as a MatrixBlock dictionary. This allows us to use optimized kernels coded elsewhere in the system, such as matrix multiplication. Return null if the matrix is empty.- Specified by:
getMBDictin interfaceIDictionary- Overrides:
getMBDictin classIdentityDictionary- Parameters:
nCol- The number of columns contained in this column group.- Returns:
- A Dictionary containing a MatrixBlock.
-
getString
public String getString(int colIndexes)
Description copied from interface:IDictionaryGet a string representation of the dictionary, that considers the layout of the data.- Specified by:
getStringin interfaceIDictionary- Overrides:
getStringin classIdentityDictionary- Parameters:
colIndexes- The number of columns in the dictionary.- Returns:
- A string that is nicer to print.
-
toString
public String toString()
- Overrides:
toStringin classIdentityDictionary
-
scaleTuples
public IDictionary scaleTuples(int[] scaling, int nCol)
Description copied from interface:IDictionaryScale all tuples contained in the dictionary by the scaling factor given in the int list.- Specified by:
scaleTuplesin interfaceIDictionary- Overrides:
scaleTuplesin classIdentityDictionary- Parameters:
scaling- The amount to multiply the given tuples withnCol- The number of columns contained in this column group.- Returns:
- A New dictionary (since we don't want to modify the underlying dictionary)
-
write
public void write(DataOutput out) throws IOException
Description copied from interface:IDictionaryWrite the dictionary to a DataOutput.- Specified by:
writein interfaceIDictionary- Overrides:
writein classIdentityDictionary- Parameters:
out- the output sink to write the dictionary to.- Throws:
IOException- if the sink fails.
-
read
public static IdentityDictionary read(DataInput in) throws IOException
- Throws:
IOException
-
getExactSizeOnDisk
public long getExactSizeOnDisk()
Description copied from interface:IDictionaryCalculate the space consumption if the dictionary is stored on disk.- Specified by:
getExactSizeOnDiskin interfaceIDictionary- Overrides:
getExactSizeOnDiskin classIdentityDictionary- Returns:
- the long count of bytes to store the dictionary.
-
replace
public IDictionary replace(double pattern, double replace, int nCol)
Description copied from interface:IDictionaryMake a copy of the values, and replace all values that match pattern with replacement value. If needed add a new column index.- Specified by:
replacein interfaceIDictionary- Overrides:
replacein classIdentityDictionary- Parameters:
pattern- The value to look forreplace- The value to replace the other value withnCol- The number of columns contained in the dictionary.- Returns:
- A new Column Group, reusing the index structure but with new values.
-
replaceWithReference
public IDictionary replaceWithReference(double pattern, double replace, double[] reference)
Description copied from interface:IDictionaryMake a copy of the values, and replace all values that match pattern with replacement value. If needed add a new column index. With reference such that each value in the dict is considered offset by the values contained in the reference.- Specified by:
replaceWithReferencein interfaceIDictionary- Overrides:
replaceWithReferencein classIdentityDictionary- Parameters:
pattern- The value to look forreplace- The value to replace the other value withreference- The reference tuple to add to all entries when replacing- Returns:
- A new Column Group, reusing the index structure but with new values.
-
getSparsity
public double getSparsity()
Description copied from interface:IDictionaryGet the sparsity of the dictionary.- Specified by:
getSparsityin interfaceIDictionary- Overrides:
getSparsityin classIdentityDictionary- Returns:
- a sparsity between 0 and 1
-
preaggValuesFromDense
public IDictionary preaggValuesFromDense(int numVals, IColIndex colIndexes, IColIndex aggregateColumns, double[] b, int cut)
Description copied from interface:IDictionaryPre Aggregate values for Right Matrix Multiplication.- Specified by:
preaggValuesFromDensein interfaceIDictionary- Overrides:
preaggValuesFromDensein classIdentityDictionary- Parameters:
numVals- The number of values contained in this dictionarycolIndexes- The column indexes that is associated with the parent column groupaggregateColumns- The column to aggregate, this is preprocessed, to find remove consideration for empty columnsb- The values in the right hand side matrixcut- The number of columns in b.- Returns:
- A new dictionary with the pre aggregated values.
-
addToEntryVectorized
public void addToEntryVectorized(double[] v, int f1, int f2, int f3, int f4, int f5, int f6, int f7, int f8, int t1, int t2, int t3, int t4, int t5, int t6, int t7, int t8, int nCol)Description copied from interface:IDictionaryVectorized add to entry, this call helps with a bit of locality for the cache.- Specified by:
addToEntryVectorizedin interfaceIDictionary- Overrides:
addToEntryVectorizedin classIdentityDictionary- Parameters:
v- The target dictionary (dense double array)f1- From index 1f2- From index 2f3- From index 3f4- From index 4f5- From index 5f6- From index 6f7- From index 7f8- From index 8t1- To index 1t2- To index 2t3- To index 3t4- To index 4t5- To index 5t6- To index 6t7- To index 7t8- To index 8nCol- Number of columns in the dictionary
-
addToEntry
public void addToEntry(double[] v, int fr, int to, int nCol, int rep)Description copied from interface:IDictionaryAdds the dictionary entry from this dictionary to the v dictionary rep times.- Specified by:
addToEntryin interfaceIDictionary- Overrides:
addToEntryin classIdentityDictionary- Parameters:
v- The target dictionary (dense double array)fr- The from index is the tuple index to copy from.to- The to index is the row index to copy into.nCol- The number of columns in both casesrep- The number of repetitions to apply (simply multiply do not loop)
-
equals
public boolean equals(IDictionary o)
Description copied from interface:IDictionaryIndicate if the other dictionary is equal to this.- Specified by:
equalsin interfaceIDictionary- Overrides:
equalsin classIdentityDictionary- Parameters:
o- The other object- Returns:
- If it is equal
-
-