Class AColGroupValue

    • Method Detail

      • decompressToDenseBlock

        public final void decompressToDenseBlock​(DenseBlock db,
                                                 int rl,
                                                 int ru,
                                                 int offR,
                                                 int offC)
        Description copied from class: AColGroup
        Decompress into the DenseBlock. (no NNZ handling)
        Specified by:
        decompressToDenseBlock in class AColGroup
        Parameters:
        db - Target DenseBlock
        rl - Row to start decompression from
        ru - Row to end decompression at
        offR - Row offset into the target to decompress
        offC - Column offset into the target to decompress
      • decompressToSparseBlock

        public final void decompressToSparseBlock​(SparseBlock sb,
                                                  int rl,
                                                  int ru,
                                                  int offR,
                                                  int offC)
        Description copied from class: AColGroup
        Decompress into the SparseBlock. (no NNZ handling) Note this method is allowing to calls to append since it is assumed that the sparse column indexes are sorted afterwards
        Specified by:
        decompressToSparseBlock in class AColGroup
        Parameters:
        sb - Target SparseBlock
        rl - Row to start decompression from
        ru - Row to end decompression at
        offR - Row offset into the target to decompress
        offC - Column offset into the target to decompress
      • getNumValues

        public int getNumValues()
        Description copied from class: AColGroup
        Obtain number of distinct tuples in contained sets of values associated with this column group. If the column group is uncompressed the number or rows is returned.
        Specified by:
        getNumValues in class AColGroup
        Returns:
        the number of distinct sets of values associated with the bitmaps in this column group
      • getCounts

        public final int[] getCounts()
        Returns the counts of values inside the dictionary. If already calculated it will return the previous counts. This produce an overhead in cases where the count is calculated, but the overhead will be limited to number of distinct tuples in the dictionary. The returned counts always contains the number of zero tuples as well if there are some contained, even if they are not materialized.
        Returns:
        The count of each value in the MatrixBlock.
      • getCachedCounts

        public final int[] getCachedCounts()
        Get the cached counts. If they are not materialized or the garbage collector have removed them, then null is returned.
        Returns:
        The counts or null.
      • getExactSizeOnDisk

        public long getExactSizeOnDisk()
        Description copied from class: AColGroup
        Returns the exact serialized size of column group. This can be used for example for buffer preallocation.
        Overrides:
        getExactSizeOnDisk in class AColGroup
        Returns:
        exact serialized size for column group
      • getCounts

        public abstract int[] getCounts​(int[] out)
      • computeColSums

        public void computeColSums​(double[] c,
                                   int nRows)
        Description copied from class: AColGroup
        Compute the column sum
        Specified by:
        computeColSums in class AColGroup
        Parameters:
        c - The array to add the column sum to.
        nRows - The number of rows in the column group.
      • copy

        public AColGroupValue copy()
        Description copied from class: AColGroup
        Get a copy of this column group note this is only a shallow copy. Meaning only the object wrapping index structures, column indexes and dictionaries are copied.
        Specified by:
        copy in class AColGroup
        Returns:
        Get a copy of this column group.
      • containsValue

        public boolean containsValue​(double pattern)
        Description copied from class: AColGroup
        Detect if the column group contains a specific value.
        Specified by:
        containsValue in class AColGroup
        Parameters:
        pattern - The value to look for.
        Returns:
        boolean saying true if the value is contained.
      • getNumberNonZeros

        public long getNumberNonZeros​(int nRows)
        Description copied from class: AColGroup
        Get the number of nonZeros contained in this column group.
        Specified by:
        getNumberNonZeros in class AColGroup
        Parameters:
        nRows - The number of rows in the column group, this is used for groups that does not contain information about how many rows they have.
        Returns:
        The nnz.
      • forceMatrixBlockDictionary

        public void forceMatrixBlockDictionary()
      • rightMultByMatrix

        public final AColGroup rightMultByMatrix​(MatrixBlock right)
        Description copied from class: AColGroup
        Right matrix multiplication with this column group. This method can return null, meaning that the output overlapping group would have been empty.
        Specified by:
        rightMultByMatrix in class AColGroup
        Parameters:
        right - The MatrixBlock on the right of this matrix multiplication
        Returns:
        The new Column Group or null that is the result of the matrix multiplication.
      • estimateInMemorySize

        public long estimateInMemorySize()
        Description copied from class: AColGroup
        Get the upper bound estimate of in memory allocation for the column group.
        Overrides:
        estimateInMemorySize in class AColGroup
        Returns:
        an upper bound on the number of bytes used to store this ColGroup in memory.
      • replace

        public AColGroup replace​(double pattern,
                                 double replace)
        Description copied from class: AColGroup
        Make a copy of the column group values, and replace all values that match pattern with replacement value.
        Specified by:
        replace in class AColGroup
        Parameters:
        pattern - The value to look for
        replace - The value to replace the other value with
        Returns:
        A new Column Group, reusing the index structure but with new values.
      • centralMoment

        public CM_COV_Object centralMoment​(CMOperator op,
                                           int nRows)
        Description copied from class: AColGroup
        Central Moment instruction executed on a column group.
        Specified by:
        centralMoment in class AColGroup
        Parameters:
        op - The Operator to use.
        nRows - The number of rows contained in the ColumnGroup.
        Returns:
        A Central Moment object.
      • rexpandCols

        public AColGroup rexpandCols​(int max,
                                     boolean ignore,
                                     boolean cast,
                                     int nRows)
        Description copied from class: AColGroup
        Expand the column group to multiple columns. (one hot encode the column group)
        Specified by:
        rexpandCols in class AColGroup
        Parameters:
        max - The number of columns to expand to and cutoff values at.
        ignore - If zero and negative values should be ignored.
        cast - If the double values contained should be cast to whole numbers.
        nRows - The number of rows in the column group.
        Returns:
        A new column group containing max number of columns.