Class KMVSketch

  • All Implemented Interfaces:
    MatrixSketch

    public class KMVSketch
    extends CountDistinctSketch
    KMV synopsis (k minimum values) for distinct-value estimation.

    Reference: Kevin S. Beyer, Peter J. Haas, Berthold Reinwald, Yannis Sismanis, Rainer Gemulla: On synopses for distinct-value estimation under multiset operations. SIGMOD 2007.

    TODO: Add multi-threaded version
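
The k-minimum-values idea can be illustrated with a small standalone sketch. This is not the SystemDS implementation: the class name KmvExample, the helper names, and the choice of hash mixer are assumptions made for illustration only. Each value is hashed to a number in (0, 1], the k smallest distinct hashes are kept, and the count is estimated as (k - 1) / U_k, where U_k is the k-th smallest hash (the unbiased estimator discussed in the cited paper).

```java
import java.util.TreeSet;

// Hypothetical illustration of a KMV estimator; not part of SystemDS.
final class KmvExample {

    // 64-bit mixing step (SplitMix64 finalizer), used here as a stand-in hash.
    static long mix(long z) {
        z += 0x9E3779B97F4A7C15L;
        z = (z ^ (z >>> 30)) * 0xBF58476D1CE4E5B9L;
        z = (z ^ (z >>> 27)) * 0x94D049BB133111EBL;
        return z ^ (z >>> 31);
    }

    // Map the top 53 bits of a hash to a double in (0, 1].
    static double toUnit(long h) {
        return ((h >>> 11) + 1) * 0x1.0p-53;
    }

    // Estimate the number of distinct entries in values using at most k hashes.
    static double estimateDistinct(double[] values, int k) {
        if (k < 2) {
            throw new IllegalArgumentException("k must be at least 2");
        }
        TreeSet<Double> smallest = new TreeSet<>();
        for (double v : values) {
            // Adding 0.0 folds -0.0 into 0.0 so both hash identically.
            double u = toUnit(mix(Double.doubleToLongBits(v + 0.0)));
            if (smallest.size() < k) {
                smallest.add(u);
            } else if (u < smallest.last() && smallest.add(u)) {
                // A new, smaller hash entered the synopsis: evict the largest.
                smallest.pollLast();
            }
        }
        // Fewer than k distinct hashes seen: the synopsis size is the count.
        if (smallest.size() < k) {
            return smallest.size();
        }
        return (k - 1) / smallest.last();
    }
}
```

With fewer than k distinct inputs, for example KmvExample.estimateDistinct(new double[] {1, 2, 2, 3, 3, 3}, 64), the sketch falls back to counting the stored hashes directly. For larger inputs the result is an estimate whose accuracy improves as k grows, at the cost of a larger synopsis.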
    • Constructor Detail

      • KMVSketch

        public KMVSketch​(Operator op)
    • Method Detail

      • getValue

        public MatrixBlock getValue​(MatrixBlock blkIn)
        Description copied from interface: MatrixSketch
        Get scalar distinct count from an input matrix block.
        Parameters:
        blkIn - An input block to estimate the number of distinct values in
        Returns:
        The result matrix block containing the distinct count estimate
      • getValueFromSketch

        public MatrixBlock getValueFromSketch​(CorrMatrixBlock arg0)
        Description copied from interface: MatrixSketch
        Obtain the matrix of distinct count values from an estimation sketch. Used for estimating distinct values in rows or columns.
        Parameters:
        arg0 - The sketch block to extract the count from
        Returns:
        The result matrix block
      • create

        public CorrMatrixBlock create​(MatrixBlock blkIn)
        Description copied from interface: MatrixSketch
        Create an initial sketch of a given block.
        Parameters:
        blkIn - A block to process
        Returns:
        A sketch