public class CompressionSettings extends Object
Modifier and Type | Field and Description |
---|---|
boolean |
allowSharedDictionary
Share DDC Dictionaries between ColGroups.
|
static int |
BITMAP_BLOCK_SZ
Size of the blocks used in a blocked bitmap representation.
|
double |
coCodePercentage
A Cocode parameter that differ in behavior based on compression method, in general it is a value that reflects
aggressively likely coCoding is used.
|
CoCoderFactory.PartitionerType |
columnPartitioner
The selected method for column partitioning used in CoCoding compressed columns
|
CostEstimatorFactory.CostType |
costComputationType
The cost computation type for the compression
|
SampleEstimatorFactory.EstimationType |
estimationType
The sample type used for sampling
|
boolean |
isInSparkInstruction
Is a spark instruction
|
boolean |
lossy
True if lossy compression is enabled
|
int |
maxColGroupCoCode
The maximum number of columns CoCoded allowed
|
int |
maxSampleSize
The maximum size of the sample extracted.
|
double |
minimumCompressionRatio
The minimum compression ratio to achieve.
|
int |
minimumSampleSize
The minimum size of the sample extracted.
|
double |
samplingRatio
The sampling ratio used when choosing ColGroups.
|
int |
seed
If the seed is -1 then the system used system millisecond time and class hash for seeding.
|
boolean |
sortValuesByLength
Sorting of values by physical length helps by 10-20%, especially for serial, while slight performance decrease
for parallel incl multi-threaded, hence not applied for distributed operations (also because compression time +
garbage collection increases)
|
boolean |
transposed
Transpose input matrix, to optimize access when extracting bitmaps.
|
String |
transposeInput
Boolean specifying which transpose setting is used, can be auto, true or false
|
EnumSet<AColGroup.CompressionType> |
validCompressions
Valid Compressions List, containing the ColGroup CompressionTypes that are allowed to be used for the compression
Default is to always allow for Uncompromisable ColGroup.
|
public static final int BITMAP_BLOCK_SZ
public final boolean sortValuesByLength
public final double samplingRatio
public final boolean allowSharedDictionary
public final String transposeInput
public final int seed
public final boolean lossy
public final CoCoderFactory.PartitionerType columnPartitioner
public final CostEstimatorFactory.CostType costComputationType
public final int maxColGroupCoCode
public final double coCodePercentage
public final EnumSet<AColGroup.CompressionType> validCompressions
public final int minimumSampleSize
public final int maxSampleSize
public final SampleEstimatorFactory.EstimationType estimationType
public boolean transposed
public final double minimumCompressionRatio
public final boolean isInSparkInstruction
Copyright © 2021 The Apache Software Foundation. All rights reserved.