- All Implemented Interfaces:
Serializable
, org.apache.spark.api.java.function.PairFlatMapFunction<Iterator<scala.Tuple2<Long,FrameBlock>>,Integer,Object>
- Enclosing class:
- MultiReturnParameterizedBuiltinSPInstruction
public static class MultiReturnParameterizedBuiltinSPInstruction.TransformEncodeBuildFunction
extends Object
implements org.apache.spark.api.java.function.PairFlatMapFunction<Iterator<scala.Tuple2<Long,FrameBlock>>,Integer,Object>
This function pre-aggregates distinct values of recoded columns per partition (part of distributed recode map
construction, used for recoding, binning and dummy coding). We operate directly over schema-specific objects to
avoid unnecessary string conversion, as well as reduce memory overhead and shuffle.
- See Also:
- Serialized Form