Class Tokenizer
- java.lang.Object
-
- org.apache.sysds.runtime.transform.tokenize.Tokenizer
-
- All Implemented Interfaces:
Serializable
public class Tokenizer extends Object implements Serializable
- See Also:
- Serialized Form
-
-
Field Summary
Fields Modifier and Type Field Description static intTOKENIZE_NUM_BLOCKS
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description voidallocateInternalRepresentation(int numDocuments)FrameBlockapply(FrameBlock out, int k)voidbuild(FrameBlock in, int k)List<DependencyTask<?>>getBuildTasks(FrameBlock in)intgetMaxNumRows(int inRows)longgetNumCols()intgetNumRowsEstimate()Types.ValueType[]getSchema()FrameBlocktokenize(FrameBlock in)FrameBlocktokenize(FrameBlock in, int k)
-
-
-
Method Detail
-
getSchema
public Types.ValueType[] getSchema()
-
getMaxNumRows
public int getMaxNumRows(int inRows)
-
getNumRowsEstimate
public int getNumRowsEstimate()
-
getNumCols
public long getNumCols()
-
allocateInternalRepresentation
public void allocateInternalRepresentation(int numDocuments)
-
tokenize
public FrameBlock tokenize(FrameBlock in)
-
tokenize
public FrameBlock tokenize(FrameBlock in, int k)
-
apply
public FrameBlock apply(FrameBlock out, int k)
-
getBuildTasks
public List<DependencyTask<?>> getBuildTasks(FrameBlock in)
-
build
public void build(FrameBlock in, int k)
-
-