Class Tokenizer
- java.lang.Object
 - org.apache.sysds.runtime.transform.tokenize.Tokenizer
 
 
- All Implemented Interfaces:
 Serializable
public class Tokenizer extends Object implements Serializable
- See Also:
 - Serialized Form
 
 
Field Summary
Modifier and Type          Field
static int                 TOKENIZE_NUM_BLOCKS
Method Summary
Modifier and Type          Method
void                       allocateInternalRepresentation(int numDocuments)
FrameBlock                 apply(FrameBlock out, int k)
void                       build(FrameBlock in, int k)
List<DependencyTask<?>>    getBuildTasks(FrameBlock in)
int                        getMaxNumRows(int inRows)
long                       getNumCols()
int                        getNumRowsEstimate()
Types.ValueType[]          getSchema()
FrameBlock                 tokenize(FrameBlock in)
FrameBlock                 tokenize(FrameBlock in, int k)
 
Method Detail
getSchema
public Types.ValueType[] getSchema()
 
getMaxNumRows
public int getMaxNumRows(int inRows)
 
getNumRowsEstimate
public int getNumRowsEstimate()
 
getNumCols
public long getNumCols()
 
allocateInternalRepresentation
public void allocateInternalRepresentation(int numDocuments)
 
tokenize
public FrameBlock tokenize(FrameBlock in)
 
tokenize
public FrameBlock tokenize(FrameBlock in, int k)
 
apply
public FrameBlock apply(FrameBlock out, int k)
 
getBuildTasks
public List<DependencyTask<?>> getBuildTasks(FrameBlock in)
 
build
public void build(FrameBlock in, int k)
 