public class TokenizerPreWhitespaceSplit extends Object implements TokenizerPre
Constructor and Description |
---|
TokenizerPreWhitespaceSplit(List<Integer> idCols, int tokenizeCol, org.apache.wink.json4j.JSONObject params) |
Modifier and Type | Method and Description |
---|---|
List<org.apache.sysds.runtime.transform.tokenize.Tokenizer.Token> | splitToTokens(String text) |
List<org.apache.sysds.runtime.transform.tokenize.Tokenizer.DocumentToTokens> | tokenizePre(FrameBlock in) |
public List<org.apache.sysds.runtime.transform.tokenize.Tokenizer.Token> splitToTokens(String text)
public List<org.apache.sysds.runtime.transform.tokenize.Tokenizer.DocumentToTokens> tokenizePre(FrameBlock in)
Specified by: tokenizePre in interface TokenizerPre
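
The page lists `splitToTokens(String text)` without describing its behavior. The following is a minimal, standalone sketch of what a whitespace split can look like. It does not use the SystemDS classes: `WhitespaceSplitSketch` and its nested `Token` type are hypothetical stand-ins, and both the delimiter (runs of any whitespace) and the recorded start offset are assumptions rather than the documented behavior of `TokenizerPreWhitespaceSplit`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class WhitespaceSplitSketch {

    /** Hypothetical stand-in for a token: its text and its start offset in the input. */
    public static final class Token {
        public final String text;
        public final int startIndex;

        public Token(String text, int startIndex) {
            this.text = text;
            this.startIndex = startIndex;
        }
    }

    // Assumption: a token is any maximal run of non-whitespace characters.
    private static final Pattern NON_WHITESPACE = Pattern.compile("\\S+");

    /** Splits the text on whitespace, keeping each token's start offset. */
    public static List<Token> splitToTokens(String text) {
        List<Token> tokens = new ArrayList<>();
        if (text == null) {
            return tokens; // treat a missing cell as an empty document
        }
        Matcher matcher = NON_WHITESPACE.matcher(text);
        while (matcher.find()) {
            tokens.add(new Token(matcher.group(), matcher.start()));
        }
        return tokens;
    }

    public static void main(String[] args) {
        for (Token token : splitToTokens("  hello   world ")) {
            System.out.println(token.startIndex + ":" + token.text);
        }
    }
}
```

Matching the tokens with `\S+` instead of calling `String.split` avoids the empty leading element that `split` produces for input starting with whitespace, and it yields each token's offset at no extra cost.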
Copyright © 2021 The Apache Software Foundation. All rights reserved.