BaseTokenizer
- class lightautoml.text.tokenizer.BaseTokenizer(n_jobs=4, to_string=True, **kwargs)[source]
Bases:
object
Base class for tokenizer method.
- __init__(n_jobs=4, to_string=True, **kwargs)[source]
Tokenization with simple text cleaning and preprocessing.