lightautoml.transformers

Basic feature generation steps and helper utils.

Base Classes

LAMLTransformer

Base class for transformers (similar to sklearn transformers, but operating on datasets).

SequentialTransformer

Transformer that holds a list of transformers and applies them one by one, sequentially.

UnionTransformer

Transformer that applies a sequence of transformers in parallel to the dataset and concatenates the results.
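
The difference between sequential and parallel (union) composition can be sketched in a few lines of plain Python. This is only an illustration of the two patterns, not the library's implementation: the dict-based `Dataset`, the helper names `sequential` and `union`, and the toy steps `double` and `square` are all hypothetical.

```python
from typing import Callable, Dict, List

# A toy "dataset": column name -> list of values (an assumption for this sketch).
Dataset = Dict[str, List[float]]
Transform = Callable[[Dataset], Dataset]


def sequential(steps: List[Transform]) -> Transform:
    """Chain steps so that each one receives the previous step's output."""
    def apply(data: Dataset) -> Dataset:
        for step in steps:
            data = step(data)
        return data
    return apply


def union(branches: List[Transform]) -> Transform:
    """Run every branch on the same input and concatenate the output columns."""
    def apply(data: Dataset) -> Dataset:
        merged: Dataset = {}
        for branch in branches:
            merged.update(branch(data))
        return merged
    return apply


def double(data: Dataset) -> Dataset:
    return {f"double__{name}": [2 * v for v in col] for name, col in data.items()}


def square(data: Dataset) -> Dataset:
    return {f"square__{name}": [v * v for v in col] for name, col in data.items()}


data = {"x": [1.0, 2.0, 3.0]}
chained = sequential([double, square])(data)  # square applied to the output of double
stacked = union([double, square])(data)       # both outputs side by side
```

In the chained case only one output column survives, while the union keeps one column per branch.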

ColumnsSelector

Select columns to pass to other transformers (or for feature selection).

ColumnwiseUnion

Apply a one-column transformer to each column.

BestOfTransformers

Apply multiple transformers and select the best.

ConvertDataset

Convert the dataset to a given type.

ChangeRoles

Change data roles (including dtypes, etc.).

Numeric

NaNFlags

Create NaN flags.

FillnaMedian

Fill NaN values with the median.

FillInf

Replace inf values with NaN so that they are handled as missing values.
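
The three missing-value steps above combine naturally: infinities become NaN, NaN positions are recorded as flags, and the gaps are then filled with a median learned from the training data. The sketch below uses only the standard library; the function names are hypothetical and the code does not reproduce the library's classes.

```python
import math
from statistics import median
from typing import List


def fill_inf(column: List[float]) -> List[float]:
    """Replace +inf and -inf with NaN so they are treated as missing values."""
    return [math.nan if math.isinf(v) else v for v in column]


def nan_flags(column: List[float]) -> List[int]:
    """Return 1 where the value is missing and 0 elsewhere."""
    return [int(math.isnan(v)) for v in column]


def fit_median(column: List[float]) -> float:
    """Median of the observed (non-NaN) training values."""
    return median(v for v in column if not math.isnan(v))


def fillna(column: List[float], value: float) -> List[float]:
    """Replace every NaN with the given fill value."""
    return [value if math.isnan(v) else v for v in column]


train = [1.0, math.inf, 3.0, math.nan, 10.0]
cleaned = fill_inf(train)            # inf -> NaN
flags = nan_flags(cleaned)           # [0, 1, 0, 1, 0]
train_median = fit_median(cleaned)   # 3.0
filled = fillna(cleaned, train_median)
```

The median is fitted once on training data and reused at inference time, which keeps the fill value stable across datasets.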

LogOdds

Convert probabilities to log-odds.
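
The log-odds (logit) of a probability p is log(p / (1 - p)). A minimal sketch follows; the clipping constant `eps`, which keeps the result finite at p = 0 and p = 1, is an assumption of this example and not a documented library parameter.

```python
import math


def log_odds(p: float, eps: float = 1e-7) -> float:
    """Return log(p / (1 - p)), clipping p to [eps, 1 - eps] (eps is assumed)."""
    p = min(max(p, eps), 1.0 - eps)
    return math.log(p / (1.0 - p))


even = log_odds(0.5)      # probabilities of 0.5 map to zero log-odds
likely = log_odds(0.9)    # positive for p > 0.5
certain = log_odds(1.0)   # finite thanks to clipping
```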

StandardScaler

Classic StandardScaler.
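
Standard scaling subtracts the training mean and divides by the training standard deviation. The sketch below is a standard-library illustration with hypothetical function names; using the population standard deviation is an assumption made here for simplicity.

```python
from statistics import mean, pstdev
from typing import List, Tuple


def fit_scaler(train: List[float]) -> Tuple[float, float]:
    """Learn the mean and (population) standard deviation from training data."""
    return mean(train), pstdev(train)


def scale(values: List[float], mu: float, sigma: float) -> List[float]:
    """Center by mu and divide by sigma."""
    return [(v - mu) / sigma for v in values]


train = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
mu, sigma = fit_scaler(train)
scaled = scale(train, mu, sigma)
```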

QuantileBinning

Discretization of numeric features by quantiles.
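
Quantile discretization learns cut points that split the training values into groups of roughly equal size, then maps each value to the index of its group. The following is a sketch under stated assumptions: the function names are hypothetical, and the "inclusive" quantile method and de-duplication of edges are choices made for this example.

```python
from bisect import bisect_right
from statistics import quantiles
from typing import List


def fit_quantile_edges(train: List[float], n_bins: int) -> List[float]:
    """Inner bin edges: the cut points that split train into n_bins groups."""
    # Duplicate edges (from heavily repeated values) are merged.
    return sorted(set(quantiles(train, n=n_bins, method="inclusive")))


def to_bins(values: List[float], edges: List[float]) -> List[int]:
    """Map each value to the index of the bin it falls into."""
    return [bisect_right(edges, v) for v in values]


train = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
edges = fit_quantile_edges(train, n_bins=4)
bins = to_bins(train, edges)
outliers = to_bins([-100.0, 100.0], edges)  # values outside the train range
```

Values below the first edge or above the last one fall into the first and last bin respectively, so unseen extremes still receive a valid bin index.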

Categorical

LabelEncoder

Simple LabelEncoder in order of frequency.
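
Encoding in order of frequency means the most common category receives the smallest code. A minimal sketch: starting codes at 1, reserving 0 for unseen categories, and breaking ties alphabetically are assumptions of this example, and the function names are hypothetical.

```python
from collections import Counter
from typing import Dict, List


def fit_label_encoder(train: List[str]) -> Dict[str, int]:
    """Assign codes 1, 2, 3, ... from the most to the least frequent category."""
    counts = Counter(train)
    # Sort by descending count; break ties alphabetically for determinism.
    ordered = sorted(counts, key=lambda cat: (-counts[cat], cat))
    return {cat: code for code, cat in enumerate(ordered, start=1)}


def encode(values: List[str], mapping: Dict[str, int], unknown: int = 0) -> List[int]:
    """Look up each value; categories unseen during fitting get `unknown`."""
    return [mapping.get(v, unknown) for v in values]


mapping = fit_label_encoder(["b", "a", "b", "c", "b", "a"])
codes = encode(["a", "b", "c", "z"], mapping)
```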

OHEEncoder

Simple OneHotEncoder over label-encoded categories.

FreqEncoder

Labels are encoded with their frequency in the training data.
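
Frequency encoding replaces each category with how often it occurred in the training data. In this sketch raw counts are used and unseen categories map to 0; both choices, and the function names, are assumptions made for illustration.

```python
from collections import Counter
from typing import Dict, List


def fit_freq_encoder(train: List[str]) -> Dict[str, int]:
    """Count how many times each category appears in the training data."""
    return dict(Counter(train))


def freq_encode(values: List[str], counts: Dict[str, int]) -> List[int]:
    """Replace each category with its training count (0 if never seen)."""
    return [counts.get(v, 0) for v in values]


counts = fit_freq_encoder(["b", "a", "b", "c", "b", "a"])
encoded_freq = freq_encode(["a", "b", "z"], counts)
```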

OrdinalEncoder

Encoding ordinal categories into numbers.

TargetEncoder

Out-of-fold target encoding.
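
In out-of-fold target encoding, each row is encoded with target statistics computed only from rows in the other folds, so a row never sees its own target. The sketch below illustrates the idea with a smoothed mean; the smoothing weight `alpha`, the function name, and the fold assignment are assumptions, and the library's actual encoder may differ.

```python
from collections import defaultdict
from typing import Dict, List


def oof_target_encode(
    cats: List[str], target: List[float], folds: List[int], alpha: float = 1.0
) -> List[float]:
    """Encode each row with the smoothed target mean of its category,
    computed on all folds except the row's own (requires at least 2 folds)."""
    encoded = [0.0] * len(cats)
    for fold in sorted(set(folds)):
        sums: Dict[str, float] = defaultdict(float)
        counts: Dict[str, int] = defaultdict(int)
        total, n = 0.0, 0
        # Gather statistics from the other folds only.
        for c, y, f in zip(cats, target, folds):
            if f != fold:
                sums[c] += y
                counts[c] += 1
                total += y
                n += 1
        prior = total / n
        # Encode the held-out rows; unseen categories fall back to the prior.
        for i, (c, f) in enumerate(zip(cats, folds)):
            if f == fold:
                encoded[i] = (sums[c] + alpha * prior) / (counts[c] + alpha)
    return encoded


cats = ["a", "a", "b", "b"]
target = [1.0, 0.0, 1.0, 1.0]
folds = [0, 1, 0, 1]
encoded_target = oof_target_encode(cats, target, folds)
```

The smoothing term pulls categories with few observations toward the out-of-fold prior, which limits overfitting on rare categories.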

MultiClassTargetEncoder

Out-of-fold target encoding for multiclass tasks.

CatIntersectstions

Build label-encoded intersections of categorical variables.
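
An intersection of categorical variables is a new feature whose value is the combination of the values of two or more columns in the same row. The sketch below builds the combined values for every pair of columns; the combined strings would then be label encoded. The function name, the separators, and the pairwise default are assumptions of this example.

```python
from itertools import combinations
from typing import Dict, List


def cat_intersections(columns: Dict[str, List[str]], degree: int = 2) -> Dict[str, List[str]]:
    """Combine every `degree`-sized group of columns into one crossed column."""
    out: Dict[str, List[str]] = {}
    for names in combinations(sorted(columns), degree):
        rows = zip(*(columns[name] for name in names))
        out["__".join(names)] = ["_".join(values) for values in rows]
    return out


columns = {"city": ["NY", "LA", "NY"], "device": ["ios", "ios", "web"]}
crossed = cat_intersections(columns)
```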

Datetime

TimeToNum

Basic conversion strategy, used in selection one-to-one transformers. Converts a datetime to its difference from a basic date.
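
One common way to turn a datetime into a number is to measure the time elapsed since a fixed reference date. The sketch below is an illustration only: the reference date, the day unit, and the function name are assumptions of this example.

```python
from datetime import datetime

BASE_DATE = datetime(2020, 1, 1)  # assumed reference date for this sketch
SECONDS_PER_DAY = 86400.0


def time_to_num(ts: datetime, base: datetime = BASE_DATE) -> float:
    """Number of days (possibly fractional or negative) between ts and base."""
    return (ts - base).total_seconds() / SECONDS_PER_DAY


after = time_to_num(datetime(2020, 1, 2, 12))   # one and a half days later
before = time_to_num(datetime(2019, 12, 31))    # one day earlier
```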

BaseDiff

Basic conversion strategy, used in selection one-to-one transformers. Computes differences between base date columns and other date columns.

DateSeasons

Basic conversion strategy, used in selection one-to-one transformers. Extracts seasonal components from a datetime.
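
The class name suggests calendar-based features. As an illustration only, seasonal features are often plain calendar components extracted from each timestamp; the set of components and the function name below are assumptions, not the library's documented output.

```python
from datetime import datetime
from typing import Dict


def date_seasons(ts: datetime) -> Dict[str, int]:
    """Extract calendar components that can serve as categorical features."""
    return {
        "year": ts.year,
        "month": ts.month,
        "day": ts.day,
        "weekday": ts.weekday(),  # Monday is 0, Sunday is 6
        "hour": ts.hour,
    }


parts = date_seasons(datetime(2021, 3, 15, 8))
```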

Decompositions

PCATransformer

PCA.

SVDTransformer

TruncatedSVD.

Text

TunableTransformer

Base class for ML transformers.

TfidfTextTransformer

Simple TF-IDF vectorizer.

TokenizerTransformer

Simple tokenizer transformer.

OneToOneTransformer

Out-of-fold SGD model predictions used to reduce the dimensionality of encoded text data.

ConcatTextTransformer

Transformer that concatenates text features.

AutoNLPWrap

Calculate text embeddings.

Image

ImageFeaturesTransformer

Simple image histogram.
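
A simple image histogram counts how many pixels fall into each intensity range. The sketch below works on a flat list of grayscale values in [0, 255]; the bin count, the flat-list input, and the function name are assumptions chosen for illustration and do not describe the library's exact features.

```python
from typing import List


def intensity_histogram(pixels: List[int], n_bins: int = 4, max_value: int = 256) -> List[int]:
    """Count pixels per equal-width intensity bin over [0, max_value)."""
    width = max_value / n_bins
    hist = [0] * n_bins
    for p in pixels:
        # Clamp the index so that the maximum intensity lands in the last bin.
        hist[min(int(p // width), n_bins - 1)] += 1
    return hist


hist = intensity_histogram([0, 10, 64, 128, 200, 255])
```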

AutoCVWrap

Calculate image embeddings.