PCATransformer

class lightautoml.transformers.decomposition.PCATransformer(subs=None, random_state=42, n_components=500)

Bases: LAMLTransformer

Principal component analysis (PCA) transformer for dimensionality reduction.

Parameters:
  • subs (Optional[int]) – Size of the subsample used to fit the algorithm. If None, the full data is used.

  • random_state (int) – Random state used to draw the subsample.

  • n_components (int) – Number of PCA components.
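
As a rough illustration of how subs and random_state interact, the sketch below draws a reproducible row subsample using only the Python standard library. The helper name take_subsample is hypothetical, and this is not LightAutoML's internal code. It only mirrors the documented semantics: None means the full data, and a fixed seed gives a repeatable subsample. Returning all rows when subs exceeds the row count is an assumption.

```python
import random
from typing import List, Optional, Sequence


def take_subsample(rows: Sequence, subs: Optional[int], random_state: int = 42) -> List:
    """Return at most `subs` rows, chosen reproducibly (hypothetical helper)."""
    # subs=None means "use the full data", as in the parameter description.
    # Assumption: asking for more rows than exist also returns everything.
    if subs is None or subs >= len(rows):
        return list(rows)
    # A dedicated generator keeps the draw independent of global random state.
    rng = random.Random(random_state)
    idx = sorted(rng.sample(range(len(rows)), subs))
    return [rows[i] for i in idx]


rows = [[float(i), float(i) * 2.0] for i in range(10)]
full = take_subsample(rows, subs=None)
part_a = take_subsample(rows, subs=4, random_state=42)
part_b = take_subsample(rows, subs=4, random_state=42)
```

Calling the helper twice with the same random_state yields the same rows, which is the property the random_state parameter is meant to provide.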

property features

List of output feature names.

fit(dataset)

Fit the algorithm on the dataset.

Parameters:

dataset (Union[NumpyDataset, PandasDataset]) – Pandas or Numpy dataset of text features.

Returns:

Self.

transform(dataset)

Transform input dataset to PCA representation.

Parameters:

dataset (Union[NumpyDataset, PandasDataset]) – Pandas or Numpy dataset of text features.

Return type:

NumpyDataset

Returns:

Numpy dataset with text embeddings.
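
The fit and transform steps described above can be sketched with plain NumPy. This is a minimal illustration of PCA via SVD on mean-centered data, assuming a dense 2-D array as input. The class name TinyPCA is hypothetical and the code is not LightAutoML's implementation, which may differ in details such as the solver or how the component count is capped.

```python
import numpy as np


class TinyPCA:
    """Minimal PCA via SVD (hypothetical illustration, not the library class)."""

    def __init__(self, n_components: int = 2):
        self.n_components = n_components
        self.mean_ = None
        self.components_ = None

    def fit(self, X: np.ndarray) -> "TinyPCA":
        # Center the data; PCA directions are defined on mean-centered features.
        self.mean_ = X.mean(axis=0)
        # Rows of vt are the principal directions, ordered by singular value.
        _, _, vt = np.linalg.svd(X - self.mean_, full_matrices=False)
        # Assumption: cap the component count by what the data can support.
        k = min(self.n_components, vt.shape[0])
        self.components_ = vt[:k]
        return self  # returning self allows fit(...).transform(...) chaining

    def transform(self, X: np.ndarray) -> np.ndarray:
        # Project centered data onto the stored principal directions.
        return (X - self.mean_) @ self.components_.T


rng = np.random.RandomState(42)
X = rng.normal(size=(100, 5))
Z = TinyPCA(n_components=3).fit(X).transform(X)
```

Here Z has one row per input row and one column per retained component, which matches the shape of the output that transform is documented to return.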