lightautoml.dataset

Provides an internal interface for working with data.

Dataset Interfaces

base.LAMLColumn

Base class for a (column, role) pair.

base.LAMLDataset

Base class for creating datasets.

np_pd_dataset.NumpyDataset

Dataset that stores its data in np.ndarray format.

np_pd_dataset.PandasDataset

Dataset that contains pd.DataFrame features and pd.Series targets.

np_pd_dataset.CSRSparseDataset

Dataset that contains sparse features and np.ndarray targets.

Roles

A role holds information about a column and determines how that column is processed.
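
To make the idea of a role concrete, here is a minimal, self-contained sketch of role-driven processing. It does not use LightAutoML: `SketchRole`, `process_column`, and the three role constants are hypothetical names invented for illustration, and the processing steps are assumptions rather than the library's behavior.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SketchRole:
    """Hypothetical stand-in for a column role (not the library's ColumnRole)."""
    name: str


# Hypothetical role instances, loosely mirroring role names from this page.
NUMERIC = SketchRole("Numeric")
CATEGORY = SketchRole("Category")
DROP = SketchRole("Drop")


def process_column(values, role):
    """Pick a processing step for a column based solely on its role."""
    if role == NUMERIC:
        # Numeric columns are cast to float.
        return [float(v) for v in values]
    if role == CATEGORY:
        # Category columns are mapped to integer codes in sorted label order.
        codes = {label: i for i, label in enumerate(sorted(set(values)))}
        return [codes[v] for v in values]
    if role == DROP:
        # Dropped columns produce no output.
        return None
    raise ValueError(f"no processing defined for role {role.name!r}")


# Each column is paired with a role; the role alone decides the treatment.
roles = {"age": NUMERIC, "city": CATEGORY, "row_id": DROP}
table = {"age": ["31", "45"], "city": ["Oslo", "Bergen"], "row_id": [1, 2]}

processed = {}
for column, values in table.items():
    result = process_column(values, roles[column])
    if result is not None:
        processed[column] = result
```

In this sketch the same raw values would be handled differently if the column were assigned a different role, which is the point of attaching roles to columns.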

ColumnRole

Abstract class for column role.

NumericRole

Numeric role.

CategoryRole

Category role.

TextRole

Text role.

DatetimeRole

Datetime role.

TargetRole

Target role.

GroupRole

Group role.

DropRole

Drop role.

WeightsRole

Weights role.

FoldsRole

Folds role.

PathRole

Path role.

Utils

Utilities for working with the structure of a dataset.

roles_parser

Parser of roles.
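
One task a role parser might perform is turning a role-to-columns mapping into a column-to-role mapping. The sketch below illustrates that inversion under stated assumptions: `invert_roles` is a hypothetical helper, not the signature or implementation of `roles_parser`, and roles are represented as plain strings rather than role objects.

```python
def invert_roles(roles_by_role):
    """Turn {role: column or [columns]} into {column: role}.

    Hypothetical helper for illustration; not LightAutoML's roles_parser.
    """
    roles_by_column = {}
    for role, columns in roles_by_role.items():
        # Accept a single column name as well as a list of names.
        if isinstance(columns, str):
            columns = [columns]
        for column in columns:
            # A column may carry only one role in this sketch.
            if column in roles_by_column:
                raise ValueError(f"column {column!r} has more than one role")
            roles_by_column[column] = role
    return roles_by_column


parsed = invert_roles(
    {"numeric": ["age", "income"], "category": ["city"], "target": "churn"}
)
```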

get_common_concat

Get concatenation function for datasets of different types.

numpy_and_pandas_concat

Concatenate NumPy and pandas datasets.

concatenate

Dataset concatenation function.
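
The concatenation utilities above suggest a dispatch on dataset type: pick a concatenation function from the types present, then apply it. The following is a rough sketch of one way such a dispatch could be arranged. It is not taken from the library's source: `ArrayData`, `FrameData`, `pick_concat`, and `concat_datasets` are hypothetical stand-ins built on plain Python lists and dicts, and column-wise joining is an assumption.

```python
class ArrayData:
    """Hypothetical array-like dataset: rows of values plus feature names."""

    def __init__(self, rows, features):
        self.rows = [list(r) for r in rows]
        self.features = list(features)

    def to_frame(self):
        # Convert rows into a column-name -> values mapping.
        columns = {
            name: [row[i] for row in self.rows]
            for i, name in enumerate(self.features)
        }
        return FrameData(columns)


class FrameData:
    """Hypothetical frame-like dataset: column name -> list of values."""

    def __init__(self, columns):
        self.columns = {name: list(vals) for name, vals in columns.items()}

    def to_frame(self):
        return self


def concat_arrays(datasets):
    # Same-type case: join the rows of each dataset side by side.
    rows = [sum(parts, []) for parts in zip(*(d.rows for d in datasets))]
    features = [name for d in datasets for name in d.features]
    return ArrayData(rows, features)


def concat_frames(datasets):
    # Same-type case: merge columns (later datasets win on name clashes).
    merged = {}
    for d in datasets:
        merged.update(d.columns)
    return FrameData(merged)


def concat_mixed(datasets):
    # Mixed case: convert everything to the frame type, then concatenate.
    return concat_frames([d.to_frame() for d in datasets])


def pick_concat(datasets):
    """Choose a concatenation function from the set of dataset types."""
    kinds = {type(d) for d in datasets}
    if kinds == {ArrayData}:
        return concat_arrays
    if kinds == {FrameData}:
        return concat_frames
    if kinds == {ArrayData, FrameData}:
        return concat_mixed
    raise TypeError(f"no concatenation defined for {kinds}")


def concat_datasets(datasets):
    # Look up the right function for these types, then apply it.
    return pick_concat(datasets)(datasets)


a = ArrayData(rows=[[1, 2], [3, 4]], features=["x1", "x2"])
f = FrameData({"city": ["Oslo", "Bergen"]})
mixed = concat_datasets([a, f])
same = concat_datasets([a, ArrayData([[5], [6]], ["x3"])])
```

Keeping the type lookup separate from the concatenation itself means a new pairing of dataset types only needs one more branch in the lookup.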