NumpyDataset

class lightautoml.dataset.np_pd_dataset.NumpyDataset(data, features=(), roles=None, task=None, **kwargs)[source]

Bases: lightautoml.dataset.base.LAMLDataset

Dataset that stores its data as an np.ndarray.

property features

Features list.

Return type

List[str]

property roles

Roles dict.

Return type

Dict[str, ColumnRole]

__init__(data, features=(), roles=None, task=None, **kwargs)[source]

Create dataset from numpy arrays.

Parameters

Note

The behavior depends on the type of the features argument:

  • list - must have the same length as data.shape[1].

  • None - names are generated automatically: feat_0, feat_1 …

  • Prefix - names are generated automatically from the prefix: Prefix_0, Prefix_1 …
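The naming rules above can be illustrated with a small standalone sketch. The helper `resolve_feature_names` is hypothetical and is not part of LightAutoML; it takes the number of columns (`data.shape[1]`) as a plain integer and assumes that the prefix is a string and that a length mismatch should raise an error.

```python
from typing import List, Optional, Union


def resolve_feature_names(
    features: Union[List[str], str, None], n_cols: int
) -> List[str]:
    """Hypothetical helper mirroring the three documented cases."""
    if isinstance(features, list):
        # Case 1: explicit list, one name per column.
        if len(features) != n_cols:
            raise ValueError(
                f"expected {n_cols} feature names, got {len(features)}"
            )
        return list(features)
    # Case 2 (None) falls back to the "feat" prefix;
    # case 3 uses the given prefix (assumed to be a string).
    prefix = "feat" if features is None else features
    return [f"{prefix}_{i}" for i in range(n_cols)]


names_default = resolve_feature_names(None, 3)
names_prefixed = resolve_feature_names("Prefix", 2)
names_explicit = resolve_feature_names(["age", "income"], 2)
```

Here `names_default` holds `feat_0`, `feat_1`, `feat_2`, and `names_prefixed` holds `Prefix_0`, `Prefix_1`.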

The behavior depends on the type of the roles argument:

  • list - must have the same length as data.shape[1].

  • None - NumericRole(np.float32) is set automatically.

  • ColumnRole - a single role.

  • dict.
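The role cases in the second list can be sketched in the same way. Everything below is illustrative: `StandInRole` is a stand-in for a `ColumnRole`, `DEFAULT_ROLE` stands in for `NumericRole(np.float32)`, and `resolve_roles` is a hypothetical helper. The sketch assumes that a single role is applied to every feature and that a dict maps feature names to roles, neither of which is spelled out in the note.

```python
from dataclasses import dataclass
from typing import Dict, List, Union


@dataclass(frozen=True)
class StandInRole:
    """Stand-in for a ColumnRole; not the LightAutoML class."""
    name: str
    dtype: str = "float32"


# Stands in for NumericRole(np.float32), the documented default.
DEFAULT_ROLE = StandInRole("Numeric", "float32")


def resolve_roles(
    roles: Union[List[StandInRole], StandInRole, Dict[str, StandInRole], None],
    features: List[str],
) -> Dict[str, StandInRole]:
    """Hypothetical helper mirroring the four documented cases."""
    if roles is None:
        # None: every feature gets the default numeric role.
        return {name: DEFAULT_ROLE for name in features}
    if isinstance(roles, StandInRole):
        # Single role: assumed to apply to every feature.
        return {name: roles for name in features}
    if isinstance(roles, list):
        # List: one role per column, in column order.
        if len(roles) != len(features):
            raise ValueError(
                f"expected {len(features)} roles, got {len(roles)}"
            )
        return dict(zip(features, roles))
    if isinstance(roles, dict):
        # Dict: assumed to map feature name -> role.
        return {name: roles[name] for name in features}
    raise TypeError(f"unsupported roles type: {type(roles).__name__}")


features = ["feat_0", "feat_1"]
category = StandInRole("Category", "object")

roles_default = resolve_roles(None, features)
roles_single = resolve_roles(category, features)
roles_listed = resolve_roles([DEFAULT_ROLE, category], features)
roles_mapped = resolve_roles({"feat_0": category, "feat_1": DEFAULT_ROLE}, features)
```

In every case the result is a mapping from feature name to role, which matches the `Dict[str, ColumnRole]` return type of the `roles` property.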

set_data(data, features=(), roles=None)[source]

Set data, features, and roles in place for an empty dataset.

Parameters

Note

The behavior depends on the type of the features argument:

  • list - must have the same length as data.shape[1].

  • None - names are generated automatically: feat_0, feat_1 …

  • Prefix - names are generated automatically from the prefix: Prefix_0, Prefix_1 …

The behavior depends on the type of the roles argument:

  • list - must have the same length as data.shape[1].

  • None - NumericRole(np.float32) is set automatically.

  • ColumnRole - a single role.

  • dict.

to_numpy()[source]

No-op conversion to numpy: the dataset is already a NumpyDataset.

Return type

NumpyDataset

Returns

The same NumpyDataset.

to_csr()[source]

Convert to CSRSparseDataset.

Return type

CSRSparseDataset

Returns

Same dataset in CSRSparseDataset format.

to_pandas()[source]

Convert to PandasDataset.

Return type

PandasDataset

Returns

Same dataset in PandasDataset format.

static from_dataset(dataset)[source]

Convert an arbitrary dataset to NumpyDataset.

Return type

NumpyDataset

Returns

The dataset in NumpyDataset format.