read_data

lightautoml.reader.tabular_batch_generator.read_data(data, features_names=None, n_jobs=1, read_csv_params=None)[source]

Get DataFrame from different data formats.

Note

Supported now data formats:

  • Path to .csv, .parquet, .feather files.

  • ndarray, or dict of ndarray. For example, {'data': X...}. In this case, roles are optional, but train_features and valid_features required.

  • pandas.DataFrame.

Parameters:
Return type:

Tuple[DataFrame, Optional[dict]]

Returns:

Tuple with read data and new roles mapping.