universal_transfer_operator.datasets.file.types.parquet
Module Contents
Classes
Concrete implementation to handle Parquet file type |
- class universal_transfer_operator.datasets.file.types.parquet.ParquetFileTypes(path, normalize_config=None)
Bases:
universal_transfer_operator.datasets.file.types.base.FileTypes
Concrete implementation to handle Parquet file type
- Parameters:
path (str) –
normalize_config (dict | None) –
- property name
get file type
- export_to_dataframe(stream, columns_names_capitalization='original', **kwargs)
Read parquet file from one of the supported locations and return dataframe.
- Parameters:
stream – file stream object
columns_names_capitalization – determines whether to convert all columns to lowercase/uppercase in the resulting dataframe
- create_from_dataframe(df, stream)
Write parquet file to one of the supported locations.
- Parameters:
df (pandas.DataFrame) – pandas dataframe
stream (io.TextIOWrapper) – file stream object
- Return type:
None