gluonts.dataset.arrow.file 模块#
- class gluonts.dataset.arrow.file.ArrowFile(path: pathlib.Path, _start: int = 0, _take: Union[int, NoneType] = None)[source]#
基类:
gluonts.dataset.arrow.file.File
- property batch_offsets#
- path: pathlib.Path#
- reader: pyarrow.ipc.RecordBatchFileReader#
- property schema#
- class gluonts.dataset.arrow.file.ArrowStreamFile(path: pathlib.Path, _start: int = 0, _take: Union[int, NoneType] = None)[source]#
基类:
gluonts.dataset.arrow.file.File
- path: pathlib.Path#
- class gluonts.dataset.arrow.file.File[source]#
基类:
object
- SUFFIXES = {'.arrow', '.feather', '.parquet'}#
- static infer(path: pathlib.Path) Union[gluonts.dataset.arrow.file.ArrowFile, gluonts.dataset.arrow.file.ArrowStreamFile, gluonts.dataset.arrow.file.ParquetFile] [source]#
通过检查提供的路径返回ArrowFile、ArrowStreamFile或ParquetFile。
Arrow 的 random-access 格式以 ARROW1 开头,因此我们查看提供的文件以查找它。
- class gluonts.dataset.arrow.file.ParquetFile(path: pathlib.Path, _start: int = 0, _take: Union[int, NoneType] = None, _row_group_sizes: List[int] = <factory>)[source]#
基类:
gluonts.dataset.arrow.file.File
- path: pathlib.Path#
- reader: pyarrow.parquet.core.ParquetFile#