Petastorm Versions Save

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

v0.11.1

2 years ago

PR 687 (resolves issue #684 ): Fix a failure when reading data from a parquet file (and not a parquet directory). PR 686 (resolves issue #685 ): Silenty omit fields that have unsupported types. Previously were failing loudly making parquet stores with such fields unusable with Petastorm.

v0.9.3

3 years ago

Thanks to our new contributors: Travis Addair and Ryan (rb-determined-ai).

  • Retire support for Python 2.
  • PR 568: Added additional kwargs for Spark Dataset Converter.
  • PR 564: Expose filters (PyArrow filters) argument in make_reader and make_batch_reader

v0.7.3rc0

5 years ago

v0.7.2

5 years ago