The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark, and can be used from pure Python code.
PR 687 (resolves issue #684): Fix a failure when reading data from a single Parquet file (rather than a Parquet directory).
PR 686 (resolves issue #685): Silently omit fields that have unsupported types. Previously, reading such fields failed loudly, which made Parquet stores containing them unusable with Petastorm.
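
The behavior change in PR 686 can be illustrated with a minimal, pure-Python sketch. This is not Petastorm's implementation: the function `select_supported_fields`, the `strict` flag, the `SUPPORTED_TYPES` set, and the type names are all hypothetical and exist only to contrast "fail on an unsupported field" with "drop the field and keep reading".

```python
# Hypothetical set of type names a reader can decode.
# This is an assumption for illustration, NOT Petastorm's actual list.
SUPPORTED_TYPES = {"int32", "int64", "float", "double", "string", "bool"}


def select_supported_fields(schema, strict=False):
    """Return the fields of `schema` (name -> type name) that are supported.

    strict=True  mimics the old behavior: raise on the first unsupported field.
    strict=False mimics the new behavior: silently leave such fields out.
    """
    supported = {}
    for name, type_name in schema.items():
        if type_name in SUPPORTED_TYPES:
            supported[name] = type_name
        elif strict:
            raise ValueError(
                f"Field {name!r} has unsupported type {type_name!r}"
            )
        # Non-strict mode: the unsupported field is skipped without an error.
    return supported


# Example store schema with one field of a (hypothetically) unsupported type.
schema = {"id": "int64", "score": "double", "metadata": "map<string,string>"}

fields = select_supported_fields(schema)
print(sorted(fields))  # ['id', 'score']
```

With `strict=True` the same schema raises a `ValueError`, which corresponds to the earlier situation where a single unsupported field made the whole store unreadable.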