A consistent table management library in python
pyarrow<7
as a dependency.~kartothek.io.eager.copy_dataset
{.interpreted-text
role="meth"} to copy and optionally rename datasets within one store
or between stores (eager only)~kartothek.io.eager_cube.copy_cube
{.interpreted-text role="meth"}~kartothek.utils.predicate_converter
{.interpreted-text
role="meth"}This release rolls all the changes introduced with 4.x back to 3.20.0.
As the incompatibility between 4.0 and 5.0 will be an issue for some customers, we encourage you to use the very stable kartothek 3.20.0 and not version 4.x.
Please refer the Issue #471 for further information.
This release rolls all the changes introduced with 4.x back to 3.20.0.
As the incompatibility between 4.0 and 5.0 will be an issue for some customers, we encourage you to use the very stable kartothek 3.20.0 and not version 4.x.
Please refer the Issue #471 for further information.
This is a major release of kartothek with breaking API changes.
~kartothek.core.dataset.DatasetMetadata
{.interpreted-text
role="class"} now has an attribute called [schema]{.title-ref} which
replaces the previous attribute [table_meta]{.title-ref} and returns
only a single schemapandas.DataFrame
{.interpreted-text role="class"}~kartothek.core.dataset.DatasetMetadataBase.to_dict
{.interpreted-text
role="meth"} and
~kartothek.core.dataset.DatasetMetadata.from_dict
{.interpreted-text
role="meth"} changed replacing a dictionary in
[table_meta]{.title-ref} with the simple [schema]{.title-ref}kartothek.io.eager.commit_dataset
{.interpreted-text role="func"}
since these arguments didn't have any effectkartothek.io.eager.write_single_partition
{.interpreted-text
role="func"}~kartothek.io.eager.store_dataframes_as_dataset
{.interpreted-text
role="func"} now requires a list as an inputTrue
. The behaviour for [False]{.title-ref}
will be deprecated and removed in the next major release~kartothek.io.dask.dataframe.update_dataset_from_ddf
{.interpreted-text
role="func"}~kartothek.io.dask.dataframe.update_dataset_from_ddf
{.interpreted-text
role="func"} and
~kartothek.io.dask.dataframe.store_dataset_from_ddf
{.interpreted-text
role="func"} now return a [dd.core.Scalar]{.title-ref} object. This
enables all [dask.DataFrame]{.title-ref} graph optimizations by
default.~kartothek.io.dask.dataframe.collect_dataset_metadata
{.interpreted-text
role="func"}This will be the final release in the 3.X series. Please ensure your
existing codebase does not raise any DeprecationWarning from kartothek
and migrate your import paths ahead of time to the new
kartothek.api
{.interpreted-text role="mod"} modules to ensure a smooth
migration to 4.X.
kartothek.api
{.interpreted-text role="mod"} as the
public definition of the API. See also
versioning
{.interpreted-text role="doc"}.~kartothek.io.eager.read_dataset_as_dataframes
{.interpreted-text
role="func"} and
~kartothek.io.iter.read_dataset_as_dataframes__iterator
{.interpreted-text
role="func"} now correctly return categoricals as requested for
misaligned categories.pyarrow==3
as a dependency.~kartothek.io_components.utils.align_categories
{.interpreted-text
role="func"} for dataframes with missings and of non-categorical
dtype.