NVTabular Versions

NVTabular is a feature engineering and preprocessing library for tabular data. It is designed to quickly and easily manipulate the terabyte-scale datasets used to train deep-learning-based recommender systems.

v23.08.00

8 months ago

v23.06.00

10 months ago

v23.05.00

11 months ago

What’s Changed

🐜 Bug Fixes

  • Fix list slicing of np.ndarrays on CPU @karlhigley (#1817)

🚀 Features

  • Add support for int8 values with Categorify inference @oliverholworthy (#1818)

🔧 Maintenance

  • atomize added tags in TagAsUserID and TagAsItemID @radekosmulski (#1815)
  • Update requirements for Merlin packages to minimum version of 23.04 @karlhigley (#1804)
  • Update conda package publish for multiple python versions @oliverholworthy (#1805)
  • Remove use of deprecated numpy aliases of builtin types @oliverholworthy (#1813)
  • Add workflows to check base branch and set stable branch @oliverholworthy (#1811)
  • Update tag pattern in GitHub Workflows @oliverholworthy (#1812)
  • Cleanup Unused Test Dependencies @oliverholworthy (#1810)

v23.04.00

1 year ago

What’s Changed

🐜 Bug Fixes

  • Update import of device functions to use merlin.core versions @oliverholworthy (#1786)
  • Enable DatasetGen usage in CPU environment @oliverholworthy (#1776)

🚀 Features

  • Enable CategorifyTransform inference operator to run on int16 types @oliverholworthy (#1798)
  • use merlin compat for imports of gpu specific packages @jperez999 (#1791)
  • Enable Workflow.transform to be run with a DataFrame type @oliverholworthy (#1777)

🔧 Maintenance

  • add concurrency setting to stop tests when new commits get pushed to PRs @nv-alaiacano (#1801)
  • use merlin compat for imports of gpu specific packages @jperez999 (#1791)
  • Replace nvtabular.utils with merlin.core.compat @edknv (#1795)
  • Disable package builds on pull requests @oliverholworthy (#1789)
  • Use None as default value of cpu in test_torch_dataloader @oliverholworthy (#1788)
  • Use None as default value of cpu in test_column_similarity @oliverholworthy (#1787)
  • Split up conda and PyPI package build/release jobs @oliverholworthy (#1780)
  • Fix docs tox environment @alexanderronquillo (#1775)
  • update conftest for backwards compat and new api for to parquet call @jperez999 (#1784)
  • Remove tests for sparse tensors in dataloader @oliverholworthy (#1783)
  • Update default value of cpu to None in dataset fixture @oliverholworthy (#1779)
  • Fix Dataloader unit test broken by the new DL structure @bschifferer (#1782)

v23.02.00

1 year ago

What’s Changed

🐜 Bug Fixes

  • Add support for serializing modules involved in LambdaOp execution by value @willb (#1741)

🚀 Features

  • add transform for df @jperez999 (#1734)

🔧 Maintenance

  • Clean up the way shapes are computed and specified @karlhigley (#1760)
  • Update passenv in test-gpu to use valid configuration @oliverholworthy (#1762)
  • Fix the inference code's deprecation warning @karlhigley (#1757)
  • Specify Minimum Python Version as 3.8 in package @oliverholworthy (#1732)
  • Update NVT to be compatible with shapes in ColumnSchemas @karlhigley (#1758)
  • fix gpu visibility issues on privileged container @jperez999 (#1759)
  • Move nest_asyncio dependency to test deps @karlhigley (#1755)
  • add gcp label to jenkinsfile @jperez999 (#1744)
  • Replace nvtabular inference back-end with python @jperez999 (#1771)
  • Update NVT operators and workflows to use Merlin dtypes @karlhigley (#1707)
  • Add Formatter (Prettier) for YAML and Markdown files @oliverholworthy (#1733)
  • add tf gpu allocator env var to tox @jperez999 (#1747)
  • Add check for urls present in requires_dist @oliverholworthy (#1728)
  • adding workflow to setup gha runner for GPU CI @jperez999 (#1739)
  • Run the tests against the main branch of Merlin Core @karlhigley (#1756)

Full Changelog: https://github.com/NVIDIA-Merlin/NVTabular/compare/v1.8.1...v23.02.00

v1.8.1

1 year ago

What’s Changed

Patch release on top of v1.8.0

  • Quicker installs with build binary distributions published to PyPI (using cibuildwheel) @karlhigley (#1754)

Full Changelog: https://github.com/NVIDIA-Merlin/NVTabular/compare/v1.8.0...v1.8.1

v1.8.0

1 year ago

What’s Changed

🐜 Bug Fixes

  • Fix output error occurring due to check if it is a dict or not @rnyak (#1742)
  • Remove min value count from properties when using sparse_max @oliverholworthy (#1705)

📄 Documentation

  • Address virtual developer review feedback @mikemckiernan (#1724)
  • docs: Add semver to calver banner @mikemckiernan (#1699)

🔧 Maintenance

  • remove test references that are no longer available @jperez999 (#1730)
  • remove integration tests for notebooks no longer available @jperez999 (#1729)
  • Use pre-commit for lint checks in GitHub Actions Workflow @oliverholworthy (#1723)
  • Remove echo from command in tox.ini @oliverholworthy (#1725)
  • Migrate the legacy examples to the Merlin repo @karlhigley (#1711)
  • Handle data loader as an iterator @oliverholworthy (#1720)
  • Release draft fix @jperez999 (#1712)
  • Add Jenkinsfile @AyodeAwe (#1702)
  • Update package requires_dist to remove extras that are not required @oliverholworthy (#1727)

Full Changelog: https://github.com/NVIDIA-Merlin/NVTabular/compare/v1.7.0...v1.8.0

v1.7.0

1 year ago

What’s Changed

🐜 Bug Fixes

  • fix tox to use correct branch in release tags @jperez999 (#1710)
  • Update metrics keys in example notebook tests @karlhigley (#1703)
  • Fix first/last groupby aggregation on list columns @rjzamora (#1693)

📄 Documentation

  • docs: Add basic SEO configuration @mikemckiernan (#1697)

🔧 Maintenance

  • Upload binary wheels for nvtabular @benfred (#1696)
  • Use merlin-dataloader package @benfred (#1694)

v1.6.0

1 year ago

What’s Changed

🐜 Bug Fixes

  • Fix Categorify bug for combo encoding with null values @rjzamora (#1652)
  • Fix joint Categorify with list columns @rjzamora (#1685)

📄 Documentation

  • update NVTabular examples @radekosmulski (#1633)
  • Remove examples Part 1 - Rossmann, RecSys2020, Outbrain @bschifferer (#1669)

🔧 Maintenance

  • adding import or skip for tensorflow framework required by examples @jperez999 (#1691)

Full Changelog: https://github.com/NVIDIA-Merlin/NVTabular/compare/v1.5.0...v1.6.0

v1.5.0

1 year ago

What’s Changed

🐜 Bug Fixes

  • Use Merlin DAG executors from core in integration tests @jperez999 (#1677)
  • Fix target encoding tagging issue @bbozkaya (#1672)

🔧 Maintenance

  • Remove stray file left over from Torch/Horovod multi-GPU example @karlhigley (#1674)
  • Use Merlin DAG executors from core in integration tests @jperez999 (#1677)
  • Remove poetry config @benfred (#1673)
  • chore: Add pybind11 as a tox requirement @mikemckiernan (#1675)
  • Switch to using the DAG executors from Merlin Core @karlhigley (#1666)
  • Use the latest version of Merlin Core from main in the tox test envs @karlhigley (#1671)
  • Set up tox environments for testing, linting, and building docs @karlhigley (#1667)