NVTabular Versions Save

NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

v23.08.00

8 months ago

v23.06.00

10 months ago

v23.05.00

11 months ago

What’s Changed

:ant: Bug Fixes

Fix list slicing of np.ndarrays on CPU @karlhigley (#1817)

:rocket: Features

Add support for int8 values with Categorify inference @oliverholworthy (#1818)

:wrench: Maintenance

atomize added tags in TagAsUserID and TagAsItemID @radekosmulski (#1815)
Update requirements for Merlin packages to minimum version of 23.04 @karlhigley (#1804)
Update conda package publish for muliple python versions @oliverholworthy (#1805)
Remove use of deprecated numpy aliases of builtin types @oliverholworthy (#1813)
Add workflows to check base branch and set stable branch @oliverholworthy (#1811)
Update tag pattern in GitHub Workflows @oliverholworthy (#1812)
Cleanup Unused Test Dependencies @oliverholworthy (#1810)

v23.04.00

1 year ago

What’s Changed

🐜 Bug Fixes

Update import of device functions to use merlin.core versions @oliverholworthy (#1786)
Enable DatasetGen usage in CPU environment @oliverholworthy (#1776)

🚀 Features

Enable CategorifyTransform inference operator to run on int16 types @oliverholworthy (#1798)
use merlin compat for imports of gpu specific packages @jperez999 (#1791)
Enable Workflow.transform to be run with a DataFrame type @oliverholworthy (#1777)

🔧 Maintenance

add concurrency setting to stop tests when new commits get pushed to PRs @nv-alaiacano (#1801)
use merlin compat for imports of gpu specific packages @jperez999 (#1791)
Replace nvtabular.utils with merlin.core.compat @edknv (#1795)
Disable package builds on pull requests @oliverholworthy (#1789)
Use None as default value of cpu in test_torch_dataloader @oliverholworthy (#1788)
Use None as default value of cpu in test_column_similarity @oliverholworthy (#1787)
Split up conda and PyPI package build/release jobs @oliverholworthy (#1780)
Fix docs tox environment @alexanderronquillo (#1775)
update conftest for backwards compat and new api for to parquet call @jperez999 (#1784)
Remove tests for sparse tensors in dataloader @oliverholworthy (#1783)
Update default value of cpu to None in dataset fixture @oliverholworthy (#1779)
Fix Dataloader Unittest - which broke by new DL structure @bschifferer (#1782)

v23.02.00

1 year ago

What’s Changed

🐜 Bug Fixes

Add support for serializing modules involved in LambdaOp execution by value @willb (#1741)

🚀 Features

add transform for df @jperez999 (#1734)

🔧 Maintenance

Clean up the way shapes are computed and specified @karlhigley (#1760)
Update passenv in test-gpu to use valid configuration @oliverholworthy (#1762)
Fix the inference code's deprecation warning @karlhigley (#1757)
Specify Minimum Python Version as 3.8 in package @oliverholworthy (#1732)
Update NVT to be compatible with shapes in ColumnSchemas @karlhigley (#1758)
fix gpu visibilty issues on privileged container @jperez999 (#1759)
Move nest_asyncio dependency to test deps @karlhigley (#1755)
add gcp label to jenkinsfile @jperez999 (#1744)
Replace nvtabular inference back-end with python @jperez999 (#1771)
Update NVT operators and workflows to use Merlin dtypes @karlhigley (#1707)
Add Formatter (Prettier) for YAML and Markdown files @oliverholworthy (#1733)
add tf gpu allocator env var to tox @jperez999 (#1747)
Add check for urls present in requires_dist @oliverholworthy (#1728)
adding workflow to setup gha runner for GPU CI @jperez999 (#1739)
Run the tests against the main branch of Merlin Core @karlhigley (#1756)

New Contributors

@willb made their first contribution in https://github.com/NVIDIA-Merlin/NVTabular/pull/1741

Full Changelog: https://github.com/NVIDIA-Merlin/NVTabular/compare/v1.8.1...v23.02.00

v1.8.1

1 year ago

What’s Changed

Patch release on top of v1.8.0

Quicker installs with build binary distributions published to PyPI (using cibuildwheel) @karlhigley (#1754)

Full Changelog: https://github.com/NVIDIA-Merlin/NVTabular/compare/v1.8.0...v1.8.1

v1.8.0

1 year ago

What’s Changed

🐜 Bug Fixes

Fix output error occurring due to check if it is a dict or not @rnyak (#1742)
Remove min value count from properties when using sparse_max @oliverholworthy (#1705)

📄 Documentation

Address virtual developer review feedback @mikemckiernan (#1724)
docs: Add semver to calver banner @mikemckiernan (#1699)

🔧 Maintenance

remove test references that are no longer available @jperez999 (#1730)
remove integration tests for notebooks no longer available @jperez999 (#1729)
Use pre-commit for lint checks in GitHub Actions Workflow @oliverholworthy (#1723)
Remove echo from command in tox.ini @oliverholworthy (#1725)
Migrate the legacy examples to the Merlin repo @karlhigley (#1711)
Handle data loader as an iterator @oliverholworthy (#1720)
Release draft fix @jperez999 (#1712)
Add Jenkinsfile @AyodeAwe (#1702)
Update package requires_dist to remove extras that are not required @oliverholworthy (#1727)

Full Changelog: https://github.com/NVIDIA-Merlin/NVTabular/compare/v1.7.0...v1.8.0

v1.7.0

1 year ago

What’s Changed

🐜 Bug Fixes

fix tox to use correct branch in release tags @jperez999 (#1710)
Update metrics keys in example notebook tests @karlhigley (#1703)
Fix first/last groupby aggregation on list columns @rjzamora (#1693)

📄 Documentation

docs: Add basic SEO configuration @mikemckiernan (#1697)

🔧 Maintenance

Upload binary wheels for nvtabular @benfred (#1696)
Use merlin-dataloader package @benfred (#1694)

v1.6.0

1 year ago

What’s Changed

🐜 Bug Fixes

Fix Categorify bug for combo encoding with null values @rjzamora (#1652)
Fix joint Categorify with list columns @rjzamora (#1685)

📄 Documentation

update NVTabular examples @radekosmulski (#1633)
Remove examples Part 1 - Rossmann, RecSys2020, Outbrain @bschifferer (#1669)

🔧 Maintenance

adding import or skip for tensorflow framework required by examples @jperez999 (#1691)

Full Changelog: https://github.com/NVIDIA-Merlin/NVTabular/compare/v1.5.0...v1.6.0

v1.5.0

1 year ago

What’s Changed

🐜 Bug Fixes

Use Merlin DAG executors from core in integration tests @jperez999 (#1677)
Fix target encoding tagging issue @bbozkaya (#1672)

🔧 Maintenance

Remove stray file left over from Torch/Horovod multi-GPU example @karlhigley (#1674)
Use Merlin DAG executors from core in integration tests @jperez999 (#1677)
Remove poetry config @benfred (#1673)
chore: Add pybind11 as a tox requirement @mikemckiernan (#1675)
Switch to using the DAG executors from Merlin Core @karlhigley (#1666)
Use the latest version of Merlin Core from main in the tox test envs @karlhigley (#1671)
Set up tox environments for testing, linting, and building docs @karlhigley (#1667)