Releases: NVIDIA-Merlin/NVTabular
Releases · NVIDIA-Merlin/NVTabular
v1.4.0
What’s Changed
⚠ Breaking Changes
- Remove FastAI notebooks @benfred (#1668)
- Fix dl @jperez999 (#1661)
- Replace cudf series ceil() with numpy ceil() @jperez999 (#1656)
🐜 Bug Fixes
- Fix integration tests that reached into
Workflow
's private methods @karlhigley (#1660) - Fix groupby on lists with cudf 22.06+ @benfred (#1654)
- Update the
Categorify
operator to set the domain max correctly @oliverholworthy (#1641) - Test LambdaOp with dask workflows @benfred (#1634)
🚀 Features
- Add sum to supported aggregations in Groupby @radekosmulski (#1638)
📄 Documentation
🔧 Maintenance
- Add optional requirement specifiers for GPU and dev requirements @karlhigley (#1664)
- Add
scipy
as a dependency @karlhigley (#1663) - Fix dl @jperez999 (#1661)
- Fix integration tests that reached into
Workflow
's private methods @karlhigley (#1660) - Update black/pylint/flake8,isort etc @benfred (#1659)
- Remove using-feature-columns nb @rnyak (#1657)
- Replace cudf series ceil() with numpy ceil() @jperez999 (#1656)
- Extract Python and Dask
Executor
classes fromWorkflow
@karlhigley (#1609) - Update
versioneer
from 0.19 to 0.23 @oliverholworthy (#1651)
v1.3.3: Increase rossmann integration test threshold (#1631)
Increase rossmann integration test threshold We were failing on some tests, because the results were 1.5e-04 different instead of the 1e-04 tolerance we had specified. Raise the tolerance to make sure tests pass
v1.3.2: Inference Integration test fix (#1627)
* skip download step, that is not allowed in CI * fix inference data creation path helper
v1.3.1
v1.3.0
What’s Changed
🐜 Bug Fixes
- Don't install tests with nvtabular @benfred (#1608)
- Groupby to no longer require groupby_cols in column selector @radekosmulski (#1598)
- Adjust imports in the
TritonPythonModel
forWorkflows
@karlhigley (#1604) - column names can now include aggregations in ops.Groupby @radekosmulski (#1592)
- Normalize Op using fp32 @benfred (#1597)
- Cast warning to string in configure_tensorflow @leewyang (#1587)
📄 Documentation
- docs: Add TF compat info @mikemckiernan (#1528)
🔧 Maintenance
- Fix movielens notebook data path @jperez999 (#1622)
- skip download step, that is not allowed in CI @jperez999 (#1620)
- fix tritonserver gpu id & fixed timeout for criteo integration tests @jperez999 (#1619)
- Remove unnecessary docs dependencies @mikemckiernan (#1617)
- fix ci script for integration tests and added skip check @jperez999 (#1616)
- Integration tests refactor @jperez999 (#1614)
- Don't
git pull origin main
in integration tests, use container version @karlhigley (#1610)
v1.2.2
What’s Changed
🐜 Bug Fixes
- add casting for additional aggs in groupby @radekosmulski (#1580)
📄 Documentation
- Update URLs to Criteo datasets @mikemckiernan (#1591)
🔧 Maintenance
v1.2.1
v1.2.0
What’s Changed
🐜 Bug Fixes
- remove nvtabular triton backend that seg faults on termination. @jperez999 (#1576)
- Fix LambdaOp example usage 1 @rnyak (#1561)
📄 Documentation
- Merlin offers three containers @mikemckiernan (#1581)
- Fix dataloader docstring @benfred (#1573)
- Improved docstrings of GroupBy op to reinforce the required usage of dataset.shuffle_by_keys() @gabrielspmoreira (#1551)
- Remove old support matrix table, @benfred (#1560)
- Update CONTRIBUTING to mention PR labels @mikemckiernan (#1554)
- Update changelog to point to github releases @benfred (#1549)
- Use common release-drafter workflow @mikemckiernan (#1548)
🔧 Maintenance
- Add a GA workflow that requires labels on PR's @benfred (#1579)
- Use shared implementation of triage workflow @benfred (#1577)
- Don't pull main on running NVT unittests @benfred (#1578)
- Don't build model_config_pb2 @benfred (#1566)
- Add conda builds to our github actions workflow @benfred (#1557)
- Add release-drafter workflow for generating changelogs @benfred (#1540)
- Remove message about integration tests missing @benfred (#1539)
v1.1.1: Fix install model_config_pb2.py (#1538)
We are generating the model_config_pb2.py file from a protobuf script, but doing this as part of the 'build_ext' phase - running at the same time as building our native extensions. This caused the build to not pick up the generated python file unless it already existed. Fix by moving the proto generation to the correct 'build_py' phase
v1.1.0
Known Issues
- Error when sending request to Triton after loading a Transformers4Rec PyTorch model #1502
What's Changed
- Automate pushing package to pypi by @benfred in #1505
- docs: Add attention admonition to Merlin SMX by @mikemckiernan in #1507
- added category name to domain for column properties by @jperez999 in #1508
- Fix the embedding size lookup in
Categorify
op by @karlhigley in #1511 - Max auc by @jperez999 in #1513
- Fix inf container tag in getting started TF-inf nb and polish exp README by @rnyak in #1516
- Fix for max-size categorify operator category ordering by @jperez999 in #1519
- Criteo HugeCTR Inference Configuration Fix by @bschifferer in #1522
- Add ascending param in the Groupby op by @rnyak in #1525
- Remove os.environ["TF_MEMORY_ALLOCATION"] from getting-started 03-Training-with-TF nb to avoid OOM by @rnyak in #1527
- Fix getting started 03-Training-with-HugeCTR.ipynb nb's training without printing out auc and loss metrics issue by @rnyak in #1532
- reqs fixed by @jperez999 in #1536
- docs: Add ext-toc, switch to MyST-NB by @mikemckiernan in #1529
- remove horovod example, no longer supported by @jperez999 in #1530
Full Changelog: v1.0.0...v1.1.0