v2.16.0
andrewballantyne
released this
06 Oct 14:08
·
2154 commits
to main
since this release
Notable Changes
- AcceleratorProfiles (and Habana support) are now available
- Accelerators are the group term for GPUs & AI specific chips that don’t include the Graphics (“G” in GPU) aspect
- An AcceleratorProfile is a way to describe the backing resource
- AcceleratorProfiles have “identifier” to speak
nvidia.com/gpu
or any brand that a Node would understand and be able to schedule- Each profile has the ability to come with a description & and display name to curate it for your users
- Tolerations can not be specified more openly for existing Nvidia GPU and any future accelerators you want to support
- Since this is the grouping term for our existing support of Nvidia GPUs, a couple things are updated in this area
- On upgrade, if you have GPUs installed, we should detect that and create you a
migrated-gpu
AcceleratorProfile - If you have any problems with this, an AcceleratorProfile can be created in the application namespace (where the Dashboard is deployed)
- On upgrade, if you have GPUs installed, we should detect that and create you a
- We welcome all feedback, please log an issue if you see gaps in our flow for your use-cases
- Bug fixes
What's Changed
- Added accelerator CRD by @Gkrumbach07 in #1451
- add copy to clipboard icon to tooltips by @Gkrumbach07 in #1561
- Accelerator user flow by @Gkrumbach07 in #1555
- added gpu migration by @Gkrumbach07 in #1618
- added accelerator detection by @Gkrumbach07 in #1628
- bug fixes by @Gkrumbach07 in #1645
- fix lint errors in accelerator support by @Gkrumbach07 in #1668
- Fix bug in migration for GPUS by @Gkrumbach07 in #1677
- revert add rbac accelerator role by @Gkrumbach07 in #1753
- add rbac accelerator role by @Gkrumbach07 in #1754
- Minor accelerator fixes by @Gkrumbach07 in #1764
- Added Edge to SME areas by @andrewballantyne in #1789
- Add UX and Tracker internal templates by @andrewballantyne in #1794
- update scroll container selector for JumpLinks by @christianvogt in #1803
- Improve logging with objects by @andrewballantyne in #1828
- Fix issue displaying error in Inference Service by @lucferbux in #1823
- Revamp empty state in Model Serving Global by @lucferbux in #1796
- Revert "Revamp empty state in Model Serving Global" by @andrewballantyne in #1834
- Add duplicate option for non-OOTB custom serving runtimes by @DaoDaoNoCode in #1819
- Tech Debt Template by @andrewballantyne in #1830
- Missing Duplicate Run from Triggered Action Menu by @dpanshug in #1715
- Bump graphql from 16.6.0 to 16.8.1 in /frontend by @dependabot in #1841
- fix accelerator detection logic by @Gkrumbach07 in #1865
- Fix issue with resource types for cpus by @lucferbux in #1790
- update cluster role to allow accelerator profile creation by @Gkrumbach07 in #1877
- move from cluster role to role for accelerator create by @Gkrumbach07 in #1879
- Habana to
main
by @andrewballantyne in #1884 - Limit the use of
useTableColumnSort
hook by @DaoDaoNoCode in #1784 - Add/update documentation on the dashboard flows by @andrewballantyne in #1851
- use generic object state on data connections and prevent resetting on edit by @DaoDaoNoCode in #1732
- Add owner references to Elyra role binding when creating notebooks by @DaoDaoNoCode in #1770
- refactor hook test utils to extend jest matchers by @christianvogt in #1887
- adding workflow to auto-add issues to project boards by @dchourasia in #1899
- Bump postcss from 8.4.23 to 8.4.31 in /frontend by @dependabot in #1904
- Bump actions/checkout from 3 to 4 by @dependabot in #1888
- Delete pipeline server from the Pipelines Section in Projects page by @manaswinidas in #1743
- refresh accelerators after reset by @Gkrumbach07 in #1903
- Update Custom Serving Runtime tooltip with Openshift resource information by @ppadti in #1772
New Contributors
- @dchourasia made their first contribution in #1899
Full Changelog: v2.15.0...v2.16.0