v22.01.00
Release 22.01 adds support for einsum expressions, logic functions and a subset of indexing and array manipulation routines.
Conda packages for this release are available at https://anaconda.org/legate/cunumeric.
New Features
- Convolution by @magnatelee and @lightsighter in #103
- Added few universal functions and logical operations by @ipdemes in #134
- numpy.tril and numpy.triu by @magnatelee in #144
- Einsum operation by @manopapad in #142
- Cholesky factorization by @magnatelee in #160
- Implemented split routines and a test by @sbak5 in #152
- Choose operation by @ipdemes in #146
Improvements
- Convolve Cache for cuFFT by @lightsighter in #109
- Warmup iterations for Richardson-Lucy by @magnatelee in #113
- Remove NumPyAllocation by @magnatelee in #118
- Update for new data ingest interface by @manopapad in #105
- Enable some temporarily commented-out tests by @manopapad in #119
- Testcase for legate.core!94 by @manopapad in #120
- Use built-in reduction op by @magnatelee in #136
- Managing CUDA library contexts directly in cuNumeric by @magnatelee in #138
- Support for cuSOLVER by @magnatelee in #139
- Make CUDA library context cache thread safe by @magnatelee in #141
- Use .cu for CUDA library management by @magnatelee in #145
- Some reusable test input generators by @manopapad in #153
- Fix Wundefined-var-template clang warning by @manopapad in #154
- Add eager fallback mode to testing script by @manopapad in #156
- Add eager tests by @marcinz in #157
- Small additions to test input generators by @manopapad in #159
- No longer need to reserve one dim for reductions by @manopapad in #161
- Use a per-device stream cache for CUDA library calls by @magnatelee in #165
- Simple tiling heuristic for Cholesky factorization by @magnatelee in #167
- Fix clang-format config to include cu,cuh,inl files by @manopapad in #168
- LEGATE_ABORT is now a statement by @magnatelee in #169
- Preloading CUDA libraries by @magnatelee in #171
- Use CHECK_* macros in a couple more places by @manopapad in #172
- Fix some invocations of complex constructors by @manopapad in #173
- Add a switch to not call tril on Cholesky outputs by @magnatelee in #174
- Do python install on custom dir w/o eggs by @manopapad in #177
- Refined 'tests/array_split.py' w/ more essential input shapes by @sbak5 in #178
- WIP: adding logic for DIAGONAL by @ipdemes in #170
- Stack and concatenate routines including subroutines by @sbak5 in #175
- Refactoring by @magnatelee in #181
Bug Fixes
- Fix #111 by @magnatelee in #115
- math.prod not available in python 3.7 by @manopapad in #129
- Fix some compiler warnings by @magnatelee in #130
- dot: fix error message on unsupported array dimensions by @manopapad in #133
- Fix slot calculation in reduction kernel by @manopapad in #148
- Port fix for #79 by @manopapad in #155
- Build OpenBLAS with CROSS option to prevent tests at compile time by @marcinz in #158
- Pin setuptools version, to work around breaking change by @manopapad in #164
- Workaround for a bug in cuBLAS < 11.4 by @magnatelee in #185
- Cannot install cuNumeric to different dir than Legate Core by @manopapad in #186
- Adjust error tolerance for float16, to avoid spurious test failure by @manopapad in #166
Documentation
- Adding contributions file by @marcinz in #147
- Update docstrings by @magnatelee in #188
New Contributors
- @lightsighter made their first contribution in #109
- @ipdemes made their first contribution in #134
- @pre-commit-ci made their first contribution in #151
- @sbak5 made their first contribution in #152
Full Changelog: v21.11.00...v22.01.00