Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[KVCache] Fix attention prefill kernel for Metal and Android
This PR fixes two bugs of the attention prefill ragged kernel. * The first bug is the unroll of loop `ki`, which causes the TIR build failure in the PointerValueTypeRewrite pass due to vector size. * The second is the tile sizes of `tile_z` and `tile_y` may violate the assertion check in `get_tile_size`.
- Loading branch information