Skip to content

[Bug] recent change of param "block_size_x/y, unroll" in dlight/gpu/matmul.py significantly decrease q4f16_1 prefill speed on android 8gen3 device #126

[Bug] recent change of param "block_size_x/y, unroll" in dlight/gpu/matmul.py significantly decrease q4f16_1 prefill speed on android 8gen3 device

[Bug] recent change of param "block_size_x/y, unroll" in dlight/gpu/matmul.py significantly decrease q4f16_1 prefill speed on android 8gen3 device #126

Triggered via issue September 21, 2024 14:16
Status Skipped
Total duration 3s
Artifacts

tag_teams.yml

on: issues
tag-teams
0s
tag-teams
Fit to window
Zoom out
Zoom in