Skip to content

Support Qwen2-7b MLP in int4 and transpose_value_cache=True#11968

Merged
plusbang merged 3 commits intointel-analytics:mainfrom yangw1234:qwenint4Sep 2, 2024

Commits