Repro (this should be run on 2 A100s or H100s to make sure the Mixtral model fits into GPU memory):
# First install vllm from master via https://docs.vllm.ai/en/latest/getting_started/installation.html#build-from-source
# Then clone and install https://github.com/EleutherAI/lm-evaluation-harness
git clone https://github.com/EleutherAI/lm-evaluation-harness
cd lm-evaluation-harness
pip install -e .
# Now run the evaluation harness
lm_eval --model vllm --model_args pretrained=mistralai/Mixtral-8x7B-Instruct-v0.1,tensor_parallel_size=2 --tasks mmlu --num_fewshot 5
The API breakage is fixed in EleutherAI/lm-evaluation-harness#1549, but after the fix the evaluation is extremely slow (about 40x slower than before), so it is not really feasible to run.
I think this is because, when no batch size is specified, the harness defaults to a batch size of 1. It should be fixed if you pass --batch_size auto, which lets the harness take advantage of vLLM's continuous batching.
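For reference, here is a minimal sketch of the repro command with the batch size set explicitly. It reuses the model arguments from the repro and only adds `--batch_size auto`. The `MODEL_ARGS` and `EVAL_CMD` variables are hypothetical names introduced for this sketch, and I have not measured how much of the slowdown this recovers.

```shell
#!/bin/sh
# Sketch: the repro command plus an explicit batch size.
# MODEL_ARGS and EVAL_CMD are illustrative names, not part of lm_eval.
MODEL_ARGS="pretrained=mistralai/Mixtral-8x7B-Instruct-v0.1,tensor_parallel_size=2"

# Same flags as the original repro, with --batch_size auto appended.
EVAL_CMD="lm_eval --model vllm --model_args $MODEL_ARGS --tasks mmlu --num_fewshot 5 --batch_size auto"

# Print the command for review before launching it on the GPU machine.
echo "$EVAL_CMD"
```

To launch it, run `sh -c "$EVAL_CMD"` on the machine with the 2 A100s or H100s.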
This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!
Since #3065, the eval suite https://github.com/EleutherAI/lm-evaluation-harness is broken.
Running the repro above fails because of an API breakage.
Being able to run the evaluation harness in a timely manner is crucial so we can ensure model performance doesn't degrade.