New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Add speculative decoding params to lm_bench #1221

Merged

eaidova merged 3 commits into openvinotoolkit:master from sbalandi:llm_bench_sd

Nov 20, 2024

Contributor

sbalandi commented Nov 18, 2024 •

edited

Loading

Task: CVS-155520

github-actions bot added category: llm_bench category: sampling labels

sbalandi requested a review from eaidova

November 18, 2024 10:21

sbalandi force-pushed the llm_bench_sd branch from 7bfee69 to 7ac52d7 Compare

November 18, 2024 10:35

eaidova reviewed

View reviewed changes

tools/llm_bench/benchmark.py Outdated Show resolved Hide resolved

eaidova reviewed

View reviewed changes

tools/llm_bench/benchmark.py Outdated Show resolved Hide resolved

Collaborator

eaidova commented Nov 18, 2024

@sbalandi could you please include test into GHA for speculative decognig case

eaidova reviewed

View reviewed changes

tools/llm_bench/benchmark.py Outdated Show resolved Hide resolved

ilya-lavrenov requested a review from iefode

November 18, 2024 11:30

ilya-lavrenov reviewed

View reviewed changes

tools/llm_bench/benchmark.py Outdated Show resolved Hide resolved

ilya-lavrenov added this to the 2025.0 milestone

ilya-lavrenov added the port to LTS label

iefode reviewed

View reviewed changes

tools/llm_bench/llm_bench_utils/model_utils.py Outdated Show resolved Hide resolved

github-actions bot added the category: GHA label

eaidova reviewed

View reviewed changes

tools/llm_bench/benchmark.py Outdated Show resolved Hide resolved

eaidova reviewed

View reviewed changes

tools/llm_bench/benchmark.py Outdated Show resolved Hide resolved

eaidova reviewed

View reviewed changes

tools/llm_bench/llm_bench_utils/ov_utils.py Outdated Show resolved Hide resolved

Collaborator

eaidova commented Nov 19, 2024 •

edited

Loading

@sbalandi looks like the selected for test models are too large. Maybe we can use something less compute expensive? e.g. tinyllama with fp16 and int4/int8 precision as draft. Also you can use pre-converted models from here https://huggingface.co/collections/OpenVINO/llm-6687aaa2abca3bbcec71a9bd changing optimum-cli to huggingface-cli download command

sbalandi force-pushed the llm_bench_sd branch 2 times, most recently from 8caa747 to 795563d Compare

November 19, 2024 13:27

eaidova reviewed

View reviewed changes

tools/llm_bench/benchmark.py Show resolved Hide resolved

sbalandi force-pushed the llm_bench_sd branch 3 times, most recently from 35050d3 to e1444e2 Compare

November 19, 2024 15:56

eaidova reviewed

View reviewed changes

tools/llm_bench/llm_bench_utils/ov_utils.py Outdated Show resolved Hide resolved

sbalandi force-pushed the llm_bench_sd branch from e1444e2 to 566a710 Compare

November 19, 2024 17:03

eaidova reviewed

View reviewed changes

tools/llm_bench/llm_bench_utils/ov_utils.py Show resolved Hide resolved

eaidova reviewed

View reviewed changes

tools/llm_bench/llm_bench_utils/ov_utils.py Outdated Show resolved Hide resolved

eaidova reviewed

View reviewed changes

tools/llm_bench/llm_bench_utils/ov_utils.py Outdated Show resolved Hide resolved

eaidova reviewed

View reviewed changes

tools/llm_bench/llm_bench_utils/ov_utils.py Outdated Show resolved Hide resolved

sbalandi force-pushed the llm_bench_sd branch 3 times, most recently from 87eb848 to e4155c3 Compare

November 19, 2024 19:17

sbalandi added 2 commits

November 19, 2024 20:16


          Add speculative decoding params to lm_bench

c5fb131


          update

e4155c3

eaidova approved these changes

View reviewed changes


          Merge branch 'master' into llm_bench_sd

1a9a157

eaidova enabled auto-merge

November 20, 2024 05:51

github-actions bot removed the category: sampling label

eaidova added this pull request to the merge queue

Merged via the queue into openvinotoolkit:master with commit a2e1ae9

53 of 54 checks passed

ilya-lavrenov pushed a commit to ilya-lavrenov/openvino.genai that referenced this pull request


          Add speculative decoding params to lm_bench (openvinotoolkit#1221)

7a44c33

Task: [CVS-155520](https://jira.devtools.intel.com/browse/CVS-155520)

---------

Co-authored-by: Ekaterina Aidova <[email protected]>

ilya-lavrenov mentioned this pull request

Port fixes from master to 2024.5.1 / 2024.6.0 #1239

Merged

github-merge-queue bot pushed a commit that referenced this pull request


          Port fixes from master to 2024.5.1 / 2024.6.0 (#1239)

da7a7ca

**Ported:**
- #1187
- #1189
- #1192
- #1196
- #1202
- #1204
- #1210
- #1217
- #1218
- #1221
- #1222
- #1228

ilya-lavrenov removed the port to LTS label

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: GHA category: llm_bench