forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[DO NOT MERGE] Vinayak/moe final hashem #127
Open
carlushuang
wants to merge
24
commits into
main
Choose a base branch
from
vinayak/moe_final_hashem
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Commits on Jul 30, 2024
-
Configuration menu - View commit details
-
Copy full SHA for dc5b660 - Browse repository at this point
Copy the full SHA dc5b660View commit details -
Configuration menu - View commit details
-
Copy full SHA for d5564d3 - Browse repository at this point
Copy the full SHA d5564d3View commit details
Commits on Aug 1, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1934f71 - Browse repository at this point
Copy the full SHA 1934f71View commit details -
Configuration menu - View commit details
-
Copy full SHA for e01cd34 - Browse repository at this point
Copy the full SHA e01cd34View commit details
Commits on Aug 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 24317bb - Browse repository at this point
Copy the full SHA 24317bbView commit details
Commits on Aug 6, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 6696142 - Browse repository at this point
Copy the full SHA 6696142View commit details
Commits on Aug 8, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 077cb78 - Browse repository at this point
Copy the full SHA 077cb78View commit details -
Merge branch 'main' into vinayak/moe_microbench
root committedAug 8, 2024 Configuration menu - View commit details
-
Copy full SHA for eb861ac - Browse repository at this point
Copy the full SHA eb861acView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6e70d89 - Browse repository at this point
Copy the full SHA 6e70d89View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2b94a75 - Browse repository at this point
Copy the full SHA 2b94a75View commit details -
Configuration menu - View commit details
-
Copy full SHA for c406afc - Browse repository at this point
Copy the full SHA c406afcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5b92aa6 - Browse repository at this point
Copy the full SHA 5b92aa6View commit details
Commits on Aug 9, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 98a31f2 - Browse repository at this point
Copy the full SHA 98a31f2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 75031c5 - Browse repository at this point
Copy the full SHA 75031c5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7c64459 - Browse repository at this point
Copy the full SHA 7c64459View commit details
Commits on Aug 10, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b213dfe - Browse repository at this point
Copy the full SHA b213dfeView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7df58ad - Browse repository at this point
Copy the full SHA 7df58adView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1ca0ad6 - Browse repository at this point
Copy the full SHA 1ca0ad6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0cbe892 - Browse repository at this point
Copy the full SHA 0cbe892View commit details -
Merge branch 'vinayak/moe_final' of https://github.com/ROCm/vllm into…
… vinayak/moe_final
Configuration menu - View commit details
-
Copy full SHA for f2f1ca5 - Browse repository at this point
Copy the full SHA f2f1ca5View commit details -
Configuration menu - View commit details
-
Copy full SHA for c43e3e8 - Browse repository at this point
Copy the full SHA c43e3e8View commit details
Commits on Aug 11, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ce8a86e - Browse repository at this point
Copy the full SHA ce8a86eView commit details -
Add batched prefill via VLLM_SCHED_PREFILL_COUNT
To ensure we we don't run prefills repeatedly during decode, provide a mechanism to queue up a certain number of prefills before executing. VLLM_SCHED_PREFILL_COUNT will be the minimum batch count to specify before executing. One caveat, the --scheduler-delay-factor should be used to enforce a longer prefill scheduling value. This will be set to the value in VLLM_SCHED_PREFILL_COUNT, if not explicitly provided. The need for this exists because an uneven number of prefills can lead to the queue never reaching the VLLM_SCHED_PREFILL_COUNT. Causing the server to hang
Configuration menu - View commit details
-
Copy full SHA for 8148b54 - Browse repository at this point
Copy the full SHA 8148b54View commit details -
Configuration menu - View commit details
-
Copy full SHA for b9f05ff - Browse repository at this point
Copy the full SHA b9f05ffView commit details
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.