
[Core] Adding token ranks along with logprobs #3516

Merged: 6 commits merged into vllm-project:main on Mar 25, 2024

Conversation

SwapnilDreams100 (Contributor):
Adds token rank functionality (for both prompt tokens and sampled tokens) within the Logprob object, by adding a rank property to Logprob.
This PR also adds a couple of tests to verify the ranks in greedy and non-greedy settings.

The implementation here is meant to align with the functionality in https://github.com/IBM/text-generation-inference (IBM's fork of HF TGI).
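
For context, a hedged sketch of how the new field could surface to a user once merged (the model name is arbitrary, and the output traversal is an assumption based on vLLM's RequestOutput structure, not part of this diff):

```python
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")
params = SamplingParams(logprobs=3, prompt_logprobs=3)
out = llm.generate(["Hello"], params)[0]

# Each sampled position maps candidate token ids to Logprob objects;
# with this PR, each Logprob also carries the token's rank within the
# full vocabulary distribution (1 = most likely).
for pos in out.outputs[0].logprobs:
    for token_id, lp in pos.items():
        print(token_id, lp.logprob, lp.rank)
```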

@SwapnilDreams100 SwapnilDreams100 changed the title feat: adding token ranks along with logprobs [CORE] Adding token ranks along with logprobs Mar 20, 2024
@SwapnilDreams100 SwapnilDreams100 changed the title [CORE] Adding token ranks along with logprobs [Core] Adding token ranks along with logprobs Mar 20, 2024
@@ -447,6 +447,9 @@ def _sample(
]
return sample_results

def _get_ranks(x: torch.Tensor, indices: List[int]) -> torch.Tensor:
Collaborator:

Can you add a docstring to this func? E.g., the shape of the x tensor, and what exactly the indices mean?

Contributor (author):

will do!
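
A minimal sketch of what the requested docstring could look like; the shapes and semantics below are inferred from the surrounding diff, not the PR's final code:

```python
from typing import List
import torch

def _get_ranks(x: torch.Tensor, indices: List[int]) -> torch.Tensor:
    """Compute the rank of each chosen token within its distribution.

    Args:
        x: Logprobs tensor of shape [num_tokens, vocab_size].
        indices: Chosen token id per row; len(indices) == num_tokens.

    Returns:
        A 1-D tensor of 1-based ranks, where rank 1 means the chosen
        token had the highest logprob in its row.
    """
    chosen = x[torch.arange(x.size(0)), indices]          # per-row chosen logprob
    return (x > chosen.unsqueeze(-1)).long().sum(-1) + 1  # count strictly greater
```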

@@ -17,9 +17,9 @@
class Logprob:
"""Infos for supporting OpenAI compatible logprobs."""
logprob: float
rank: Optional[int] = None
Collaborator:

Can you add a comment explaining what this means here?

Contributor (author):

will do!
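
A sketch of what the commented field could look like; the comment wording is illustrative, not vLLM's final text:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Logprob:
    """Infos for supporting OpenAI compatible logprobs."""
    logprob: float
    # 1-based rank of this token among all vocabulary tokens when sorted
    # by logprob (rank 1 = the most likely token), or None if ranks were
    # not computed.
    rank: Optional[int] = None
```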

@Yard1 Yard1 self-requested a review March 20, 2024 03:03
njhill (Member) left a comment:

Comment on lines 511 to 513
batched_ranks_query_result = _get_ranks(
logprobs[batched_logprobs_query_seq_indices],
batched_logprobs_query_token_indices)
Collaborator:

Let's move this to after line 526, since we'll have to force a CPU<->GPU sync here anyway.

Contributor (author):

will do!
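
For readers unfamiliar with the concern: reading GPU tensor values from the CPU (e.g. via .cpu() or .tolist()) blocks until the device catches up, so the suggestion is to issue the ranks query alongside the existing logprobs work and cross the device boundary only once. A toy, standalone illustration with stand-in data (not the PR's actual code):

```python
import torch

logprobs = torch.randn(4, 10)   # stand-in for the sampler's logprobs rows
token_indices = [1, 5, 2, 7]    # stand-in for the chosen token per row

# Issue all tensor work first, while everything can stay on the device...
chosen = logprobs[torch.arange(4), token_indices]
ranks = (logprobs > chosen.unsqueeze(-1)).long().sum(-1) + 1

# ...then cross the CPU<->GPU boundary once at the end; on CUDA, this
# read is where the synchronization actually happens.
print(ranks.tolist())
```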

Yard1 (Collaborator) left a comment:

This looks good, please move the invocation as outlined in the comment before merge!

njhill (Member) commented Mar 20, 2024:

One question I guess is whether these should also be exposed in the OpenAI API responses (but that doesn't necessarily need to be addressed in this PR).

@simon-mo simon-mo merged commit 819924e into vllm-project:main Mar 25, 2024
32 checks passed
xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 31, 2024
Temirulan pushed a commit to Temirulan/vllm-whisper that referenced this pull request Sep 6, 2024