Fix huggingface response caching bug #659
Conversation
MLCommons CLA bot: All contributors have signed the MLCommons CLA ✍️ ✅
Nice! I bet this one was a bear to debug.
```diff
 ) -> SUTResponse:
     completions = []
     for choice in response.choices:
-        text = choice.message.content
+        text = choice["message"]["content"]
```
You may want to try the benedict library. It wraps `dict` and gives you convenient functionality like accessing elements via dot notation or dict notation interchangeably (`choice["message"]` or `choice.message`).
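A quick sketch of that suggestion, assuming python-benedict with attribute-style (keyattr) access enabled:

```python
from benedict import benedict

choice = benedict({"message": {"content": "hello"}}, keyattr_enabled=True)
print(choice["message"]["content"])  # dict-style access
print(choice.message.content)        # dot-style access on the same object
```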
Thanks for sorting this out.
The huggingface chat_completions API returns a dataclass/dict object, `ChatCompletionOutput`. It inherits from `dict`, which is a "typeable" data structure and thus allows it to be cached. However, there was a bug when running this SUT on a prompt that had been previously cached: because the response is a dataclass and not a plain dict, deserialization produced an error. This PR addresses the bug by putting the response object into pydantic form before returning it in `evaluate`.
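A minimal sketch of that idea (the model names here are illustrative stand-ins, and pydantic v2 is assumed): mirror the fields you need in a pydantic model and validate the raw response into it, so the cached JSON round-trips cleanly.

```python
from typing import List
from pydantic import BaseModel

class ChatMessage(BaseModel):
    content: str

class ChatChoice(BaseModel):
    message: ChatMessage

class ChatResponse(BaseModel):  # hypothetical stand-in, not the HF class
    choices: List[ChatChoice]

raw = {"choices": [{"message": {"content": "hi"}}]}
resp = ChatResponse.model_validate(raw)

# The cache round trip that previously broke now reconstructs the object:
restored = ChatResponse.model_validate_json(resp.model_dump_json())
assert restored == resp
```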
I also added some testing to make sure all SUTs are cacheable.
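As a hedged sketch of what such a test might look like (names are illustrative, pydantic v2 assumed), the core check is that a response survives a serialize/deserialize round trip:

```python
from pydantic import BaseModel

class SUTResponse(BaseModel):  # simplified stand-in for the real model
    completions: list[str]

class FakeSUT:
    def evaluate(self, prompt: str) -> SUTResponse:
        return SUTResponse(completions=[f"echo: {prompt}"])

def test_response_survives_cache_round_trip():
    response = FakeSUT().evaluate("hello")
    # Simulate the cache: dump to JSON, then validate back into the model.
    restored = SUTResponse.model_validate_json(response.model_dump_json())
    assert restored == response
```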