FIX Failing Llama tests due to new kv cache #1832

BenjaminBossan · 2024-06-06T09:10:21Z

Requires installing transformers from source (so the normal CI won't pick this up).

To replicate, run these tests:

pytest tests/test_multitask_prompt_tuning.py
pytest tests/test_decoder_models.py -k "test_trl_internal_testing_tiny_random_LlamaForCausalLM and (prefix or prompt) and (generate or disable)"

This should result in 17 failing tests.

The issue is that past_key_values can now be an instance of DynamicCache. Therefore, just indexing into it won't work anymore. The solution is to check the type and if it's not a tuple, use the methods on the cache object instead.
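For illustration, the type check described above could look roughly like the sketch below. This is not the diff from this PR. The helper name `cached_seq_len`, the `FakeCache` class, and `fake_tensor` are hypothetical stand-ins so that the snippet runs without transformers or torch installed. The only cache method assumed is `get_seq_length()`, which `DynamicCache` provides, and the legacy layout is assumed to be one (key, value) pair per layer with shape (batch, num_heads, seq_len, head_dim).

```python
from types import SimpleNamespace


def cached_seq_len(past_key_values):
    """Return the cached sequence length for either cache format."""
    if isinstance(past_key_values, (tuple, list)):
        # Legacy format: indexing works, so read seq_len off the first
        # layer's key tensor, assumed shape (batch, heads, seq_len, head_dim).
        return past_key_values[0][0].shape[-2]
    # Cache object (e.g. DynamicCache): do not index, ask the object instead.
    return past_key_values.get_seq_length()


class FakeCache:
    """Hypothetical stand-in for a cache object; mimics get_seq_length() only."""

    def __init__(self, seq_len):
        self._seq_len = seq_len

    def get_seq_length(self):
        return self._seq_len


def fake_tensor(*shape):
    """Hypothetical stand-in for a tensor; only exposes .shape."""
    return SimpleNamespace(shape=shape)


# One layer of legacy cache holding 7 cached positions.
legacy_cache = ((fake_tensor(1, 4, 7, 16), fake_tensor(1, 4, 7, 16)),)
object_cache = FakeCache(seq_len=5)

legacy_len = cached_seq_len(legacy_cache)
object_len = cached_seq_len(object_cache)
```

Checking for tuple or list rather than for the cache class means nothing has to be imported from transformers, so the same code path also works on transformers versions that predate the cache classes.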


HuggingFaceDocBuilderDev · 2024-06-06T09:13:54Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

younesbelkada

Thanks a lot for fixing!
Importing the Cache class from transformers.cache_utils might bloat our code, since that object only exists in newer transformers versions. This solution looks great.

past_key_value can also be a list, not only a tuple

671e48e

younesbelkada approved these changes Jun 6, 2024


BenjaminBossan marked this pull request as ready for review June 6, 2024 13:47

BenjaminBossan changed the title ~~[WIP] FIX Failing Llama tests due to new kv cache~~ FIX Failing Llama tests due to new kv cache Jun 6, 2024

BenjaminBossan merged commit 03798a9 into huggingface:main Jun 6, 2024
14 checks passed

BenjaminBossan deleted the fix-llama-tests-kv-cache branch June 6, 2024 13:50
