use_cache can be specified as a kwarg either at model load time or at generation time. We can probably make some adjustments so this happens more automatically. Thanks!
Environment
llmfoundry:latest
To reproduce
Steps to reproduce the behavior:
Convert a model with llm-foundry/scripts/inference/convert_composer_to_hf.py. When generating with the converted model, it throws an exception that use_cache must be enabled in the HF config.

Expected behavior
Model generates output texts.
Manually editing the HF config and enabling the cache did the trick for me.