Removed extra spaces from Mistral instruction template that were causing model to misbehave #5517

fschuh · 2024-02-16T04:06:52Z

Mistral seems to be very sensitive to any spaces after [/INST] in its instruction template.

Note that the default instruction template provided by Mistral AI (found in the model's tokenizer_config.json) does not contain any spaces before [INST] and after [/INST]:

{{ bos_token }}{% for message in messages %}{% if (message['role'] == 'user') != (loop.index0 % 2 == 0) %}{{ raise_exception('Conversation roles must alternate user/assistant/user/assistant/...') }}{% endif %}{% if message['role'] == 'user' %}{{ '[INST] ' + message['content'] + ' [/INST]' }}{% elif message['role'] == 'assistant' %}{{ message['content'] + eos_token}}{% else %}{{ raise_exception('Only user and assistant roles are supported!') }}{% endif %}{% endfor %}

Unfortunately, that default template doesn't support system messages, but the one in text-generation-webui does.
However, the instruction template in text-generation-webui has some extra spaces in it.
While running some tests with langchain, I found Mistral-7b-instruct responding incorrectly with this instruction template.
Mixtral-8x7B-instruct was similarly affected.

Here's is a prompt that can reproduce the issue (tested with ExLlamav2_HF):

Correct instruction template (with no extra spaces before [INST] and after [/INST]):

Prompt:

[INST] You are a teacher coming up with questions to ask on a quiz.
Given the following document, please generate a question and answer based on that document.

Example Format:
<Begin Document>
...
<End Document>
QUESTION: question here
ANSWER: answer here

These questions should be detailed and be based explicitly on information in the document. Begin!

<Begin Document>
Our new TEK O2 technology makes our four-season waterproof pants even more breathable. It's guaranteed to keep you dry and comfortable – whatever the activity and whatever the weather. Size & Fit: Slightly Fitted through hip and thigh. \n\nWhy We Love It: Our state-of-the-art TEK O2 technology offers the most breathability we've ever tested. Great as ski pants, they're ideal for a variety of outdoor activities year-round.
<End Document> [/INST]

Response (correct):

QUESTION: What is the guarantee of the four-season waterproof pants with TEK O2 technology?
ANSWER: The four-season waterproof pants with TEK O2 technology are guaranteed to keep you dry and comfortable - whatever the activity and whatever the weather.

Incorrect instruction template (with an extra space before [INST] and after [/INST]):

Prompt:

 [INST] You are a teacher coming up with questions to ask on a quiz.
Given the following document, please generate a question and answer based on that document.

Example Format:
<Begin Document>
...
<End Document>
QUESTION: question here
ANSWER: answer here

These questions should be detailed and be based explicitly on information in the document. Begin!

<Begin Document>
Our new TEK O2 technology makes our four-season waterproof pants even more breathable. It's guaranteed to keep you dry and comfortable – whatever the activity and whatever the weather. Size & Fit: Slightly Fitted through hip and thigh. \n\nWhy We Love It: Our state-of-the-art TEK O2 technology offers the most breathability we've ever tested. Great as ski pants, they're ideal for a variety of outdoor activities year-round. 
<End Document> [/INST]

Response (incorrect):

1. What type of pants is the company's new TEK O2 technology designed for?
Answer: The company's new TEK O2 technology is designed for their four-season waterproof pants.

Note that the incorrect response above does not follow the format in the prompt's instructions.
Removing the extra spaces from the instruction template fixes it.

Checklist:

I have read the Contributing guidelines.

Merge dev branch

Merge dev branch (oobabooga#5257)

Merge dev branch

…ing Mistral to misbehave

BadisG · 2024-02-16T15:26:08Z

Note that the default instruction template provided by Mistral AI (found in the model's tokenizer_config.json) does not contain any spaces before [INST] and after [/INST]:

I thought it did contain space before [INST]

https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1

text = "< s >[INST] What is your favourite condiment? [/INST]"

Unless I'm misrepresenting what < s > means?

fschuh · 2024-02-16T18:49:31Z

Note that the default instruction template provided by Mistral AI (found in the model's tokenizer_config.json) does not contain any spaces before [INST] and after [/INST]:

I thought it did contain space before [INST]

https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1

text = "< s >[INST] What is your favourite condiment? [/INST]"

Unless I'm misrepresenting what < s > means?

You can add a space before [INST] and it won't break it.
What breaks it is the space after [/INST]. Please refer to my prompt examples above.

For consistency with Mistral's official template though, I've removed both extra spaces on this fix.
Take a closer look at Mistral's official template and you'll see that it doesn't have any spaces before [INST] and after [/INST].

oobabooga · 2024-02-16T22:26:20Z

instruction-templates/Mistral.yaml

@@ -4,7 +4,7 @@ instruction_template: |-
          {{- message['content'] -}}
      {%- else -%}
          {%- if message['role'] == 'user' -%}
-              {{-' [INST] ' + message['content'].rstrip() + ' [/INST] '-}}
+              {{-'[INST] ' + message['content'].rstrip() + ' [/INST]'-}}
          {%- else -%}
              {{-'' + message['content'] + '</s>' -}}


Maybe the space should go here then? Like

{{-' ' + message['content'] + '</s>' -}}

Maybe the space should go here then? Like

{{-' ' + message['content'] + '</s>' -}}

Mistral's official template doesn't add extra spaces for the assistant messages:

{% elif message['role'] == 'assistant' %}{{ message['content'] + eos_token}}

For consistency, I'd keep it as is, which is like Mistral's default.

Yes, that's correct. But there seems to be a space after the </s> at least:

{{ bos_token }} {% for message in messages %} {% if (message['role'] == 'user') != (loop.index0 % 2 == 0) %} {{ raise_exception('Conversation roles must alternate user/assistant/user/assistant/...') }} {% endif %} {% if message['role'] == 'user' %} {{ '[INST] ' + message['content'] + ' [/INST]' }} {% elif message['role'] == 'assistant' %} {{ message['content'] + eos_token + ' ' }} {% else %} {{ raise_exception('Only user and assistant roles are supported!') }} {% endif %} {% endfor %}

https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1/blob/main/tokenizer_config.json#L32

Maybe not for the Mixtral one (see below). It's kind of frustrating that not even Mistral seems to know what the Mistral prompt should be.

{{ bos_token }} {% for message in messages %} {% if (message['role'] == 'user') != (loop.index0 % 2 == 0) %} {{ raise_exception('Conversation roles must alternate user/assistant/user/assistant/...') }} {% endif %} {% if message['role'] == 'user' %} {{ '[INST] ' + message['content'] + ' [/INST]' }} {% elif message['role'] == 'assistant' %} {{ message['content'] + eos_token }} {% else %} {{ raise_exception('Only user and assistant roles are supported!') }} {% endif %} {% endfor %}

https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1/blob/main/tokenizer_config.json#L42

oobabooga added 30 commits December 14, 2023 22:39

Merge pull request oobabooga#4927 from oobabooga/dev

c3e0fcf

Merge dev branch

Merge pull request oobabooga#4937 from oobabooga/dev

443be39

Merge dev branch

Merge pull request oobabooga#4961 from oobabooga/dev

7be0983

Merge dev branch

Merge pull request oobabooga#4980 from oobabooga/dev

b28020a

Merge dev branch

Merge pull request oobabooga#4988 from oobabooga/dev

781367b

Merge dev branch

Merge pull request oobabooga#5002 from oobabooga/dev

71eb744

Merge dev branch

Merge pull request oobabooga#5005 from oobabooga/dev

5b791ca

Merge dev branch

Merge pull request oobabooga#5011 from oobabooga/dev

c1f78db

Merge dev branch

Merge pull request oobabooga#5012 from oobabooga/dev

489f4a2

Merge dev branch

Merge pull request oobabooga#5022 from oobabooga/dev

11288d1

Merge dev branch

Merge pull request oobabooga#5039 from oobabooga/dev

4b25acf

Merge dev branch

Merge pull request oobabooga#5073 from oobabooga/dev

af87609

Merge dev branch

Merge pull request oobabooga#5078 from oobabooga/dev

19d1374

Merge dev branch

Merge pull request oobabooga#5100 from oobabooga/dev

3fd7073

Merge dev branch

Merge pull request oobabooga#5132 from oobabooga/dev

3e3a66e

Merge dev branch

Merge pull request oobabooga#5152 from oobabooga/dev

3f28925

Merge dev branch

Merge pull request oobabooga#5163 from oobabooga/dev

c54d1da

Merge dev branch

Merge pull request oobabooga#5181 from oobabooga/dev

8ea3f31

Merge dev branch

Merge pull request oobabooga#5195 from oobabooga/dev

e169993

Merge dev branch

Merge pull request oobabooga#5199 from oobabooga/dev

ad1ff53

Merge dev branch

Merge pull request oobabooga#5220 from oobabooga/dev

2dc8db8

Merge dev branch

Merge pull request oobabooga#5253 from oobabooga/dev

61e4bfe

Merge dev branch

Merge pull request oobabooga#5266 from oobabooga/dev

d8c3a5b

Merge dev branch (oobabooga#5257)

Merge pull request oobabooga#5347 from oobabooga/dev

1343aa3

Merge dev branch

Merge pull request oobabooga#5348 from oobabooga/dev

837bd88

Merge dev branch

Merge pull request oobabooga#5379 from oobabooga/dev

e7a760e

Merge dev branch

Merge pull request oobabooga#5404 from oobabooga/dev

4f3fdf1

Merge dev branch

Merge pull request oobabooga#5452 from oobabooga/dev

a329db0

Merge dev branch

Merge pull request oobabooga#5453 from oobabooga/dev

0f134bf

Merge dev branch

Merge pull request oobabooga#5496 from oobabooga/dev

dc6adef

Merge dev branch

oobabooga and others added 2 commits February 14, 2024 11:32

Merge pull request oobabooga#5502 from oobabooga/dev

771c592

Merge dev branch

Removed extra spaces from Mistral instruction template that were caus…

16b8d4b

…ing Mistral to misbehave

oobabooga reviewed Feb 16, 2024

View reviewed changes

oobabooga merged commit fa1019e into oobabooga:dev Feb 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Removed extra spaces from Mistral instruction template that were causing model to misbehave #5517

Removed extra spaces from Mistral instruction template that were causing model to misbehave #5517

fschuh commented Feb 16, 2024 •

edited

Loading

BadisG commented Feb 16, 2024 •

edited

Loading

fschuh commented Feb 16, 2024

oobabooga Feb 16, 2024

fschuh Feb 16, 2024

oobabooga Feb 17, 2024

oobabooga Feb 17, 2024

Removed extra spaces from Mistral instruction template that were causing model to misbehave #5517

Removed extra spaces from Mistral instruction template that were causing model to misbehave #5517

Conversation

fschuh commented Feb 16, 2024 • edited Loading

Correct instruction template (with no extra spaces before [INST] and after [/INST]):

Prompt:

Response (correct):

Incorrect instruction template (with an extra space before [INST] and after [/INST]):

Prompt:

Response (incorrect):

Checklist:

BadisG commented Feb 16, 2024 • edited Loading

fschuh commented Feb 16, 2024

oobabooga Feb 16, 2024

Choose a reason for hiding this comment

fschuh Feb 16, 2024

Choose a reason for hiding this comment

oobabooga Feb 17, 2024

Choose a reason for hiding this comment

oobabooga Feb 17, 2024

Choose a reason for hiding this comment

fschuh commented Feb 16, 2024 •

edited

Loading

BadisG commented Feb 16, 2024 •

edited

Loading