
Bug: Why doesn't llamafile remove end tokens like <|eot_id|> or <end_of_turn>? #630

Open
jeezrick opened this issue Nov 15, 2024 · 0 comments

Comments

@jeezrick

Contact Details

[email protected]

What happened?

I'm using llamafile through its Python API. For both models I use, the end token is retained in the response string, so I have to remove it manually. Is this something I'm doing wrong?
Like this:

        if self.model_string == "LLaMA_CPP":  # why doesn't llamafile remove the end token?
            self.response_str = self.response_str.replace("<|eot_id|>", "")
        if self.model_string == "gemma-2b-it":
            self.response_str = self.response_str.replace("<end_of_turn>", "")

Version

llamafile v0.8.4

What operating system are you seeing the problem on?

Linux

Relevant log output

model_gemma("I have a head of broccoli, and a cabbage. How many fruits do I have?")

output:

'You have **zero** fruits! 🥦 🥬 \n\nBroccoli and cabbage are both vegetables, not fruits. \n<end_of_turn>'