This causes problems with at least one model (Yi), see discussion here: 01-ai/Yi#5
The automatic BOS that gets prepended apparently confuses the model.
`SpecialVocab` in `gguf.py` already loads `tokenizer_config.json` (although only as a fallback currently). The main question is probably how to add it to the GGUF file - what key, etc.
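As a rough sketch of the conversion-time side, the flag could be read from `tokenizer_config.json` and carried into the GGUF metadata. The field name `add_bos_token` is what Hugging Face tokenizer configs use; the GGUF key it would map to is exactly the open question above, so none is named here.

```python
import json

# Inline stand-in for a model's tokenizer_config.json (Yi sets this to false).
raw = '{"add_bos_token": false, "add_eos_token": false}'
config = json.loads(raw)

# Defaulting to True matches the current behavior of always prepending BOS
# when the config does not say otherwise.
add_bos = config.get("add_bos_token", True)
print(add_bos)
```

Whatever key is chosen, the loader would then only prepend BOS when this flag is absent or true, instead of unconditionally.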
Have been testing Yi-6B over the server API; it becomes very repetitive and incoherent quickly. So instead of passing the prompt as a string (in Python) I tried passing it as a list, like `prompt = [2, "my prompt here"]` (2 being the BOS token ID override, I believe). This seems to completely fix the model, with no more repetition or incoherence issues. So yeah, something is wrong with the automatic BOS.
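For reference, a minimal sketch of that workaround: the llama.cpp server's `/completion` endpoint accepts a mixed list of token IDs and strings as the prompt, which lets you supply BOS yourself. The host/port are placeholders, and 2 is assumed here to be Yi's BOS token ID as described above.

```python
import json

# Prepend the BOS token ID manually instead of relying on automatic BOS.
payload = {
    "prompt": [2, "my prompt here"],  # 2 = BOS token ID (assumed, per the comment)
    "n_predict": 128,
}

body = json.dumps(payload)
# This body would be POSTed to the running server, e.g.:
#   requests.post("http://localhost:8080/completion", data=body)
print(body)
```

The fix works because the tokenized string no longer gets a second, automatically prepended BOS in front of the one the model actually expects.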
These models are looking pretty interesting now that the 200K (!) context versions were released. If it works as described, a 34B with 200K context is pretty insane.