4K context doesn't work? #3329
Comments
Try changing the compress_pos_emb value to 2 instead of 1.
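If you want that setting to persist rather than changing it in the UI each time, a per-model override along these lines may work. This is a minimal sketch, assuming the usual models/config.yaml structure where each top-level key is a pattern matched against the model name; the pattern below is illustrative and should be adjusted to your model folder.

```yaml
# models/user-config.yaml (or models/config.yaml) -- sketch, not a verified entry
# The pattern key is an assumption; it should match your model's folder name.
.*llama-2-13b.*:
  compress_pos_emb: 2   # positional embedding compression suggested above
```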
See #3153. Even after changing compress_pos_emb to 2, it still won't work.
I created
You can also update truncation_length on a per-model basis using the models/user-config.yaml file, which may better suit your needs. If you haven't already, you should also update your models/config.yaml and characters/instruction-following/*.yaml files. I also just realized that this is a Llama-2 model; you don't need compress_pos_emb to get 4K with those. Start by updating your models/config.yaml and you may find it just works.
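For reference, a sketch of what a per-model truncation_length override could look like, using the same assumed pattern-key structure as above; since this is a Llama-2 model, compress_pos_emb is left at its default.

```yaml
# models/user-config.yaml -- per-model override, sketch based on the settings named above
.*llama-2-13b-gptq.*:
  truncation_length: 4096   # allow the full 4K context when truncating prompts
```

The user file appears intended to hold your own overrides so they are not lost when the stock models/config.yaml is updated.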
Set it how you want it and click "Save settings" on the models tab. The settings will be loaded automatically when you load the model.
Did you download the model when it first dropped? It looks like the old config.json had 2048 set, and it was updated to 4096 a couple of days later.
This issue has been closed due to inactivity for 6 weeks. If you believe it is still relevant, please leave a comment below. You can tag a developer in your comment.
Describe the bug
Not sure if I'm doing something wrong, but I downloaded a 4K context size model, and any time I try to make an API request I still get errors about the context size being too large.
Is there an existing issue for this?
Reproduction
Download TheBloke_Llama-2-13B-GPTQ, set the max length to 4096; it doesn't work.
Screenshot
Logs
System Info