Finetuning code? #1

Open · StrangeTcy opened this issue Jul 7, 2023 · 7 comments

@StrangeTcy

This sounds massively interesting. While we try to run inference and read the paper, should we expect a release of the finetuning code?

@syzymon
Collaborator

syzymon commented Jul 8, 2023

Hi, thanks for your interest in our work! That's right, we currently support only inference. We are considering releasing finetuning examples for our models using the PyTorch / Hugging Face API.
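For reference, minimal inference along the lines of the model card on the Hugging Face Hub looks roughly like this (a sketch; the checkpoint name and arguments follow the published README, so double-check them against the repo):

```python
# Minimal inference sketch for the released checkpoint; the model ships
# custom modeling code, hence trust_remote_code=True.
import torch
from transformers import LlamaTokenizer, AutoModelForCausalLM

tokenizer = LlamaTokenizer.from_pretrained("syzymon/long_llama_3b")
model = AutoModelForCausalLM.from_pretrained(
    "syzymon/long_llama_3b",
    torch_dtype=torch.float32,
    trust_remote_code=True,
)

prompt = "My name is Julien and I like to"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(input_ids=inputs.input_ids, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```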

@memray

memray commented Jul 11, 2023

@syzymon Is there any plan to release the training pipeline (is it based on the EasyLM library)?
Thank you!

@SUSTechBruce

Hoping to see your finetuning code ASAP, since your work is very interesting!

@syzymon
Collaborator

syzymon commented Jul 25, 2023

The continued pretraining pipeline (used to train the long_llama_3b base model) is based on EasyLM.

We are planning to release the instruction tuning code in PyTorch, along with checkpoints and examples, early next week. Stay tuned!
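(For illustration while waiting: a generic sketch of what instruction tuning through the Hugging Face Trainer could look like. The dataset, hyperparameters, and padding choice below are placeholder assumptions, not the released recipe.)

```python
# Generic supervised finetuning sketch with the Hugging Face Trainer.
# Dataset and hyperparameters are placeholders, not the authors' recipe.
import torch
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    DataCollatorForLanguageModeling,
    LlamaTokenizer,
    Trainer,
    TrainingArguments,
)

model_name = "syzymon/long_llama_3b"
tokenizer = LlamaTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # Llama tokenizers ship no pad token
model = AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype=torch.bfloat16, trust_remote_code=True
)

# Placeholder instruction dataset with a preformatted "text" column.
dataset = load_dataset("tatsu-lab/alpaca", split="train[:1%]")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=1024)

tokenized = dataset.map(tokenize, batched=True, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="longllama-instruct",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=16,
        num_train_epochs=1,
        learning_rate=2e-5,
        logging_steps=10,
    ),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm=False),
)
trainer.train()
```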

@puddleglum56

Will you also be releasing the pretraining code? Since the contrastive training seems to be a very important element of your great results, it would be nice if we could try to recreate it.

@syzymon
Collaborator

syzymon commented Jul 30, 2023

We are working on LongLLaMA v2, which will be a bigger release. After that we will release the pretraining code, which is in JAX and based on the EasyLM codebase (the same one used for OpenLLaMA pretraining). You can expect the instruction finetuning code in PyTorch to be out very soon (basically next week). There are no plans to implement FoT pretraining in PyTorch on our side, as our compute is TPU-based. Stay tuned for LongLLaMA v2, which will definitely be out in August!
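(For anyone who wants to experiment with the idea before then: the core of FoT's contrastive "crossbatch" training, as described in the paper, is that designated memory attention layers also see keys and values coming from other documents in the batch, which act as negatives. Below is an independent toy PyTorch illustration of that cross-batch exposure, not the authors' implementation; multi-head attention and causal masking are omitted for brevity.)

```python
# Toy sketch of FoT-style cross-batch attention: each sequence's queries
# attend over the keys/values of every sequence in the batch, so keys
# from other documents act as contrastive negatives.
import torch
import torch.nn.functional as F

def cross_batch_attention(q, k, v):
    """q, k, v: (batch, seq, dim). Single head, no causal mask."""
    b, s, d = k.shape
    # Flatten all keys/values in the batch into one shared memory.
    all_k = k.reshape(1, b * s, d).expand(b, -1, -1)  # (b, b*s, d)
    all_v = v.reshape(1, b * s, d).expand(b, -1, -1)
    scores = q @ all_k.transpose(-1, -2) / d ** 0.5   # (b, s, b*s)
    return F.softmax(scores, dim=-1) @ all_v          # (b, s, d)

q, k, v = (torch.randn(4, 16, 32) for _ in range(3))
print(cross_batch_attention(q, k, v).shape)  # torch.Size([4, 16, 32])
```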

@syzymon
Collaborator

syzymon commented Aug 5, 2023

In case you haven't seen it, the instruction tuning code is already out! See https://twitter.com/s_tworkowski/status/1687620785379360768 and the READMEs in this repo for more details.
