Skip to content
forked from flippe3/chat-ltu

Open source implementation of InstructGPT

License

Notifications You must be signed in to change notification settings

xunyuw/chat-ltu

 
 

Repository files navigation

Chat-LTU

This is a chatbot project for the course D7058E at Luleå Univeristy of Technology. We try to implement something similar to Instruct-GPT or Chat-GPT mostly based on the papers and the rlhf blogpost from Huggingface.

Todo:

  • Implement PPO2 for faster RL fine-tuning.
  • Implement the website that is partially done to gather real human data.
  • Upload reward model and fine-tuned model to Huggingface for open source use.

About

Open source implementation of InstructGPT

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 88.1%
  • HTML 11.9%