Reward-learning SFT

This repository contains the implementation of "Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment", which was accepted to NeurIPS 2024. The code is partially built upon SPIN.

Setup

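The following steps set up the environment needed to run our code: create and activate a conda environment, install this package together with flash-attn, log in to Hugging Face with your access token, and then launch the RFT and IRFT scripts.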

conda create -n myenv python=3.10
conda activate myenv

python -m pip install .
python -m pip install flash-attn --no-build-isolation
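
After the install commands above, a quick sanity check can catch a broken environment before the run scripts are launched. The following is a minimal sketch, not part of this repository: the helper names `missing_packages` and `python_version_ok` are hypothetical, and the package list (`torch`, `transformers`, `flash_attn`) is an assumption that should be adjusted to match what requirements.txt actually lists.

```python
import importlib.util
import sys


def missing_packages(names):
    """Return the top-level module names from `names` that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def python_version_ok(expected=(3, 10)):
    """Check that the interpreter matches the major.minor version used in `conda create`."""
    return tuple(sys.version_info[:2]) == tuple(expected)


if __name__ == "__main__":
    # Assumed package list; edit it to mirror requirements.txt.
    required = ["torch", "transformers", "flash_attn"]
    print("Python 3.10:", python_version_ok())
    print("Missing packages:", missing_packages(required) or "none")
```

Saving this as a standalone script and running it inside the activated conda environment lists any module that failed to install. Note that `flash_attn` is the import name of the `flash-attn` distribution.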

huggingface-cli login --token "${your_access_token}"

bash run_RFT.sh

bash run_IRFT.sh
