This is a repository containing the implementation for Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment, which has been accepted to NeurIPS 2024. The code is partially built upon SPIN.
The following steps provide the necessary setup to run our codes.
- Create a Python virtual environment with Conda:
conda create -n myenv python=3.10
conda activate myenv
- Install the following Python dependencies to run the codes.
python -m pip install .
python -m pip install flash-attn --no-build-isolation
- Login to your huggingface account for downloading models
huggingface-cli login --token "${your_access_token}"
bash run_RFT.sh
bash run_IRFT.sh