fix: the source part should not participate in loss calculation in SFT stage #762

xffxff · 2023-10-10T11:14:26Z

In the SFT stage, it's essential that the source part doesn't contribute to the loss calculation, only the completion part should be considered. To address this issue, I've adjusted the labels for the source part to be set as -100. This specific value, -100, corresponds to the default "ignore index" in the torch.nn.CrossEntropyLoss function. Importantly, both OPT and LLAMA models utilize torch.nn.CrossEntropyLoss for their loss calculations, as seen in OPTForCausalLM and LLamaForCausalLM. As a result, there is no need to make any modifications to the way the loss is computed, as it will automatically handle the source part as intended.

The training loss of opt-350m with Dahoas/rm-static as dataset

…T stage

AndyW-llm · 2023-11-28T20:25:36Z

The function is now moved to "DeepSpeedExamples/applications/DeepSpeed-Chat/dschat/utils/data/data_utils.py"
This solution seems to assume single-turn conversation, please consider cases for multi-turn conversation.

fix: the prompt part should not participate in loss calculation in SF…

8e58c74

…T stage

xffxff requested review from jeffra, samyam, tjruwase, ShadenSmith, conglongli, awan-10, eltonzheng, minjiaz, RezaYazdaniAminabadi, duli2012, mrwyattii, yaozhewei, arashb and xiaoxiawu-microsoft as code owners October 10, 2023 11:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: the source part should not participate in loss calculation in SFT stage #762

fix: the source part should not participate in loss calculation in SFT stage #762

xffxff commented Oct 10, 2023 •

edited

Loading

AndyW-llm commented Nov 28, 2023

fix: the source part should not participate in loss calculation in SFT stage #762

Are you sure you want to change the base?

fix: the source part should not participate in loss calculation in SFT stage #762

Conversation

xffxff commented Oct 10, 2023 • edited Loading

AndyW-llm commented Nov 28, 2023

xffxff commented Oct 10, 2023 •

edited

Loading