Efficient-NLP-multistage-training

Source code of the IJCAI 2023 paper: "Efficient NLP Model Finetuning via Multistage Data Filtering"

Main Organization of the Code

We provide the three stage training python scripts for glue/amazon/ag news datasets. dataset.py is for data preprocessing.

Reference

If you find the code useful, please cite the following papers:

Efficient NLP Model Finetuning via Multistage Data Filtering. Xu Ouyang, Shahina Mohd Azam Ansari, Felix Xiaozhu Lin, Yangfeng Ji. the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)

@inproceedings{ouyang2023efficient,
  title={Efficient NLP Model Finetuning via Multistage Data Filtering},
  author={Ouyang, X and Ansari, S and Lin, F and Ji, Y},
  booktitle={INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.DS_Store		.DS_Store
README.md		README.md
dataset.py		dataset.py
three-stage-training-ag.py		three-stage-training-ag.py
three-stage-training-amz.py		three-stage-training-amz.py
three-stage-training-glue.py		three-stage-training-glue.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Efficient-NLP-multistage-training

Main Organization of the Code

Reference

About

Releases

Packages

Languages

HarperCy/efficient-NLP-multistage-training

Folders and files

Latest commit

History

Repository files navigation

Efficient-NLP-multistage-training

Main Organization of the Code

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages