Official repo for paper: Search for Efficient Large Language Models
This paper is accepted by NeurIPS 2024
- Revise search space at
experiments/llama-7b.yaml
- Directly start search with
sh run-search-llama.sh
@inproceedings{
shen2024search,
title = {Search for Efficient Large Language Models},
author = {Shen, Xuan and Zhao, Pu and Gong, Yifan and Kong, Zhenglun and Zhan, Zheng and Wu, Yushu and Lin, Ming and Wu, Chao and Lin, Xue and Wang, Yanzhi},
booktitle = {NeurIPS},
year = {2024},
}
The code is mainly based on the NAS work AutoFormer.