Official PyTorch implementation of "Meta-Learning with Self-Improving Momentum Target" (NeurIPS 2022) by Jihoon Tack, Jongjin Park, Hankook Lee, Jaeho Lee, Jinwoo Shin.
TL;DR: We propose a meta-learning algorithm to generate a target model from which we distill the knowledge to the meta-model, forming a virtuous cycle of improvements.
conda create -n simt python=3.8 -y
conda activate simt
pip install torch==1.8.1+cu111 torchvision==0.9.1+cu111 -f https://download.pytorch.org/whl/torch_stable.html
pip install torchmeta tensorboardX
Download the following datasets and place at /data
folder
- Regression
- Classification
- mini-ImageNet, tiered-ImageNet, CUB, Cars
- All classification datasets require preprocessing with torchmeta library
The options for the training method are as follows:
<MODE>
: {maml
,anil
,metasgd
,protonet
}<MODEL>
: {conv4
,resnet12
}<DATASET>
: {shapenet
,pose
,miniimagenet
,tieredimagenet
}, note thatpose
indicates Pascal dataset.- One can use
--simt
option to train the backbone meta-learning scheme<MODE>
with SiMT.
python main.py --mode <MODE> --model <MODEL> --dataset <DATASET>
To train SiMT, one should choose the appropriate hyperparameters including momentum coefficient ETA
, weight hyperparameter LAM
, and dropout probability P
.
python main.py --simt --mode <MODE> --model <MODEL> --dataset <DATASET> --eta ETA --lam LAM --drop_p P
The options for the evaluation are as follows:
<PATH>
: the path of the pre-trained checkpoints with the best validation accuracy (e.g.,./logs/experiment_name/best.model
).<MODE>
: {maml
,anil
,metasgd
,protonet
}<MODEL>
: {conv4
,resnet12
}<DATASET>
: {shapenet
,pose
,miniimagenet
,tieredimagenet
,cub
,cars
}, note thatpose
indicates Pascal dataset.- One can use
--simt
option to evaluate with the momentum network.
python eval.py --mode <MODE> --model <MODEL> --dataset <DATASET> --load_path <PATH>
python main.py --simt --mode <MODE> --model <MODEL> --dataset <DATASET> --load_path <PATH>
@inproceedings{tack2022meta,
title={Meta-Learning with Self-Improving Momentum Target},
author={Jihoon Tack and Jongjin Park and Hankook Lee and Jaeho Lee and Jinwoo Shin},
booktitle={Advances in Neural Information Processing Systems},
year={2022}
}