Codebase for video tokenization, based on our paper Towards Tokenized Human Dynamics Representation.
apt-get install ffmpeg
pip install torch==1.8
pip install torchvision
pip install pytorch-lightning
pip install pytorch-lightning-bolts
pip install aniposelib wandb gym test-tube ffmpeg-python matplotlib easydict scikit-learn
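As a quick sanity check after installation, the sketch below reports which of the packages above cannot be found and whether an ffmpeg executable is on PATH. The import names (for example pl_bolts for pytorch-lightning-bolts, test_tube for test-tube, ffmpeg for ffmpeg-python, sklearn for scikit-learn) are my assumption about how these packages are imported, and the helper names are hypothetical; this script is not part of the repo.

```python
import importlib.util
import shutil

# Import names assumed to correspond to the pip packages listed above.
REQUIRED_MODULES = [
    "torch", "torchvision", "pytorch_lightning", "pl_bolts",
    "aniposelib", "wandb", "gym", "test_tube", "ffmpeg",
    "matplotlib", "easydict", "sklearn",
]


def missing_modules(names):
    """Return the top-level module names that cannot be found."""
    return [n for n in names if importlib.util.find_spec(n) is None]


def ffmpeg_on_path():
    """Return True if an ffmpeg executable is on PATH."""
    return shutil.which("ffmpeg") is not None


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    print("Missing Python modules:", missing or "none")
    print("ffmpeg binary found:", ffmpeg_on_path())
```

The check only looks the modules up without importing them, so it stays fast and does not need a GPU.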
- Make a directory beside this repo and name it aistplusplus.
- Download the data from the AIST++ website until the directory looks like this:
├── annotations
│ ├── cameras
│ ├── ignore_list.txt
│ ├── keypoints2d
│ ├── keypoints3d
│ ├── motions
│ └── splits
└── video_list.txt
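To confirm the download is complete, a small script along these lines can compare the directory against the tree above. It is only a sketch: the function name is hypothetical, and it assumes it is run from the repo root with aistplusplus sitting next to the repo.

```python
from pathlib import Path

# Entries taken from the directory tree above, relative to aistplusplus.
EXPECTED = [
    "annotations/cameras",
    "annotations/ignore_list.txt",
    "annotations/keypoints2d",
    "annotations/keypoints3d",
    "annotations/motions",
    "annotations/splits",
    "video_list.txt",
]


def missing_entries(root):
    """Return the expected entries that do not exist under root."""
    root = Path(root)
    return [rel for rel in EXPECTED if not (root / rel).exists()]


if __name__ == "__main__":
    # Assumes the script runs from the repo root, with aistplusplus beside it.
    missing = missing_entries(Path("..") / "aistplusplus")
    print("Missing entries:", missing or "none")
```

It checks only that each entry exists, not that the contents are complete.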
- Write a configuration file, e.g., configs/tan.yaml.
- Run python pretrain.py --cfg configs/tan.yaml on a GPU. This creates a folder under logs for this run; the folder name is specified by NAME in the configuration file. Then run python cluster.py --cfg configs/tan.yaml (CPU-only) and check the results in demo.ipynb.
- Alternatively, download my training results from here and unzip them into the logs folder.
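The two commands can also be chained from a small driver script, sketched below. The helper names are hypothetical and not part of the repo. It assumes the script is launched from the repo root, that both scripts accept only the --cfg flag shown above, and that the run folder sits directly at logs/NAME.

```python
import subprocess
import sys
from pathlib import Path


def build_commands(cfg):
    """Return the pretraining and clustering commands for one config file."""
    return [
        [sys.executable, "pretrain.py", "--cfg", cfg],  # needs a GPU
        [sys.executable, "cluster.py", "--cfg", cfg],   # CPU-only
    ]


def run_dir(name, logs_root="logs"):
    """Assumed location of a run folder: logs/<NAME>."""
    return Path(logs_root) / name


def run_pipeline(cfg):
    """Run both stages in order, stopping at the first failure."""
    for cmd in build_commands(cfg):
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Dry run: print the commands; call run_pipeline(...) to execute them.
    for cmd in build_commands("configs/tan.yaml"):
        print(" ".join(cmd))
```

Because subprocess.run is called with check=True, clustering is skipped if pretraining exits with an error.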