Our dataset is developed based on NUS LV Multiple-Human Parsing Dataset v2.0. Please download the source data from the link.
We use MMPose to estimate the pose using the "vitpose_h" model. You can download the pose estimation results from this link.
After downloading the dataset, unzip the file and put them under the dataset folder with the following structure:
./LV-MHP-v2
├── train
├── images
├── parsing_annos
└── pose_estimation
└── shhq_dataset
├── images
├── parsing_annos
└── pose_estimation
Based on the preprocessed data, we propose a comprehensive training data generation engine to synthesize paired data. The data generation codes can be found here.