This repo is the official implementation for the CVPR2022 paper "GOAL: Generating 4D Whole-Body Motion for Hand-Object Grasping".
Omid Taheri · Vassilis Choutas · Michael J. Black · Dimitrios Tzionas
GOAL is a generative model that generates full-body motion of human body that walk and grasp unseen 3D objects. GOAL consists of two main steps:
- GNet generates the final grasp of the motion.
- MNet generates the motion from the starting to the grasp frame. It is trained on the GRAB dataset. For more details please refer to the Paper or the project website.
Below you can see some generated whole-body static grasps from GNet. The hand close-ups are from the same grasp, and for better visualization:
Apple | Binoculars | Toothpaste |
---|---|---|
Below you can see some generated whole-body motions that walk and grasp 3D objects using MNet:
Camera | Mug | Apple |
---|---|---|
For more details check out the YouTube video below.
- Description
- Requirements
- Installation
- Getting Started
- Examples
- Citation
- License
- Acknowledgments
- Contact
This implementation:
- Can run GOAL on arbitrary objects provided by users (incl. computing on the fly the BPS representation for them).
- Provides a quick and easy demo on google colab to generate fullbody grasps by GNet for any given object (MNet results coming soon).
- Can run GOAL on the test objects of our dataset (with pre-computed object centering and BPS representation).
- Can retrain GNet and MNet, allowing users to change details in the training configuration.
This package has the following requirements:
- Pytorch>=1.7.1
- Python >=3.7.0
- pytroch3d >=0.2.0
- MANO
- SMPLX
- bps_torch
- psbody-mesh
To install the dependencies please follow the next steps:
- Clone this repository:
git clone https://github.com/otaheri/GOAL cd GOAL
- Install the dependencies by the following command:
pip install -r requirements.txt
For a quick demo of GNet you can give it a try on google-colab here (Coming Soon).
Inorder to use the GOAL models please follow the steps below:
- Download the GRAB dataset from the GRAB website, and follow the instructions there to extract the files.
- Process the GNet data by running the command below.
python data/process_gnet_data.py --grab-path /path/to/GRAB --smplx-path /path/to/smplx/models/
- Process the MNet data by running the command below.
python data/process_mnet_data.py --grab-path /path/to/GRAB --smplx-path /path/to/smplx/models/
- Please download the GNet and MNet models from our website and put them in the folders as below.
GOAL
├── models
│ │
│ ├── GNet_model.pt
│ ├── MNet_model.pt
│ └── ...
│
│
.
.
.
- Download body models following the steps on the SMPLX repo (skip this part if you already followed this for GRAB dataset).
After installing the GOAL package, dependencies, and downloading the data and the models from SMPLX website, you should be able to run the following examples:
-
python test/GOAL.py --work-dir /path/to/work/dir --grab-path /path/to/GRAB --smplx-path /path/to/models/
-
To retrain GrabNet with a new configuration, please use the following code.
python train/GNet_train.py --work-dir path/to/work/dir --grab-path /path/to/GRAB --smplx-path /path/to/models/ --expr-id EXPERIMENT_ID
-
python train/GNet_inference.py --work-dir path/to/work/dir --grab-path /path/to/GRAB --smplx-path /path/to/models/ --expr-id EXPERIMENT_ID
-
To retrain GrabNet with a new configuration, please use the following code.
python train/MNet_train.py --work-dir path/to/work/dir --grab-path /path/to/GRAB --smplx-path /path/to/models/ --expr-id EXPERIMENT_ID
-
python train/MNet_inference.py --work-dir path/to/work/dir --grab-path /path/to/GRAB --smplx-path /path/to/models/ --expr-id EXPERIMENT_ID
@inproceedings{taheri2021goal,
title = {{GOAL}: {G}enerating {4D} Whole-Body Motion for Hand-Object Grasping},
author = {Taheri, Omid and Choutas, Vasileios and Black, Michael J. and Tzionas, Dimitrios},
booktitle = {Conference on Computer Vision and Pattern Recognition ({CVPR})},
year = {2022},
url = {https://goal.is.tue.mpg.de}
}
@inproceedings{GRAB:2020,
title = {{GRAB}: {A} Dataset of Whole-Body Human Grasping of Objects},
author = {Taheri, Omid and Ghorbani, Nima and Black, Michael J. and Tzionas, Dimitrios},
booktitle = {European Conference on Computer Vision ({ECCV})},
year = {2020},
url = {https://grab.is.tue.mpg.de}
}
Software Copyright License for non-commercial scientific research purposes. Please read carefully the terms and conditions in the LICENSE file and any accompanying documentation before you download and/or use the GOAL data, model and software, (the "Data & Software"), including 3D meshes (body and objects), images, videos, textures, software, scripts, and animations. By downloading and/or using the Data & Software (including downloading, cloning, installing, and any other use of the corresponding github repository), you acknowledge that you have read these terms and conditions, understand them, and agree to be bound by them. If you do not agree with these terms and conditions, you must not download and/or use the Data & Software. Any infringement of the terms of this agreement will automatically terminate your rights under this License.
This research was partially supported by the International Max Planck Research School for Intelligent Systems (IMPRS-IS) and the Max Planck ETH Center for Learning Systems (CLS); Omid Taheri is with IMPRS-IS and Vassilis Choutas is with CLS.
We thank:
- Tsvetelina Alexiadis for helpign with the MTurk study.
- Taylor McConnell for the voice-overs.
- Joachim Tesch for the help with renderings.
- Jonathan Williams for the website design, and Benjamin Pellkofer for the IT and web support.
The code of this repository was implemented by Omid Taheri and Vassilis Choutas.
For questions, please contact [email protected].
For commercial licensing (and all related questions for business applications), please contact [email protected].