
DoughNet 🍩 A Visual Predictive Model for Topological Manipulation of Deformable Objects

Dominik Bauer¹, Zhenjia Xu¹,², Shuran Song¹,²
¹Columbia University, ²Stanford University

Project website   •   Paper

Presentation

Dependencies

Create the conda environment

  • conda env create -f environment.yml

Install additional submodules

  • git submodule init && git submodule update
  • conda activate doughnet
  • nvdiffrast: cd net/nvdiffrast && pip install -e . && cd ../..
  • sdftoolbox: cd sim/sdftoolbox && pip install -e . && cd ../..
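A quick import check can verify the editable installs; this is only a sketch, and the import names nvdiffrast.torch and sdftoolbox are taken from the respective upstream projects rather than from this repository.

```python
# Sketch: confirm both submodules are importable in the doughnet environment.
import nvdiffrast.torch as dr  # installed from net/nvdiffrast
import sdftoolbox              # installed from sim/sdftoolbox

print("nvdiffrast and sdftoolbox imported successfully")
```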

Downloads

  • All required files are provided in this folder.
  • Download our dataset for training, as well as for testing on synthetic and real manipulation trajectories, and place it at data/dataset.h5.
  • If you are only interested in the dataset, check the provided README for a simple data loader and a brief description of the data structure; a minimal loading sketch is also shown after this list.
  • Download our pretrained weights to reproduce the results in the paper and place them in weights/{ae,dyn}.pth.
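As a starting point, the following minimal sketch walks the downloaded file and prints every group and dataset; the concrete key layout is documented in the dataset README and is not assumed here.

```python
import h5py

# Sketch: list all groups/datasets in data/dataset.h5 along with their shapes.
# The actual key layout is described in the dataset README, not assumed here.
with h5py.File("data/dataset.h5", "r") as f:
    f.visititems(lambda name, obj: print(name, getattr(obj, "shape", "")))
```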

Evaluation

Using the provided weights, the evaluation reproduces the main results from the paper. Note that the results may vary slightly, since the dataset was re-preprocessed and the weights were re-trained from scratch with this public code base. Alternatively, train the model from scratch or create a new dataset as described below; in that case, make sure to adapt the paths in the config files accordingly.

  • python net/prediction.py --config-name dyn "settings.test_only=True"
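If your weights live elsewhere, the path can presumably also be overridden on the command line in the same hydra style; the settings.test_path key is referenced in the Training section below, but the override form here is an assumption, e.g.:

  • python net/prediction.py --config-name dyn "settings.test_only=True" "settings.test_path=weights/dyn.pth"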

Training

Using the provided dataset, the autoencoder and the dynamics predictor are trained in two stages, as shown below. Alternatively, generate a custom dataset as described in the Generation section.

Note that for multi-GPU training, e.g., using 2 GPUs, the settings.ddp flag needs to be set in the config. Run the scripts below with CUDA_VISIBLE_DEVICES=0,1 torchrun --standalone --nproc_per_node=2 {path_to_script} instead of python {path_to_script}.
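For example, to train the autoencoder stage (see below) on two GPUs, with settings.ddp set in the config:

  • CUDA_VISIBLE_DEVICES=0,1 torchrun --standalone --nproc_per_node=2 net/prediction.py --config-name ae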

Geometry-topology Autoencoder

  • python net/prediction.py --config-name ae
  • This saves the weights in the corresponding hydra output directory (i.e., outputs/{date}/{time when run was started}/best.pth). Either copy them to the default path (weights/ae.pth), or adapt the settings.resume_path in net/config/dyn.yaml accordingly before starting the next stage.
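For example, to copy a run's weights to the default path (substituting your run's actual date and time):

  • cp outputs/{date}/{time}/best.pth weights/ae.pth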

Dynamics Prediction

  • python net/prediction.py --config-name dyn
  • Again, the weights are saved in the corresponding hydra output directory. Follow the directions above to make sure that settings.test_path points to the desired weights when running subsequent evaluations.
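For example, reusing the override syntax from the Evaluation section with a run's output directory (the command-line form of the settings.test_path override is an assumption):

  • python net/prediction.py --config-name dyn "settings.test_only=True" "settings.test_path=outputs/{date}/{time}/best.pth"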

Generation

Our simulation with topology annotation may be used to generate additional scenes or completely new datasets.

To this end, first derive novel scene definitions from template.yaml, e.g., by adapting to_pos and to_quat (the grasp pose) or close_d (the final opening width); a sketch of this workflow follows below.
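The sketch below derives a new scene by editing the keys named above. It assumes PyYAML is available; the template's location, the key nesting, the example values, and the output file name are all assumptions to be checked against template.yaml itself.

```python
import yaml

# Sketch: derive a new scene from the template by adapting the grasp pose
# and final opening width. Key nesting, values, and units are assumptions.
with open("template.yaml") as f:
    scene = yaml.safe_load(f)

scene["to_pos"]  = [0.0, 0.05, 0.10]     # hypothetical grasp position
scene["to_quat"] = [1.0, 0.0, 0.0, 0.0]  # hypothetical grasp orientation
scene["close_d"] = 0.01                  # hypothetical final opening width

with open("my_scene.yaml", "w") as f:    # hypothetical output name
    yaml.safe_dump(scene, f)
```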

Simulation

  • python sim/generate.py
  • This will create a log.pkl with particle-based information in the scene directory (plus a visualization.gif if render=True is set in the config).
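To inspect the result, a minimal sketch follows; the path is a placeholder for your scene directory, and since the structure of log.pkl is not documented here, the sketch only loads the file and reports its top-level type.

```python
import pickle

# Sketch: load the particle-based log written by sim/generate.py.
# Replace the path with your scene directory; the contents are not assumed.
with open("log.pkl", "rb") as f:
    log = pickle.load(f)

print(type(log))
```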

Processing

  • python sim/process.py
  • This will process the simulated scenes in parallel and create data.h5 with additional mesh-based information.

Citation

@inproceedings{bauer2024doughnet,
  title={DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects},
  author={Bauer, Dominik and Xu, Zhenjia and Song, Shuran},
  booktitle={European Conference on Computer Vision (ECCV)},
  year={2024}
}
