Skip to content

Latest commit

 

History

History
199 lines (125 loc) · 9.17 KB

README.md

File metadata and controls

199 lines (125 loc) · 9.17 KB

Generative Cellular Automata (GCA)

Alt text

Alt text

This repository contains code for our work "Learning to Generate 3D Shapes with Generative Cellular Automata" and "Probabilistic Implicit Scene Completion", which are accepted to ICLR 2021 and 2022 (spotlight), respectively. The first paper introduces a model named Generative Cellular Automata (GCA), which formulates the shape generation process as sampling from the transition kernel of a Markov chain, where the sampling chain eventually evolves to the full shape of the learned distribution. The transition kernel employs the local update rules of cellular automata, effectively reducing the search space in a high-resolution 3D grid space by exploiting the connectivity and sparsity of 3D shapes.

The second paper introduces a model name continuous Generative Cellular Automata (cGCA), which extends GCA to produce continuous geometry from incomplete point cloud. Instead of learning a transition kernel on sparse voxels as in GCA, cGCA learns the transition kernel operating on sparse voxel embedding, which additionally contains a local latent code for each occupied cell. Decoding the last state (sparse voxel embedding) produces continuous surface. By extending the scalability of GCA, our work is the first work to tackle the problem of completing multiple continuous surfaces in scene level.

The repository currently contains pretrained models and datasets for the experiments on ShapeNet and ShapeNet scenes in Probabilistic Implicit Scene Completion. We plan on releasing the experiments for 3DFront soon. Please contact [email protected] if you have any questions :)

Installation & Data Preparation

  1. Anaconda environment installations for training & testing
conda create -n gca python=3.8
conda activate gca

# install torch
# for cuda 11.2
pip install torch==1.7.1+cu110 torchvision==0.8.2+cu110 -f https://download.pytorch.org/whl/torch_stable.html
# for cuda 10.2
pip install torch==1.7.1 torchvision==0.8.2

# install MinkowskiEngine (sparse tensor processing library)
# the model was trained on v0.5.0, but the code runs on v0.5.4 (the latest version as of 22/04/24) as well.
conda install openblas-devel -c anaconda 
export CXX=g++-7
pip install -U git+https://github.com/NVIDIA/MinkowskiEngine -v --no-deps --install-option="--blas_include_dirs=${CONDA_PREFIX}/include" --install-option="--blas=openblas"

# install all other requirements
pip install -r requirements.txt

# install torch-scatter
# for cuda 10.2 (for other cuda versions you can find installation guide in https://github.com/rusty1s/pytorch_scatter)
# note that --no-index option might be crucial
pip install --no-index torch-scatter -f https://pytorch-geometric.com/whl/torch-1.7.1+cu101.html

The repo was tested with NVIDIA 2080ti GPU (11GB). Note that MinkowskiEngine might be difficult to install, please see the issue of the MinkowskiEngine or this repo for help.

  1. Data preparation and Pretrained Models

Link contains the datasets (sdf & preprocessed sparse voxel embedding for ShapeNet, ShapnetNet scene) and pretrained models (GCA, cGCA, cGCA w/ cond.). Place the files with directory as below. This link contains all the data except for conv_onet (input point cloud of ShapeNet scene dataset), which can be downloaded from here and unzipped/renamed as conv_onet from synthetic_room_dataset.zip. All directories not specified as downloadable from conv_onet repo can be obtained from our google drive. Note that we call the Shapenet Scene datasets as synthetic as abbreviation.

\cgca
   - main.py
   ...
   - data/
     - shapenet_sdf/ (contains surface and sdf points of shapenet objects)
       - sofa
       - chair
       - table
     - synthetic_room_sdf/ (won't need it unless if you want to start training from autoencoder)
       - rooms_04/
       - ....
     - conv_onet/ (contains point cloud input, required for training transition model)
       - rooms_04/ (should be downloaded from original conv_onet repo)
       - rooms_05/ (should be downloaded from original conv_onet repo)
       - rooms_06/ (should be downloaded from original conv_onet repo)
       - rooms_07/ (should be downloaded from original conv_onet repo)
       - rooms_08/ (should be downloaded from original conv_onet repo)
       - val/ 
       - test/
     - embeddings/ (contains preprocessed sparse voxel embeddings, required for training transition mode, and pretrained autoencoders)
       - shapenet/
         - sofa-vox_64-sdf-step_700k/ 
         - chair-vox_64-sdf-step_500k/
         - table-vox_64-sdf-step_500k/
       - synthetic/
         - vox_64-step_100k/
   - pretrained_models/
     - ...

Note that, for embeddings we use 10 random translations for data augmentation of the local latent codes.

Training

We provide training scripts for GCA and cGCA on shapenet/shapenet scene dataset.

Training GCA

Alt text

You can train the GCA by running,

python main.py --config configs/gca-shapenet-vox=64.yaml --override "obj_class=chair" -l log/gca-chair

For other datasets/configurations you may use other configs. Here config flag is for determining the default config and override options provide the a functionality for overriding the values of the default configs, for example, in this case, obj_class to chair. You may want to override obj_min_rate to different values to reproduce the results of Table 1 from our paper.

GCA only uses the occupancies of the sparse voxels in the dataset, but the released dataset contains the local embeddings as well for integration with cGCA. If you want to create your own dataset, you only need coordinates of occupied cells of the surface.

Training cGCA

Alt text

Training cGCA works in 2 steps.

  1. Training autoencoder for sparse voxel embedding

To train the autoencoder model, run

python main.py --config configs/cgca_autoencoder-shapenet-vox=64.yaml --override "obj_class=chair" -l log/autoencoder-chair
  1. Training the cGCA transition model

You do not need to train the autoencoder from scratch to train the transition model. We have already preprocessed the ground truth sparse voxel embedding for the dataset in the above data preparation step. The sparse voxel embeddings and pretrained autoencoder models (with configs) are in data/embeddings/shapenet/{obj_class} for shapenet dataset or data/embeddings/synthetic for shapenet scene dataset.

To train the transition model, run

python main.py --config configs/cgca_transition-shapenet-vox=64.yaml --override "obj_class=chair" -l log/cgca-chair

For other datasets/configurations you may use other configs. Here config flag is for determining the default config and override options provide the a functionality for overriding the values of the default configs, for example, in this case, obj_class to chair. You may want to override obj_min_rate to different values to reproduce the results of Table 1 from our paper.

Log visualization

The log files for the tensorboard visualization is available on the log directory. To view the logs, run

tensorboard --logdir ./log

and enter the corresponding website with port on your web browser.

Testing

Download the pretrained models as described in the above. If you only want to perform testing,

For example, to reproduce shapenet chair results, run

python main.py --test --resume-ckpt pretrained_models/cgca-chair/ckpts/ckpt-step-300000 -l log/chair_test

This script 1) calculates the metrics reported in the paper, 2) creates meshes in log/chair_test/step-299999/mesh 3) creates images of voxels using matplotlib in log/chair_test/step-299999/vis .

Citation

If you find this repo useful for your research or use any part of the code, please cite

@inproceedings{
	zhang2021gca,
	title={Learning to Generate 3D Shapes with Generative Cellular Automata},
	author={Dongsu Zhang and Changwoon Choi and Jeonghwan Kim and Young Min Kim},
	booktitle={International Conference on Learning Representations},
	year={2021},
	url={https://openreview.net/forum?id=rABUmU3ulQh}
}
@inproceedings{
	zhang2022cgca,
	title={Probabilistic Implicit Scene Completion},
	author={Dongsu Zhang and Changwoon Choi and Inbum Park and Young Min Kim},
	booktitle={International Conference on Learning Representations},
	year={2022},
	url={https://openreview.net/forum?id=BnQhMqDfcKG}
}

Acknowledgements

Our work is partially based on the open source work: MinkowskiEngine, torch-scatter, deep sdf convolutional occupancy networks. We highly appreciate their contributions.