Vitruvio: Conditional Variational Autoencoder (CVAE) to Generate Building Meshes via Single Perspective Sketches
Paper Automation In Construction ArXiv 2022
This method allows designers to automatically generate 3D representations in real-time based on their initial sketches and thus communicate effectively and intuitively to the client. Vitruvio adapts the Occupancy Network to perform single view reconstruction (SVR), a technique for creating 3D representations from a single image. Vitruvio achieves: (1) an 18% increase in the reconstruction accuracy and (2) a 26% reduction in the inference time compared to the Occupancy Network on one thousand buildings provided by the New York municipality. This research investigates the effect that the building orientation has on the reconstruction quality, discovering that Vitruvio can capture fine-grain details in complex buildings when their native orientation is preserved during training, as opposed to the SVR's standard practice that aligns every building to its canonical pose.
- We introduced learning-based method for single perspective sketch-to-3D applications in the Architecture Engineering and Construction (AEC) industry.
- We adapted a previous state-of-the-art Conditional Variational Autoencoder (Occupancy Network) to scalable datasets.
- We showed qualitatively and quantitatively how the building orientation affects the reconstruction.
We need to consider the orientation of the building for this sketch to 3D method. Use this script to center and align the dataset, while tracking the orientation. Make sure to install all the packages in your environemnt and use python align.py --objs '<FOLDER PATH>' |
---|
You can generate your own dataset taking the .obj files from RealCity3D and run the code in Occupancy Network to get the occupancy field. For any other inqueries please reach out to Alberto Tono using [email protected]
This repository uses the code from the paper
- Occupancy Networks - Learning 3D Reconstruction in Function Space as a baseline.
- Paper Arxiv 2021 submission
- Dataset, Weights Request & Contribution Form [5GB]
- Dataset Split and Orientation Tracker File [26kb]
CIFE Stanford Seed Grant 2022/23, CIFE Stanford Seed Grant 2023/24, Stanford Human-Centered Artificial Intelligence - HAI, NVIDIA Omniverse Team.
@article{TONO2024105498,
title = {Vitruvio: Conditional variational autoencoder to generate building meshes via single perspective sketches},
journal = {Automation in Construction},
volume = {166},
pages = {105498},
year = {2024},
issn = {0926-5805},
doi = {https://doi.org/10.1016/j.autcon.2024.105498},
url = {https://www.sciencedirect.com/science/article/pii/S0926580524002346},
author = {Alberto Tono and Heyaojing Huang and Ashwin Agrawal and Martin Fischer},
keywords = {Artificial intelligence, Neural-aided design, Deep generative design, Deep generative modeling, Conditional variational autoencoder, Sketch-based modeling},
}