Official implementation of Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding (GeoZe).
Guofeng Mei, Luigi Riz, Yiming Wang, Fabio Poiesi
Technologies of Vision (TeV), Foundation Bruno Kessler
{gmei, luriz, ywang, poiesi}@fbk.eu
CVPR 2024 Project Page | Arxiv Paper
- We release the code for zero-shot 3D part segmentation 🔥.
- Our paper has been accepted by CVPR 2024 🔥.
We introduce the first training-free aggregation technique that leverages the point cloud’s 3D geometric structure to improve the quality of the transferred VLM representations.
Our approach first clusters point cloud
Prepare environment for part segmentation
conda install pytorch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 pytorch-cuda=11.8 -c pytorch -c nvidia
pip install open-clip-torch==2.24.0
pip install open3d natsort matplotlib tqdm opencv-python scipy plyfile
Part segmentation on ShapeNet
python part_run.py --datasetpath Your_shapenet_path
- Provide code for part segmentation
- Provide code for shape classification
- Provide code semantic segmentation
- Support in-webiste demo
- Provide code for nuScenes
- Support the latest PyTorch version
We are very much welcome all kinds of contributions to the project.
If you find our code or paper useful, please cite
@inproceedings{mei2024geometrically,
title = {Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding},
author = {Mei, Guofeng and Riz, Luigi and Wang, Yiming and Poiesi, Fabio},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2024}
This repo benefits from PointCLIPV2, CLIP, and OpenScene. Thanks for their wonderful works.