This is the PyTorch code for Self-Supervised 3D Scene Flow Estimation and Motion Prediction using Local Rigidity Prior (T-PAMI 2024). You can also check out the arXiv version at RigidFlowPP-arXiv.
In this repository, we apply RigidFlow++ to self-supervised 3D scene flow estimation. For the codes in self-supervised motion prediction, please refer to RigidFlowPP-Motion. The code is created by Ruibo Li ([email protected]).
-
Python 3.7.16, NVIDIA GPU + CUDA CuDNN, PyTorch (torch == 1.9.0),
-
tqdm, sklearn, pptk, yaml, numba, thop
Create a conda environment for RigidFlow++:
conda create -n RigidPP python=3.7.16
conda activate RigidPP
pip install torch==1.9.0+cu111 -f https://download.pytorch.org/whl/torch_stable.html
pip install tqdm pptk PyYAML numba thop
Compile the furthest point sampling, grouping, and gathering operation for PyTorch. We use the operation from this repo.
cd lib
python setup.py install
cd ../
Install & compile supervoxel segmentation method:
cd Supervoxel_utils
g++ -std=c++11 -fPIC -shared -o main.so main.cc
cd ../
More details about the supervoxel segmentation method, please refer to Supervoxel-for-3D-point-clouds.
By default, the datasets are stored in SAVE_PATH
.
-
FlyingThings3D data provided by HPLFlowNet
Download and unzip the "Disparity", "Disparity Occlusions", "Disparity change", "Optical flow", "Flow Occlusions" for DispNet/FlowNet2.0 dataset subsets from the FlyingThings3D website (we used the paths from this file, now they added torrent downloads). They will be unzipped into the same directory,
RAW_DATA_PATH
. Then run the following script for 3D reconstruction:python data_preprocess/process_flyingthings3d_subset.py --raw_data_path RAW_DATA_PATH --save_path SAVE_PATH/FlyingThings3D_subset_processed_35m --only_save_near_pts
This dataset is denoted FT3Ds in our paper.
-
FlyingThings3D data provided by FlowNet3D
Download and unzip data processed by FlowNet3D to directory
SAVE_PATH
. This dataset is denoted FT3Do in our paper.For the experiments on FT3Do, we generate supervoxels for training samples offline:
python data_preprocess/generate_voxel_for_FT3D_O.py --data_root SAVE_PATH --num_supervoxels 30
-
KITTI scene flow data provided by HPLFlowNet
Download and unzip KITTI Scene Flow Evaluation 2015 to directory
RAW_DATA_PATH
. Run the following script for 3D reconstruction:python data_preprocess/process_kitti.py RAW_DATA_PATH SAVE_PATH/KITTI_processed_occ_final
This dataset is denoted KITTIs in our paper.
-
KITTI scene flow data provided by FlowNet3D
Download and unzip data processed by FlowNet3D to directory
SAVE_PATH
. This dataset is denoted KITTIo in our paper. -
Unlabeled KITTI raw data
In our paper, we use raw data from KITTI for self-supervised scene flow learning. We release the unlabeled training data here for download. This dataset is denoted KITTIr in our paper.
The final data directory will be organized as follows:
SAVE_PATH
|-- FlyingThings3D_subset_processed_35m (FT3D_s)
|-- data_processed_maxcut_35_20k_2k_8192 (FT3D_o)
|-- data_processed_maxcut_35_20k_2k_8192_30 (supervoxel for each pc1 in FT3D_o)
|-- data_processed_maxcut_35_20k_2k_8192_30_back (supervoxel for each pc2 in FT3D_o)
|-- KITTI_processed_occ_final (KITTI_s)
|-- kitti_rm_ground (KITTI_o)
|-- KITTI_Raw (KITTI_r)
Set data_root
in each configuration file to SAVE_PATH
in the data preprocess section.
We adopt Bi-PointFlowNet as the scene flow estimation model. Our trained models can be downloaded from Model trained on FT3Ds, Model trained on FT3Do, and Model trained on KITTIr.
-
Model trained on non-occluded FT3Ds
When evaluating this pre-trained model on FT3Ds testing data, set
dataset
toFlyingThings3DSubset
. And when evaluating this pre-trained model on KITTIs data, setdataset
toKITTI
. Then run:python BiFlow-test/evaluate_bid_pointconv.py config_eval_bid_pointconv.yaml
-
Model trained on occluded FT3Do
When evaluating this pre-trained model on FT3Do testing data, set
dataset
toFlyingThings3D_OCC
. And when evaluating this pre-trained model on KITTIo data, setdataset
toKITTI_OCC
. Then run:python BiFlow-test/evaluate_bid_pointconv_Occ.py config_eval_bid_pointconv_Occ.yaml
-
Model trained on raw KITTIr
Evaluate this pre-trained model on KITTIo:python BiFlow-test/evaluate_bid_pointconv_KITTI.py config_eval_bid_pointconv_KITTI.yaml
Set data_root
in each configuration file to SAVE_PATH
in the data preprocess section.
-
Train model on FT3Ds:
python BiFlow/train_bid_pointconv.py config_train_bid_pointconv.yaml
-
Train model on FT3Do:
python BiFlow/train_bid_pointconv_Occ.py config_train_bid_pointconv_Occ.yaml
-
Train model on KITTIr:
python BiFlow/train_bid_pointconv_KITTI_R.py config_train_bid_pointconv_KITTI.yaml
If you find this code useful, please cite our paper:
@article{li2024self,
title={Self-Supervised 3D Scene Flow Estimation and Motion Prediction using Local Rigidity Prior},
author={Li, Ruibo and Zhang, Chi and Wang, Zhe and Shen, Chunhua and Lin, Guosheng},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
year={2024},
publisher={IEEE}
}
@inproceedings{li2022rigidflow,
title={RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior},
author={Li, Ruibo and Zhang, Chi and Lin, Guosheng and Wang, Zhe and Shen, Chunhua},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={16959--16968},
year={2022}
}
Our code is based on RigidFlow, HPLFlowNet, FlowNet3D, PointPWC, Bi-PointFlowNet, and flownet3d_pytorch. The supervoxel segmentation method is based on Supervoxel-for-3D-point-clouds.