GitHub - hjjpku/multi_view_sort: multi-view cnn, Neural-EM, multi view alignment

This is the repository for our work “Learning the Global Descriptor for 3D Object Recognition based on Multiple Views Decomposition”.

The paper has been accepted by IEEE Transactions on Multimedia (TMM 2020). If you find our work helpful, please consider citing:

@article{huang2020learning,
  title={Learning the Global Descriptor for 3D Object Recognition based on Multiple Views Decomposition},
  author={Huang, Jingjia and Yan, Wei and Li, Thomas H and Liu, Shan and Li, Ge},
  journal={IEEE Transactions on Multimedia},
  year={2020},
  publisher={IEEE}
}

As mentioned in our paper, we developed our project on top of Su et al.’s [1] implementation of MVCNN (we use code from here). We implemented our work with Python 3.5.2 and PyTorch 1.0.1.post2.
We conducted the experiments on a single Tesla K80 GPU with CUDA 9.0.
To limit GPU memory consumption, we employ a two-stage training strategy: a pretrained VGG-M model first extracts the features, and our VMM model is then trained with those features as input. To run our code:

– Install the required dependencies, including:

pytorch
tensorboardX
numpy
scipy
For the exact versions, refer to requirement.txt. (Note: not all the libraries listed in requirement.txt are needed.)
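
Before going further, it can help to confirm that the four packages listed above are visible to the interpreter you plan to use. The following is a minimal sketch, not part of this repository; the helper name `missing_packages` is hypothetical, and it assumes that "pytorch" is imported under the module name `torch`.

```python
import importlib.util

# Import names of the packages listed above.
# Assumption: the "pytorch" package is imported as "torch".
REQUIRED = ["torch", "tensorboardX", "numpy", "scipy"]


def missing_packages(names):
    """Return the top-level module names that cannot be located."""
    return [n for n in names if importlib.util.find_spec(n) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages: " + ", ".join(missing))
    else:
        print("All listed packages are importable.")
```

This only checks that the modules can be found; it does not check that their versions match requirement.txt.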

– Prepare the dataset:

Download the ModelNet40 dataset from here

Render 2D images with Blender (we use code from here)

Save the images, and arrange the dataset directory as follows:

/MVCNN dataset/class folders/train(or test)/rendered images
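
To check that the rendered images ended up in the layout shown above, a short script can walk the dataset root and count the files in each class folder's train and test subfolders. This is a sketch under assumptions: the function name `summarize_dataset` is hypothetical, and it assumes every file inside a split folder is a rendered image.

```python
from pathlib import Path


def summarize_dataset(root):
    """Count files under root/<class>/<train|test>/ for every class folder."""
    summary = {}
    for class_dir in sorted(Path(root).iterdir()):
        if not class_dir.is_dir():
            continue  # skip stray files at the dataset root
        counts = {}
        for split in ("train", "test"):
            split_dir = class_dir / split
            if split_dir.is_dir():
                counts[split] = sum(1 for p in split_dir.iterdir() if p.is_file())
            else:
                counts[split] = 0  # missing split folder
        summary[class_dir.name] = counts
    return summary


if __name__ == "__main__":
    import sys

    # Usage: python check_dataset.py <dataset root>
    if len(sys.argv) > 1:
        for name, counts in summarize_dataset(sys.argv[1]).items():
            print(name, counts)
```

A class that reports 0 for either split most likely has a misplaced or misnamed folder.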

– Prepare the feature extractor (VGG-M) and extract the features:

Clone the code from xxx and put it under the root directory

As the feature extractor, we use a VGG-M pretrained on ImageNet and then fine-tuned on ModelNet40. We adopt Su’s MVCNN code to fine-tune the model. Due to the size limit on uploaded files, we cannot provide a pretrained VGG-M model in the supplementary materials directly. Instead, we describe how to train VGG-M on ModelNet40:

cd feature_extractor
python train_mvcnn.py -name xxx -cnn_name vggm -train_path xxx -val_path xxx
For example:
python train_mvcnn.py -name MVCNN -cnn_name vggm -train_path /Modelnet40_dataset_rendered_images/*/train -val_path /Modelnet40_dataset_rendered_images/*/test

Extract features with the trained VGG-M:

python mvcnn_save_feature.py -name xxx -cnn_name vggm
Before running it, set the path of your pretrained model and the save directory in mvcnn_save_feature.py. The script then saves the extracted features.

– To train our VMM on ModelNet40 with default settings:

Clone this repo into the root directory and name it VMM

Specify the path where the extracted features are saved, and run:

cd VMM
CUDA_VISIBLE_DEVICES=0 python train_nem.py -name NEM -train_pth xxx -val_path xxx
You can then monitor the training process with TensorBoard. The log file and the trained model can be found at VMM/exp/NEM

– To try different settings, you can:

specify the number of latent views, for example, set it to 3 with “-cluster_n 3”; specify the number of iterations, for example, set it to 10 with “-iter 10”; for more settings, refer to VMM/train_nem.py
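
When comparing several combinations of these two settings, it may be convenient to generate the command lines instead of typing them by hand. The sketch below only builds and prints the commands; it does not launch training. The function name `build_commands` and the run-name pattern are hypothetical, and the flag spellings (`-train_pth`, `-val_path`, `-cluster_n`, `-iter`) are taken from the commands in this README as assumptions, so verify them against train_nem.py before use.

```python
import itertools
import shlex


def build_commands(cluster_values, iter_values, train_path, val_path):
    """Return one train_nem.py command line per (latent views, iterations) pair."""
    commands = []
    for n_views, n_iter in itertools.product(cluster_values, iter_values):
        args = [
            "python", "train_nem.py",
            # Hypothetical run name so each setting logs to its own folder.
            "-name", "NEM_c{}_i{}".format(n_views, n_iter),
            "-train_pth", train_path,    # flag spelling as in the command above
            "-val_path", val_path,
            "-cluster_n", str(n_views),  # assumed flag spelling
            "-iter", str(n_iter),
        ]
        # Quote each argument so paths with spaces survive the shell.
        commands.append(" ".join(shlex.quote(a) for a in args))
    return commands


if __name__ == "__main__":
    # "xxx" stands for the feature paths, as elsewhere in this README.
    for cmd in build_commands([3, 4], [5, 10], "xxx", "xxx"):
        print("CUDA_VISIBLE_DEVICES=0 " + cmd)
```

Run the printed lines from inside the VMM directory, one at a time, so that each run can use the whole GPU.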

References
1. Su, J.C., Gadelha, M., Wang, R., Maji, S.: A deeper look at 3D shape classifiers. In: The European Conference on Computer Vision (ECCV) Workshops (September 2018)
