The code for the following paper:
- Hengcan Shi, Hongliang Li, Qingbo Wu, Zichen Song, “Scene Parsing via Integrated Classification Model and Variance-Based Regularization”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach, the U.S. June 16-20, 2019. PDF
@InProceedings{Shi_2019_CVPR,
author = {Shi, Hengcan and Li, Hongliang and Wu, Qingbo and Song, Zichen},
title = {Scene Parsing via Integrated Classification Model and Variance-Based Regularization},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2019}
}
Existing deep-learning-based methods usually use one general classifier to recognize all object categories. However, the general classifier easily makes some mistakes in dealing with some confusing categories that share similar appearances or semantics. In this paper, we propose an integrated classification model and a variance-based regularization to achieve more accurate classifications. On the one hand, the integrated classification model contains multiple classifiers, not only the general classifier but also a refinement classifier to distinguish the confusing categories. On the other hand, the variance-based regularization differentiates the scores of all categories as large as possible to reduce misclassifications. Specifically, the integrated classification model includes three steps. The first is to extract the features of each pixel. Based on the features, the second step is to classify each pixel across all categories to generate a preliminary classification result. In the third step, we leverage a refinement classifier to refine the classification result, focusing on differentiating the high-preliminary-score categories. An integrated loss with the variance-based regularization is used to train the model.
- Install Matlab (The version lower than 2014b is highly recommended. Higher versions may be incompatible with Caffe.)
- Install my modified Caffe and matcaffe (Download)
- Download the datasets (NYU_Depth_v2)
- Please uncompress the datasets and put them in folder
./data
- Download models (initialization_model,trained_on_NYU_Depth_v2)
- Please put models in folder
./models
- Open the file
./net_train.m
- Modify the config, including the caffe path, opts.dataset, opts.useGpus and others.
- Run the file
./net_train.m
- If you want to train on your own datasets, please modify files to fit your dataset and run the file
./generate_bicls_label.m
to generate labels for the binary classification.
- Open the file
./net_test.m
- Modify the config, including the caffe path, opts.dataset, opts.useGpus and others.
- Run the file
./net_test.m
Thanks the evaluation code from sceneparsing!
shihengcan AT gmail DOT com