Scene recognition combining structural and textural features  被引量:7

Scene recognition combining structural and textural features

在线阅读下载全文

作  者:ZHOU Li HU DeWen ZHOU ZongTan 

机构地区:[1]Department of Automatic Control, College of Mechatronics and Automation, National University of Defense Technology

出  处:《Science China(Information Sciences)》2013年第7期221-234,共14页中国科学(信息科学)(英文版)

基  金:supported by the National Natural Science Foundation of China (Grant Nos. 60736018, 60835005);the National Basic Research Program of China (Grant No. 2007CB311001);the New Century Excellent Talents in University (Grant No. NCET-08-0147);the Hunan Provincial Innovation Team Project

摘  要:Automatic recognition of the contents of a scene is an important issue in the field of computer vision. Although considerable progress has been made, the complexity of scenes remains an important challenge to computer vision research. Most previous approaches for scene recognition are based on the so-called "bag of visual words" model, which uses clustering methods to quantize numerous local region descriptors into a codebook. The size of the codebook and the selection of initial clustering centers greatly affect the performance. Furthermore, the large size of the codebook leads to high computational costs and large memory consumption. To overcome these weaknesses, we present an unsupervised natural scene recognition approach that is not based on the "bag of visual words" model. This approach constructs multiple images of different resolutions and extracts structural and texturM features from these images. The structural features are represented by weighted histograms of the gradient orientation descriptor, which is presented in this paper, and the textural features are represented by filter responses of Gabor filters and a Schmid set. We regard the structural and textural features as two independent feature channels, and combine them to realize automatic categorization of scenes using a support vector machine. We then evaluated our approach using three commonly used datasets with various scene categories. Our experiments demonstrate that the weighted histograms of the gradient orientation descriptor outperform the classical scMe invariant feature transform descriptor in natural-scene recognition, and our approach achieves good performance with respect to current state-of-the-art methods.Automatic recognition of the contents of a scene is an important issue in the field of computer vision. Although considerable progress has been made, the complexity of scenes remains an important challenge to computer vision research. Most previous approaches for scene recognition are based on the so-called "bag of visual words" model, which uses clustering methods to quantize numerous local region descriptors into a codebook. The size of the codebook and the selection of initial clustering centers greatly affect the performance. Furthermore, the large size of the codebook leads to high computational costs and large memory consumption. To overcome these weaknesses, we present an unsupervised natural scene recognition approach that is not based on the "bag of visual words" model. This approach constructs multiple images of different resolutions and extracts structural and texturM features from these images. The structural features are represented by weighted histograms of the gradient orientation descriptor, which is presented in this paper, and the textural features are represented by filter responses of Gabor filters and a Schmid set. We regard the structural and textural features as two independent feature channels, and combine them to realize automatic categorization of scenes using a support vector machine. We then evaluated our approach using three commonly used datasets with various scene categories. Our experiments demonstrate that the weighted histograms of the gradient orientation descriptor outperform the classical scMe invariant feature transform descriptor in natural-scene recognition, and our approach achieves good performance with respect to current state-of-the-art methods.

关 键 词:scene recognition structural feature textural feature feature combination weighted histograms of gradient orientation descriptor 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术] TP311.1[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象