残差增强的图像描述符  被引量:3

Residual Enhanced Image Descriptor

在线阅读下载全文

作  者:魏本昌 郑丽 管涛[2] Wei Benchang;Zheng Li;Guan Tao(School of Electrical and Information Engineering,Hubei University of Automotive Technology,Shiyan 442002;School of Computer Science & Technology,Huazhong University of Science and Technology,Wuhan 430074)

机构地区:[1]湖北汽车工业学院电气与信息工程学院,十堰442002 [2]华中科技大学计算机科学与技术学院,武汉430074

出  处:《计算机辅助设计与图形学学报》2019年第6期1039-1045,共7页Journal of Computer-Aided Design & Computer Graphics

基  金:国家自然科学基金(61272202);湖北汽车工业学院博士基金(BK201603)

摘  要:针对增大视觉码书在提高图像全局描述符——局部特征聚合描述符(VLAD)精度的同时会增加VLAD存储开销的问题,提出一种基于2层结构层次视觉码书生成残差增强的图像全局描述符EVLAD.离线码书生成阶段,首先通过K-means算法生成第1层视觉码书,然后基于量化残差最小化原则非均匀地生成第2层各视觉子码书.在线EVLAD生成阶段,图像局部特征首先面向细粒度的第2层视觉子码书生成量化残差;然后面向第1层视觉码书进行聚集生成各子向量,EVLAD即为各子向量的串联结果,为了抑制特征空间爆发现象,各子向量和串联结果分别进行了L2归一化.实验结果表明,EVLAD精度优于VLAD和其他各种改进方法.For VLAD, higher search accuracy will be obtained by increasing the size of visual codebook, but more memory usage is entailed. To solve the contradiction between search quality and memory usage, a global image descriptor called EVLAD, aggregating finer residual by use of hierarchical visual codebook with two-layer structure, is proposed. In the offline preprocessing stage, firstly, the first layer visual codebook is learned with K-means in the local descriptor space, and then each visual sub-codebook of the second layer is generated non-uniformly based on the quantization residual minimization criterion. In the online generation stage, the idea of EVLAD is associating the residual generation and accumulation process to different layer visual words, i.e., for a local descriptor, the residual, which is generated by subtracting the second layer nearest visual word from the local descriptor, is summed to a vector corresponding to one of the first layer visual word, and then EVLAD is the concatenation of all vector. In order to suppress the burst phenomena in feature space, L2 -normalization is employed for each subsector and the final concatenation vector. The experimental result shows our EVLAD outperforms VLAD and other modified strategies.

关 键 词:图像描述符 层次视觉码书 L2归一化 积量化 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象