Multimodal Fusion Hash Learning Method Based on Relaxed Hadamard Matrix

Cited by: 2


Authors: YU Jun; HUANG Wei; ZHANG Xiao-bo; YIN He-feng (The College of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou, Henan 450000, China; The School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, Jiangsu 214000, China)

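Affiliations: [1] College of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou, Henan 450000, China; [2] School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, Jiangsu 214000, China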

Source: Acta Electronica Sinica (《电子学报》), 2022, No. 4, pp. 909-920 (12 pages)

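Funding: Henan Province Science and Technology Research Program (No. 222102210064); Doctoral Research Start-up Fund of Zhengzhou University of Light Industry (No. 2021BSJJ025); National Natural Science Foundation of China (No. 61902361).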

Abstract: Hashing, as an effective data representation technique, plays an important role in coping with the explosive growth of multimedia data. Owing to its low storage cost and high efficiency, it has attracted increasing attention in multimedia retrieval. Multimodal hashing methods have been widely studied for multimedia retrieval tasks. However, most of these methods preserve the structural information of the original data by reconstructing pairwise similarity from the inner products of hash codes, which leads to a complex optimization problem. In addition, some models lack discriminative ability, which limits further gains in retrieval performance. To overcome these problems, this paper proposes a new multimodal fusion hashing method. Under the supervision of category information, a Hadamard matrix is used to generate target codes for the data; relaxing the strict binary constraints enlarges the inter-class margin, while graph embedding promotes intra-class compactness. The proposed method thus ensures strong discriminative ability and also simplifies the optimization process. Experimental results on three public datasets show that the method is effective for multimedia data retrieval, improving average performance by 8.47% over the best competing method.
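
The abstract does not spell out how the Hadamard-based target codes are built, so the following is only a minimal sketch of one common way to do it, not the authors' implementation. It assumes single-label data, a code length that is a power of two, Sylvester's construction for the Hadamard matrix, and one matrix row per class; the function names `sylvester_hadamard` and `class_target_codes` are hypothetical. The relaxation of the binary constraints and the graph embedding term described in the abstract are not shown.

```python
import numpy as np


def sylvester_hadamard(n: int) -> np.ndarray:
    """Return an n x n Hadamard matrix with entries +1/-1.

    Uses Sylvester's construction, so n must be a power of two.
    """
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    H = np.array([[1]], dtype=np.int8)
    while H.shape[0] < n:
        # Double the size: [[H, H], [H, -H]] is again a Hadamard matrix.
        H = np.block([[H, H], [H, -H]])
    return H


def class_target_codes(num_classes: int, code_length: int) -> np.ndarray:
    """Pick one Hadamard row per class as its target code (assumed scheme)."""
    if num_classes > code_length:
        raise ValueError("need at least as many rows as classes")
    return sylvester_hadamard(code_length)[:num_classes]


# Toy example: 3 classes, 8-bit codes, 4 labeled samples.
labels = np.array([0, 2, 1, 2])
centers = class_target_codes(num_classes=3, code_length=8)
targets = centers[labels]  # each sample inherits its class's target code

# Distinct Hadamard rows are orthogonal, so any two class codes differ
# in exactly code_length / 2 bits.
hamming = int((centers[0] != centers[1]).sum())
```

Because distinct rows of a Hadamard matrix are mutually orthogonal, every pair of class codes sits at the same Hamming distance (half the code length), which is what makes such rows attractive as well-separated class targets.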

Keywords: hash learning; multimodal fusion; Hadamard matrix; multimedia retrieval; hash centers

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]

 
