基于胶囊网络的文本数据真值发现  被引量:1

Truth Discovery of Text Data Based on Capsule Network

在线阅读下载全文

作  者:陶嘉庆 樊树海[1] 曹建军 常宸 TAO Jia-qing;FAN Shu-hai;CAO Jian-jun;CHANG Chen(Nanjing University of Technology,Nanjing Jiangsu 210009,China;The Sixty-third Research Institute,National University of Defense Technology,Nanjing Jiangsu 210007,China;Army Engineering University,Nanjing Jiangsu 210007,China)

机构地区:[1]南京工业大学,江苏南京210009 [2]国防科技大学第六十三研究所,江苏南京210007 [3]陆军工程大学,江苏南京210007

出  处:《计算机仿真》2023年第1期410-417,538,共9页Computer Simulation

基  金:国家自然科学基金(61371196,71671089);中国博士后科学基金项目(20090461425,201003797);国家重大科技专项(2015ZX01040201-003);江苏省研究生实践创新计划项目(SJCX21_0416,SJCX21_0417)。

摘  要:为解决传统真值发现算法无法提取文本数据关键语义信息的问题,提出一种基于胶囊网络的文本数据真值发现算法(Truth Discovery of Text Data Based on Capsule Network,Caps-Truth),对传统卷积神经网络(Convolutional Neural Network,CNN)进行改进,在神经网络模型中构造语义胶囊层替代CNN池化层表征文本语义信息。首先通过CNN卷积层获取文本数据全局特征,利用初级胶囊层将特征信息向量化,再通过语义胶囊层表征文本数据细粒度语义信息,将特征向量输入全连接神经网络挖掘文本数据可信度并获得可靠答案。上述算法在真值发现中引入胶囊网络,利用动态路由算法整合零散语义,有效提高了文本数据真值发现的效果。实验结果表明,Caps-Truth算法优于对比算法。In order to solve the problem that the traditional truth value discovery algorithm cannot extract the key semantic information of text data,this paper proposes a truth discovery algorithm bases on Capsule Network(Caps-Truth).We improve the traditional convolutional neural network and add a semantic capsule layer in the network in place of the pooling layer to extract semantic information.Firstly,global features were extracted by the convolutional layer and then features were vectorized by the primary capsule layer.Secondly,the fine-grained semantic information was obtained through the semantic capsule layer.Finally,the vectors were input into the fully connected neural network to discover the truth value and mine the credibility of text data.This algorithm introduces the capsule network into truth discovery,and the dynamic routing algorithm is used to integrate scattered semantics.Through experimental verification,demonstrating our method can effectively improve the effect of truth discovery and the results are superior to the comparison algorithm.

关 键 词:数据质量 神经网络 胶囊网络 文本数据 真值发现 

分 类 号:TP311[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象