检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:高云龙 吴川[1] 朱明[1] GAO Yunlong;WU Chuan;ZHU Ming(Changchun Institute of Optics,Fine Mechanics and Physics,Chinese Academy of Science,Changchun 130033,China;Key Laboratory of Airborne Optical Imaging and Measurement,Chinese Academy of Sciences,Changchun 130033,China)
机构地区:[1]中国科学院长春光学精密机械与物理研究所,长春130033 [2]中国科学院航空光学成像与测量重点实验室,长春130033
出 处:《吉林大学学报(理学版)》2020年第4期923-930,共8页Journal of Jilin University:Science Edition
基 金:国家自然科学基金(批准号:61401425);吉林省科技发展计划项目(批准号:20200571505JH).
摘 要:基于卷积神经网络,提出一种基于改进卷积神经网络的短文本分类模型.首先,采用不同编码方式将短文本映射到不同空间下的分布式表示,提取不同粒度的数字特征作为短文本分类模型的多通道输入,并根据标准知识库提取概念特征作为先验知识,提高短文本的语义表征能力;其次,在全连接层增加自编码学习策略,在近似恒等的基础上进一步组合数字特征,模拟数据内部的关联性;最后,利用相对熵原理为模型增加稀疏性限制,降低模型复杂度的同时提高模型的泛化能力.通过对开源数据集进行短文本分类实验,验证了模型的有效性.We proposed a short text classification model based on improved convolutional neural network.Firstly,different coding methods were used to map short text to distributed representation in different spaces,and digital features of different granularities were extracted as multi-channel inputs of short text classification model.Extracting concept features from standard knowledge base as prior knowledge to improve the semantic representation ability of short text.Secondly,the self-coding learning strategy was added to the full connection layer,on the basis of approximate identity,the digital features were further combined to simulate the relevance within the data.Finally,the principle of relative entropy were used to increase the sparsity limit of the model,reduce the complexity and improve the generalization ability of the model.The effectiveness of the proposed model was verified by short text classification experiments on the open source dataset.
关 键 词:卷积神经网络 短文本 概念分布式表示 稀疏 自编码
分 类 号:TP181[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.117