Authors: KANG Yan [1], LI Hao [1], LIANG Wentao, NING Haoyu, HUO Wen
Affiliation: [1] School of Software, Yunnan University, Kunming 650500, China
Source: Computer Engineering and Applications (《计算机工程与应用》), 2020, No. 7, pp. 205-209 (5 pages)
Funding: National Natural Science Foundation of China (No. 61762092, No. 61762089); Open Foundation of the Key Laboratory of Software Engineering of Yunnan Province (No. 2017SE204).
Abstract: Deep learning methods for text tend to rely on a single form of text representation and struggle to exploit fine-grained features across a corpus. To address this shortcoming, a new ensemble model, textSE-ResNeXt, is proposed that extracts features from Chinese and English corpora differently, according to the distinct characteristics of each language. Explicit relations in the corpus are analyzed with the PDTB corpus so that the main sentiment-bearing part of each text can be extracted. Sentiment degrees are then divided according to separate Chinese and English sentiment lexicons, yielding sub-datasets of different sentiment degrees. The textSE-ResNeXt neural network adopts a dynamic convolution kernel strategy to extract text features more effectively, and it combines SENet and ResNeXt to extract and classify deep text features. A voting ensemble of textSE-ResNeXt models over the subsets of different sentiment degrees further improves classification efficiency. Experiments are conducted on a Chinese hotel review corpus and six common English classification datasets, and the results show that the model is effective.
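Keywords: text sentiment classification; textSE-ResNeXt; feature division; ensemble model
Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]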
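
The abstract mentions a voting ensemble over models associated with different sentiment-degree subsets but does not say which voting scheme is used. The following is a minimal sketch of plain hard (majority) voting, assuming each sub-model has already produced one label per sample. The function name `majority_vote` and the tie-breaking rule are illustrative assumptions, not the authors' implementation.

```python
from collections import Counter


def majority_vote(predictions):
    """Combine per-model label lists by hard majority voting.

    predictions: list of lists, one inner list of labels per model
    (hypothetical input format; every model must label every sample).
    Ties go to the label that appears first in model order, which is
    an assumption made for this sketch.
    """
    if not predictions:
        raise ValueError("need predictions from at least one model")
    n_samples = len(predictions[0])
    if any(len(p) != n_samples for p in predictions):
        raise ValueError("all models must predict the same number of samples")

    voted = []
    # zip(*predictions) groups the labels that all models gave to one sample.
    for labels in zip(*predictions):
        counts = Counter(labels)
        # most_common keeps first-seen order among equal counts.
        voted.append(counts.most_common(1)[0][0])
    return voted


# Three hypothetical sub-models voting on three samples (1 = positive).
model_outputs = [
    [1, 0, 1],
    [1, 1, 0],
    [0, 1, 1],
]
ensemble_labels = majority_vote(model_outputs)
```

Hard voting is only one option; a weighted or probability-averaging scheme would fit the same description, and the record does not say which one the paper uses.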