中文文本情感主题句分析与提取研究  被引量:10

Extraction of sentiment topic sentences of Chinese texts

在线阅读下载全文

作  者:樊娜[1] 蔡皖东[1] 赵煜[1] 李慧贤[1] 

机构地区:[1]西北工业大学计算机学院,西安710072

出  处:《计算机应用》2009年第4期1171-1173,1176,共4页journal of Computer Applications

基  金:西安电子科技大学计算机网络与信息安全教育部重点实验室开放基金资助项目(2008CNIS-07)

摘  要:提出一种提取中文文本情感主题句子的方法。首先评估文本中语义概念的概括和归纳能力,确定文本主题概念。将包含主题概念的句子作为候选主题句子,计算各个候选句子的重要度,最终确定文本主题句。然后采用条件随机场模型,选取情感倾向特征和转移词特征训练模型,从文本主题句集合中提取情感主题句。实验证明,以提出的方法为基础进行文本情感分析,避免了与主题无关的句子对分析结果的影响,有效地提高了文本情感分析的准确率。This paper proposed a method of extracting sentiment topic sentences. Firstly semantic concepts of a text were evaluated in order to determine which concepts were related to the topic of a text. And the concepts related to the topic were regarded as topic concepts. Sentences including one or more topic concepts were defined as candidate sentences. Significance of every candidate sentences was calculated in order to which ones were topic sentences in the text. Conditional random field model was adopted and two kinds of feature were used in the model training, and one feature was polarity of sentiment and the other feature was transferring words. This approach excluded sentences that were not related to the topic of the text, and eliminated the influence brought by these sentences. Therefore, precision of sentiment analysis is effectively improved.

关 键 词:情感分析 主题概念 条件随机场 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象