基于话题和修辞识别的阅读理解why型问题回答  被引量:9

Why-Questions Answering for Reading Comprehension Based on Topic and Rhetorical Identification

在线阅读下载全文

作  者:张志昌[1,2] 张宇[1] 刘挺[1] 李生[1] 

机构地区:[1]哈尔滨工业大学计算机学院信息检索研究中心,哈尔滨150001 [2]西北师范大学数学与信息科学学院,兰州730070

出  处:《计算机研究与发展》2011年第2期216-223,共8页Journal of Computer Research and Development

基  金:国家"八六三"高技术研究发展计划基金项目(2006AA01Z145);国家自然科学基金项目(60736044;60675034)

摘  要:针对阅读理解问答中的why型问题,提出基于问题话题和话题间因果修辞关系识别的答案句抽取方法.抽取时利用机器学习方法,选择可识别出对应问题话题的句子特征、问题话题与句子上下文之间因果关系特征,对篇章内的句子按照成为答案句的概率进行排序.对应问题话题的句子识别利用基于idf和语义角色的相似度;因果修辞关系的识别利用线索短语、特定语义角色、从文档集中挖掘的词间蕴含的因果关系概率信息、句子上下文的位置与表达形式.Remedia语料上的实验结果表明,该方法明显提高了why型问题回答的性能.As an important branch in the study of question answering system,automatic reading comprehension(RC) system involves reading a short passage of text and answering a series of questions pertaining to that text.In all question types including who,what,when,where,why studied in the field of RC,answer extraction of why-question should apply the discourse structure information of text and the answer is not an named entity.Concerning these difference of why-question with other types,an answer sentence extraction approach for why-question of reading comprehension is given in this paper based on question topic and causal rhetorical relation identification.It uses machine learning model to rank sentences in text according to their probabilities of becoming answer sentence.In the model,two kinds of feature are used for identification of text sentence corresponding to question topic and that of causal rhetorical relation between question topic and sentence context respectively.In all features,the idf and semantic role similarity features are utilized to identify the sentence corresponding to the question topic,and other features,including cue phrases,special semantic roles,causal relation entailment probabilities between words mined from large scale document collections,position and expression format of sentence context,are used to identify causal rhetorical relation.Experimental results on Remedia corpus show that the method improves significantly the performance of reading comprehension why-question answering.

关 键 词:why型问题 话题 修辞关系 答案抽取 阅读理解 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象