Answer selection model based on LSTM and decay self-attention

Cited by: 1

Authors: CHEN Qiao-hong[1], LI Fei-yu, SUN Qi[1], JIA Yu-bo[1]

Affiliation: [1] School of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, Zhejiang, China

Source: Journal of Zhejiang University: Engineering Science, 2022, No. 12, pp. 2436-2444 (9 pages)

Funding: Zhejiang Sci-Tech University funding program for training young and middle-aged key talents.

Abstract: To address the insufficient extraction of sentence features and of related semantic information between sentences in answer selection, an answer selection model based on long short-term memory (LSTM) and decay self-attention (DALSTM) is proposed, built on the LSTM network. DALSTM uses an LSTM and a decay self-attention encoding layer to extract rich contextual semantic information, and uses a decay matrix to alleviate the problem that attention weights become overly concentrated on keywords when the attention mechanism is applied repeatedly. An attention mechanism performs bidirectional interaction between the question and the answer, fuses the similarity features of each question-answer pair, and enriches the related semantic information between them. Evaluation on the WikiQA, TrecQA, and InsuranceQA datasets shows that DALSTM performs better overall than other advanced BiLSTM-based models, with the mean reciprocal rank (MRR) reaching 0.757, 0.871, and 0.743 on the three datasets, respectively.
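The abstract reports results as mean reciprocal rank (MRR). As a reminder of what that metric measures, the following is a minimal sketch of the standard MRR definition, not the authors' evaluation script. The function name `mean_reciprocal_rank`, the input layout, and the choice to score a question with no correct candidate as 0 are all assumptions made for illustration; published evaluations on these datasets sometimes exclude such questions instead.

```python
from typing import Sequence


def mean_reciprocal_rank(ranked_labels: Sequence[Sequence[int]]) -> float:
    """Compute MRR over a set of questions.

    ranked_labels[i] holds the 0/1 relevance labels of the candidate
    answers for question i, already sorted by model score (best first).
    Assumption: a question with no correct candidate contributes 0.
    """
    if not ranked_labels:
        return 0.0

    total = 0.0
    for labels in ranked_labels:
        # Find the 1-based rank of the first correct answer, if any.
        for rank, label in enumerate(labels, start=1):
            if label == 1:
                total += 1.0 / rank
                break
    return total / len(ranked_labels)


# Three questions whose first correct answers sit at ranks 2, 1, and 3.
example = [[0, 1, 0], [1, 0], [0, 0, 1]]
print(round(mean_reciprocal_rank(example), 3))  # (1/2 + 1 + 1/3) / 3 ≈ 0.611
```

Under this definition, an MRR of 0.871 means that, averaged over questions, the reciprocal of the rank of the first correct answer is 0.871, so the first correct answer usually appears at or very near the top of the ranked list.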

Keywords: question answering (QA); answer selection; long short-term memory (LSTM); decay self-attention; attention mechanism

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

 
