问答模式下结合属性语义的实体属性抽取研究  

Research on Entity Attribute Extraction Combined with Attribute Semantics in Question Answering Mode

在线阅读下载全文

作  者:常露予 张晓滨[1] CHANG Lu-yu;ZHANG Xiao-bin(School of Computer Science,Xi’an Polytechnic University,Xi’an 710048,China)

机构地区:[1]西安工程大学计算机科学学院,陕西西安710048

出  处:《计算机技术与发展》2024年第4期174-179,共6页Computer Technology and Development

基  金:陕西省自然科学基础研究计划项目(2023-JC-YB-568)。

摘  要:实体属性抽取任务中常面临属性标签过多时模型存在爆炸风险的问题,且目前大多数属性抽取模型对文本均分配一致的注意力因子,未将上下文的变化考虑在内。为解决上述问题,提出一种基于问答模式的结合属性语义的实体属性抽取方法。该方法的要点在于,将文本看作上下文,把属性视为查询,从上下文中提取到的答案等同于期望的属性值。文中对文本和属性的语义表示进行建模,并提出一个动态注意力机制用于捕捉二者间的语义交互、实现信息融合,同时自适应地控制属性信息融入文本向量的程度。为了验证该方法的有效性,将模型与目前广泛应用的BiLSTM模型、BiLSTM-CRF模型、OpenTag模型和Open Tagging模型在包含大量属性标签的数据集AE-110K、AE-650K上进行对比实验,结果表明,模型在结合属性语义信息且采用动态Attention的条件下,其预测准确度、召回率和F1值更高。Entity attribute extraction tasks often face the problem of model explosion risk when there are too many attribute labels,and at present,most attribute extraction models assign consistent attention factors to texts,and do not take context changes into account.To solve the above problems,an entity attribute extraction method based on question-answering mode combined with attribute semantics is proposed.The key point of this method is that the text is regarded as the context,the attribute is regarded as the query,and the answer extracted from the context is equivalent to the expected attribute value.The semantic representation of text and attributes is modeled,and a dynamic attention is proposed to capture the semantic interaction,realize information fusion,and adaptively control the degree to which attribute information is integrated into the text vector.In order to verify the effectiveness of the proposed method,the model was compared with the currently widely used models such as BiLSTM,BiLSTM-CRF,OpenTag and Open Tagging on datasets AE-110K and AE-650K containing a large number of attribute tags.It is showed that under the condition of combining attribute semantic information and adopting dynamic Attention,the model has higher prediction accuracy,recall rate and F1 value.

关 键 词:问答模式 实体属性抽取 动态注意力 语义交互 信息融合 

分 类 号:TP398.1[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象