故事启发大语言模型的时序知识图谱预测  被引量:1

Narrative-Driven Large Language Model for Temporal Knowledge Graph Prediction

在线阅读下载全文

作  者:陈娟 赵新潮 隋京言 祁麟 田辰 庞亮[1,2] 方金云 CHEN Juan;ZHAO Xinchao;SUI Jingyan;QI Lin;TIAN Chen;PANG Liang;FANG Jinyun(Prospective Research Laboratory,Institute of Computing Te-chnology,Chinese Academy of Sciences,Beijing 100190;School of Computer Science and Technology,University of Chinese Academy of Sciences,Beijing 100049;Landinn Center,Big Data Academy,Zhongke,Zhengzhou 450046;School of Software Engineering,Beijing Jiaotong University,Beijing 100044)

机构地区:[1]中国科学院计算技术研究所前瞻研究实验室,北京100190 [2]中国科学院大学计算机科学与技术学院,北京100049 [3]中科大数据研究院兰亭中心,郑州450046 [4]北京交通大学软件学院,北京100044

出  处:《模式识别与人工智能》2024年第8期715-728,共14页Pattern Recognition and Artificial Intelligence

摘  要:时序知识图谱海量稀疏,实体的长尾分布导致对分布外实体的推理泛化性较差,历史交互低频导致对未来事件的预测偏差较大.为此,文中提出故事启发大语言模型的时序知识图谱预测方法,利用大语言模型的世界知识储备和复杂语义推理能力,增强对分布外实体的理解和交互稀疏事件的关联.首先,根据时序知识图谱中时间和结构的特性筛选“关键事件树”,通过历史事件筛选策略提炼最具代表性的事件,并摘要当前查询相关的历史信息,减少数据输入量并保留最重要的信息.然后,微调大语言模型生成器,生成时序语义关联且符合逻辑的“关键事件树”叙事故事,作为非结构化输入.在生成过程中,特别关注事件之间的因果关系和时间顺序,确保生成的故事具有连贯性和合理性.最后,利用大语言模型推理器推理缺失的时序实体.在3个公开数据集上的实验表明,文中方法可充分发挥大模型的能力,完成精准的时序实体推理.The temporal knowledge graph(TKG)is characterized by vast sparsity,and the long-tail distribution of entities leads to poor generalization in reasoning for out-of-distribution entities.Additionally,the low infrequency of historical interactions results in biased predictions for future events.Therefore,a narrative-driven large language model for TKG Prediction is proposed.The world knowledge and complex semantic reasoning capabilities of large language models are leveraged to enhance the understanding of out-of-distribution entities and the association of sparse interaction events.Firstly,a key event tree is selected based on the temporal and structural characteristics of TKG,and the most representative events are extracted through a historical event filtering strategy.Relevant historical information is summarized to reduce input data while the most important information is retained.Then,the large language model generator is fine-tuned to produce logically coherent"key event tree"narratives as unstructured input.During the generation process,special attention is paid to the causal relationships and temporal sequences of events to ensure the coherence and rationality of the generated stories.Finally,the large language model is utilized as a reasoner to infer the missing temporal entities.Experiments on three public datasets demonstrate that the proposed method effectively leverages the capabilities of large models to achieve more accurate temporal entity reasoning.

关 键 词:时序知识图谱(TKG) 大语言模型 关键事件树 时序故事 事件推理 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象