大语言模型驱动的油气田勘探开发数据智能检索方法  

Method for Intelligent Retrieval of Exploration and Development Data of Oil and Gas Fields Driven by Large Language Model

在线阅读下载全文

作  者:王娟 梁倩 王磊 方茹佳 王嘉乾 WANG Juan;LIANG Qian;WANG Lei;FANG Rujia;WANG Jiaqian(Digital and Intelligent Business Unit,Petro China Changqing Oil Field Company,Xi’an 710018,China;School of Cyber Engineering,Xidian University,Xi’an 710071,China)

机构地区:[1]长庆油田分公司数字和智能化事业部,西安710018 [2]西安电子科技大学网络与信息安全学院,西安710071

出  处:《西安工业大学学报》2024年第6期795-802,共8页Journal of Xi’an Technological University

基  金:国家自然科学基金项目(62172316)。

摘  要:针对自然语言到结构化查询语言(Natural Language to SQL,NL2SQL)问题在油气田勘探开发领域数据检索中的挑战,提出了一种基于大型NLP模型并融合外部知识库的智能数据检索新方法。首先,根据油气田勘探开发的业务场景构建种子数据,为模型训练奠定基础。借助“思维链”策略扩充数据集,提升数据覆盖度和多样性。接着,通过引入低秩适应(Low-Rank Adaptation of Large Language Models,LoRA)算法流程,优化模型在油气田数据检索任务上的表现。同时,整合外部知识库以提高模型对油气田专业数据的预测准确性和鲁棒性。实验结果表明,该方法在油气田勘探开发领域私有数据的检索准确率相较现有技术提高了20%。基于此,开发了一套用户友好的应用系统,具有直观的界面和强大的功能,展示了该研究方法在油气田数据智能检索中的实用性和优越性。In order to meet the challenges that NL2SQL has in querying exploration and development data of oil and gas fields,the paper proposes a novel intelligent retrieval method by employing large NLP models and integrating external knowledge bases.Initially,the seed data is constructed for the training of the model.The dataset is expanded with a ‘thought-chain' strategy to increase data coverage and diversity.Subsequently,the Low-Rank Adaptation of Large Language Models(LoRA) algorithm process is introduced to optimize the model's performance in specific tasks.And also,the external knowledge bases are integrated to enhance the accuracy and robustness of the model in the predictions of specialized oil and gas field data.Experimental results indicate that by this method,compared to existing technologies,the retrieval accuracy of specific private data is improved by 20%.Additionally,a user-friendly application system has been developed,featuring a intuitive interface and robust functionality,which demonstrates the research method is practical and superior in intelligent retrieval of oil and gas field data.

关 键 词:NL2SQL 油气田勘探开发 低秩适应(LoRA) 外部知识库 思维链 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象