Authors: YAN Liqiang (严李强); WU Yu (吴瑜); Lhakpa-Dondrub (拉巴顿珠); LIANG Weiheng (梁炜恒) (School of Information Science and Technology, Tibet University, Lhasa 850000, China)
Affiliation: [1] School of Information Science and Technology, Tibet University, Lhasa 850000, Tibet, China
Source: Plateau Science Research (《高原科学研究》), 2024, No. 3, pp. 110-116 (7 pages)
Funding: National Natural Science Foundation of China (62406256); Tibet University Graduate High-Level Talent Training Program (2021-GSP-B031, 2022-GSP-S105).
Abstract: As Tibetan digital resources and the demand for them grow, accurately retrieving the information users need has become an important challenge. To address semantic matching between queries and documents in Tibetan retrieval, this paper proposes a Tibetan information retrieval model based on LaBSE. The LaBSE model is first used to extract feature information from Tibetan documents; the query and the extracted features are then fed into the model together. Through pre-training tasks such as masked language modeling and translation language modeling, the model learns deep semantic information about Tibetan syllables in different contexts. Finally, the model is fine-tuned to complete its construction. Experimental results show that the model reaches an accuracy of 93.57%, which is 6.37% higher than that of a BERT-based Tibetan information retrieval model, indicating that it matches queries to Tibetan documents more effectively and offers a reference for accurate retrieval of Tibetan resources.
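The query-document matching step described in the abstract can be illustrated with a minimal sketch. This is not the authors' model: the `embed` function below is a hypothetical stand-in that counts character bigrams, where the paper would use a fine-tuned LaBSE encoder, and the names `embed`, `cosine`, and `rank` are assumptions introduced only for illustration. The sketch shows only the general idea of scoring documents against a query by vector similarity.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Hypothetical stand-in for a sentence encoder such as LaBSE.

    Returns a sparse vector of character-bigram counts. A real system
    would return a dense embedding from the trained model instead.
    """
    padded = f" {text} "
    return Counter(padded[i:i + 2] for i in range(len(padded) - 1))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query: str, documents: list[str]) -> list[tuple[float, str]]:
    """Score every document against the query, highest similarity first."""
    q = embed(query)
    scored = [(cosine(q, embed(doc)), doc) for doc in documents]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    docs = [
        "Tibetan information retrieval",
        "weather forecast for Lhasa",
        "cooking recipes",
    ]
    for score, doc in rank("information retrieval", docs):
        print(f"{score:.3f}  {doc}")
```

Swapping `embed` for a trained multilingual encoder would leave `cosine` and `rank` unchanged, which is why the similarity step is kept separate from the encoding step here.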
Classification code: TP391 [Automation and Computer Technology - Computer Application Technology]