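Authors: DENG Yuanfei, LI Jiawei, JIANG Yuncheng [1,2] (School of Computer Science, South China Normal University, Guangzhou 510631, Guangdong, China; School of Artificial Intelligence, South China Normal University, Foshan 528225, Guangdong, China)
Affiliations: [1] School of Computer Science, South China Normal University, Guangzhou 510631, Guangdong, China; [2] School of Artificial Intelligence, South China Normal University, Foshan 528225, Guangdong, China
Source: Computer Engineering (《计算机工程》), 2024, No. 4, pp. 294-302 (9 pages)
Funding: National Natural Science Foundation of China (61772210, U1911201).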
Abstract: A patent is a legal right granted to inventors to protect their inventions for a limited period, and it plays an important role in present-day social activities. Existing methods, however, have not been adapted or optimized for patent similarity data, so they perform poorly on the patent phrase similarity matching task. Previous research has shown that, in low-resource scenarios, prompt learning takes text fragments (templates) as input and transforms a classification problem into a masked language modeling problem; a key step is constructing a projection between the label space and the label word space. This study proposes a prompt learning method based on knowledge injection and applies it to patent phrase similarity matching. To address the limited information carried by patent phrases, the method exploits the similarity label information of patent phrases and uses external knowledge to enrich both the phrases and the label information. First, entity linking is used to associate patent phrases with external knowledge. Second, a neighborhood information filtering mechanism based on entity influence is designed to alleviate the shortage of patent phrase information. Finally, taking into account how different kinds of external knowledge affect patent phrase similarity calculation, several enhanced prompt texts are designed for patent phrases. Experimental results show that the Pearson correlation coefficient (PCC) and Spearman rank correlation coefficient (SRC) of the proposed method improve by 6.8% and 5.7%, respectively, over the second-best baseline.
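For readers unfamiliar with the two reported metrics, the following is a minimal sketch of how PCC and SRC can be computed between gold similarity scores and model predictions. It is not the authors' evaluation code: the function names `pearson`, `average_ranks`, and `spearman` are hypothetical, the score lists are made-up toy values, and ties are assumed to be handled by average ranks.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient (PCC) between two equal-length lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Covariance and variances, all without the shared 1/n factor.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation (SRC): Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    # Toy values for illustration only, not taken from the paper.
    gold = [0.0, 0.25, 0.5, 0.75, 1.0]
    predicted = [0.1, 0.2, 0.6, 0.7, 0.9]
    print("PCC:", pearson(gold, predicted))
    print("SRC:", spearman(gold, predicted))
```

PCC measures how linearly the predictions track the gold scores, while SRC depends only on whether the predicted ordering matches the gold ordering, which is why the two are commonly reported together for similarity tasks.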
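Keywords: patent phrase; similarity calculation; knowledge injection; prompt learning; prompt text
CLC number: TP391 [Automation and Computer Technology: Computer Application Technology]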