基于BERT-Graphormer的中文专利文本分类模型研究  被引量:1

Research on Chinese Patent Text Classification Model Based on BERT-Graphorme

在线阅读下载全文

作  者:李永忠 吕菲 黄种标 Li Yongzhong;Lu Fei;Huang Zhongbiao(College of Economics and Management,Fuzhou University,Fuzhou FuJian 350000)

机构地区:[1]福州大学经济与管理学院,福建福州350000

出  处:《情报探索》2024年第6期27-33,共7页Information Research

摘  要:[目的/意义]随着专利申请量不断增长和专利分类类别的复杂化,针对提高专利分类工作效率、审查质量、节约人力资源的需求,构建中文专利文本分类模型。[方法/过程]通过微调Graphormer模型对专利标签的结构及信息进行建模,利用建模后的标签信息来增强BERT模型的文本表示。[结果/结论]相对于其他基线模型,该模型的Micro-F1与Macro-F1分数分别提升了1.6%与3.5%。实现了多标签专利的自动分类,通过对标签、文本的信息进行建模、融合,从而进一步提升模型的分类效果。[Purpose/significance]With the increasing number of patent applications and the complexity of patent classification categories,the paper constructs a Chinese patent text classification model based on BERT-Graphormer in order to improve the efficiency of patent classification,review quality and save human resources.[Method/process]The paper models the structure and information of patent labels by fine-tuning the Graphormer model,and enhances the text representation of BERT model by using the modeled label information.[Result/conclusion]Compared with other baseline models,the Micro-F1 and Macro-F1 scores of the proposed model are increased by 1.6%and 3.5%.The study successfully achieved the automatic classification of multi-label patents.The model’s classification effectiveness is significantly improved by modeling and integrating information from labels and text.

关 键 词:专利分类 层次分类 注意力机制 BERT 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象