Research on the Measurement Method and Effect of Technology Convergence Disparity Based on Representation Learning  Cited by: 5


Authors: Lyu Lucheng; Zhao Yajuan [1,2]; Wang Xuezhao; Han Tao [1,2]; Zhao Ping; Zhang Di [1]

Affiliations: [1] National Science Library, Chinese Academy of Sciences, Beijing 100190 [2] Department of Library, Information and Archives Management, School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190

Source: Library and Information Service (《图书情报工作》), 2022, No. 4, pp. 118-128 (11 pages)

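Funding: One of the research outputs of the Chinese Academy of Sciences strategic research special project "Research on the Layout of Basic Research and Key Technology Reserves Supporting the Development of China's Key Industries" (Project No. GHJ-ZLZX-2020-31-5).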

Abstract: [Purpose/significance] Existing studies measure the disparity of technology convergence only at the level of classification codes. They do not reach the technical semantics behind those codes, and they do not compare the effectiveness of different measurement methods. This study therefore develops and compares disparity measurement methods from the perspective of revealing technical semantics, to help improve the methodology. [Method/process] Representation learning can draw on large amounts of prior knowledge to compute semantic differences between research objects. On this basis, the paper proposes disparity measurement methods based on Word2vec and BERT that use the definition text of patent classification codes and the text of the associated patents, yielding six measurement methods in total. The six methods are applied to quadrilateral patents filed in 2019-2020, and their results are compared with existing disparity measures based on the co-occurrence frequency and co-occurrence relationships of classification codes. [Result/conclusion] Using the definition text of classification codes together with the associated patent text, and vectorizing MC classification codes with Word2vec, measures the disparity of technology convergence more effectively than the other schemes and can be applied in future research on technology convergence.
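The abstract does not give the disparity formula, so the following is only a minimal sketch of the general idea: once each classification code has a vector, the disparity of one patent can be approximated as the mean pairwise semantic distance between its codes. The choice of cosine distance, the mean-pairwise aggregation, the function names, and the toy two-dimensional vectors are all assumptions made for illustration; in practice the vectors would come from a Word2vec or BERT model trained or applied on classification-code definitions and associated patent text.

```python
import math
from itertools import combinations


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def convergence_disparity(codes, code_vectors):
    """Mean of (1 - cosine similarity) over all pairs of distinct codes.

    Assumed aggregation, not the paper's stated formula. A patent with
    fewer than two distinct codes gets a disparity of 0.0.
    """
    unique_codes = sorted(set(codes))
    pairs = list(combinations(unique_codes, 2))
    if not pairs:
        return 0.0
    total = sum(
        1.0 - cosine_similarity(code_vectors[a], code_vectors[b])
        for a, b in pairs
    )
    return total / len(pairs)


# Hypothetical toy embeddings standing in for learned code vectors.
code_vectors = {
    "A": [1.0, 0.0],
    "B": [0.0, 1.0],
    "C": [1.0, 1.0],
}

# Orthogonal codes give the largest disparity in this toy setup.
orthogonal = convergence_disparity(["A", "B"], code_vectors)
# Codes pointing in closer directions give a smaller disparity.
closer = convergence_disparity(["A", "C"], code_vectors)
```

Under these assumptions, a patent whose codes are semantically far apart scores higher than one whose codes are close, which is the behavior a semantic disparity measure is meant to capture.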

Keywords: disparity; technology convergence (技术融合, 技术会聚); representation learning; BERT; Word2vec

Classification number: G306 [Culture and Science]

 
