检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:刘燕[1] 孙月萍[1] 侯丽[1] LIU Yan;SUN Yueping;HOU Li(Institute of Medical Information,Chinese Academy of Medical Sciences&Peking Union Medical College,Beijing 100020,China)
机构地区:[1]中国医学科学院/北京协和医学院医学信息研究所,北京100020
出 处:《医学信息学杂志》2022年第12期32-38,共7页Journal of Medical Informatics
基 金:中国工程科技知识中心建设项目“医药卫生专业知识服务系统”(项目编号:CKCEST-2022-1-6);国家社科青年基金项目“基于语义增强的医学学术出版创新融合研究”(项目编号:18CTQ024)。
摘 要:分析中文科技文献中机构著录项的组织特点和中文机构名称的命名特点,详细阐述常见机构名称规范化方法、中文科技文献机构名称规范化处理流程,提出利用字符串匹配词典和规则过滤等方法提取规范化的机构名称,并基于机构-作者共现关系,计算作者共现率,结合绝对共现量和共现率阈值实现机构实体的消歧,能够有效匹配同一机构的不同表现形式。The paper analyzes the organization characteristics of institution description items in Chinese scientific and technical literature and the naming characteristics of Chinese institutions,expounds the common methods of institution name normalization and the process of institution name normalization for Chinese scientific and technical literature,and proposes that the methods of extracting the normalized institution names by using the methods of string matching,dictionary-based and rule-based filtering,calculating the co-occurrence rate of authors based on the co-occurrence relationship between institutions and authors,and disambiguating the institution entities through the number of absolute co-occurrence and the co-occurrence rate threshold,which can effectively match different forms of an institution.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.12.102.204