检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:邓盼盼[1] 孙海霞[1] DEND Pan-pan;SUN Hai-xia(Institute of Medical Information,Chinese Academy of Medical Sciences,Beijing 100020,China)
机构地区:[1]中国医学科学院医学信息研究所,北京100020
出 处:《中华医学图书情报杂志》2020年第1期12-21,共10页Chinese Journal of Medical Library and Information Science
基 金:国家科技图书文献中心专项任务“NSTL统一资源分类体系建设”(2019XM10-03);中国医学科学院医学与健康科技创新工程“中文临床医学术语系统构建研究”(2017-I2M-3-014);NSTL下一代科技创新开放系统先期研发任务“STKOS自动构建与维护关键技术研究”(XQYF0102)。
摘 要:目标:实现缩略语规范控制与管理,减少缩略语歧义造成的文本理解错误。方法:以跨领域、多来源词表集成时缩略语歧义术语控制为目标场景,基于缩略语构词语法特征,制定集成词表系统中缩略语的识别与提取方法;参考ISO 25964标准数据模型,开展缩略语词表层面、概念层、术语层数据模型和元数据方案设计并实现集成词表及缩略语表概念中的规范缩略语、规范全称、其他缩略语及其全称、普通术语的自动识别与术语类型自动标识,以及缩略语概念的规范表达与描述。结果:缩略语表、概念和术语描述的数据模型与元数据方案可快速实现10297个概念、135423个同义术语、121154对广义和窄义关系的缩略语表构建,并对STKOS超级科技词表中缩略语所在概念的全部术语实现了术语类型标识与歧义控制。结论:提出的缩略语规范控制方案和构建策略,一方面可快速标识缩略语及相关术语,实现概念内更细颗粒度的含义揭示与歧义控制;另一方面可快速构建具有丰富语义的缩略语表,并进行规范描述,促进计算机对领域缩略语的语义理解。Objective To reduce the abbreviation ambiguity-induced misunderstanding of texts by implementing the standardized control and management of abbreviations. Methods The identification and extraction methods of abbreviations in integrated thesaurus system were worked out based on their grammatical characteristics with the control of abbreviation ambiguity as its target scenario when the cross-domain and multi-source thesaurus was integrated. The data model of abbreviations at the thesaurus level,concept level and term level was designed and the meta-data plan was formulated according to the ISO25964 standard data model, which can implement the standardized abbreviations and their full names,the other abbreviations and their full names,automatic identification of common terms and term types,standardized expression and description of abbreviation concepts in the integrated thesaurus and abbreviations. Results The data model of abbreviations and description of their concepts and terms could rapidly identify the abbreviations with 10 297 concepts,135 423 synonymous terms,121 154 pairs of broader and narrower relationship,the types of term and control the ambiguity of abbreviations in the STKOS ultra-scientific thesaurus. Conclusion The standardized control plan of abbreviations and its establishing strategies can rapidly identify the abbreviations and their relevant terms,reveal the meanings of more fined particles in the concepts and control the ambiguity of abbreviations on the one hand,and rapidly establish the abbreviations with rich semantics and their standardized description on the other hand,which can thus help computers to understand the semantics in knowledge organization system.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.7