Semantic control and standardization of abbreviations in interoperating knowledge organization systems


Authors: DENG Pan-pan[1], SUN Hai-xia[1] (Institute of Medical Information, Chinese Academy of Medical Sciences, Beijing 100020, China)

Affiliation: [1] Institute of Medical Information, Chinese Academy of Medical Sciences, Beijing 100020, China

Source: Chinese Journal of Medical Library and Information Science, 2020, No. 1, pp. 12-21 (10 pages)

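Funding: National Science and Technology Library (NSTL) special task "Construction of the NSTL Unified Resource Classification System" (2019XM10-03); Chinese Academy of Medical Sciences Innovation Project for Medical and Health Science and Technology, "Research on the Construction of a Chinese Clinical Medical Terminology System" (2017-I2M-3-014); NSTL advance R&D task for the next-generation open system for science and technology innovation, "Research on Key Technologies for the Automatic Construction and Maintenance of STKOS" (XQYF0102).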

Abstract: Objective: To implement standardized control and management of abbreviations and thereby reduce the misunderstanding of texts caused by abbreviation ambiguity. Methods: Taking the control of ambiguous abbreviation terms during the integration of cross-domain, multi-source thesauri as the target scenario, methods for identifying and extracting abbreviations in an integrated thesaurus system were developed from the word-formation features of abbreviations. With reference to the ISO 25964 standard data model, data models and metadata schemes were designed at the abbreviation-list level, the concept level, and the term level. These were used to automatically identify, and label by term type, the standard abbreviations, standard full forms, other abbreviations and their full forms, and common terms within the concepts of the integrated thesaurus and the abbreviation list, and to express and describe abbreviation concepts in a standardized way. Results: Using the data models and metadata schemes for the abbreviation list, its concepts, and its terms, an abbreviation list containing 10,297 concepts, 135,423 synonymous terms, and 121,154 pairs of broader and narrower relations was built rapidly, and term-type labeling and ambiguity control were applied to all terms of the concepts that contain abbreviations in the STKOS super science and technology thesaurus. Conclusion: The proposed scheme for standardized abbreviation control and its construction strategy can, on the one hand, rapidly label abbreviations and their related terms, revealing meaning and controlling ambiguity at a finer granularity within each concept. On the other hand, they can rapidly build a semantically rich abbreviation list with standardized descriptions, which supports computers in understanding the semantics of domain abbreviations.
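
The term-type labeling described in the abstract can be illustrated with a small sketch. The abstract does not give the authors' identification rules or their ISO 25964-based data model, so everything here is an assumption made for illustration: the class names `Term` and `Concept`, the `TermType` labels (an English rendering of the five categories named in the abstract), the uppercase-pattern test for abbreviations, the initial-letter test for full forms, and the placeholder rule that the first abbreviation listed in a concept is treated as the standard one.

```python
import re
from dataclasses import dataclass, field
from enum import Enum


class TermType(Enum):
    # English renderings of the term categories named in the abstract.
    STANDARD_ABBREVIATION = "standard abbreviation"
    STANDARD_FULL_FORM = "standard full form"
    OTHER_ABBREVIATION = "other abbreviation"
    OTHER_FULL_FORM = "other full form"
    COMMON_TERM = "common term"


@dataclass
class Term:
    label: str
    term_type: TermType = TermType.COMMON_TERM


@dataclass
class Concept:
    concept_id: str
    terms: list[Term] = field(default_factory=list)


# Hypothetical surface rule, not the paper's: 2 to 10 characters,
# starting with an uppercase letter, then uppercase letters, digits or hyphens.
ABBREVIATION_PATTERN = re.compile(r"^[A-Z][A-Z0-9-]{1,9}$")


def looks_like_abbreviation(label: str) -> bool:
    return bool(ABBREVIATION_PATTERN.match(label))


def is_full_form_of(abbreviation: str, label: str) -> bool:
    """Return True if the initials of the words in `label` spell `abbreviation`."""
    words = [w for w in re.split(r"[\s-]+", label.strip()) if w]
    initials = "".join(w[0].upper() for w in words)
    return initials == abbreviation.replace("-", "").upper()


def tag_concept(concept: Concept) -> Concept:
    """Label every term of one concept with a TermType, in place."""
    abbreviations = [t for t in concept.terms if looks_like_abbreviation(t.label)]
    # Placeholder assumption: the first abbreviation listed is the standard one.
    for i, term in enumerate(abbreviations):
        term.term_type = (
            TermType.STANDARD_ABBREVIATION if i == 0 else TermType.OTHER_ABBREVIATION
        )
    standard = abbreviations[0].label if abbreviations else None
    standard_full_form_seen = False
    for term in concept.terms:
        if looks_like_abbreviation(term.label):
            continue
        matches = [a for a in abbreviations if is_full_form_of(a.label, term.label)]
        if not matches:
            term.term_type = TermType.COMMON_TERM
        elif matches[0].label == standard and not standard_full_form_seen:
            term.term_type = TermType.STANDARD_FULL_FORM
            standard_full_form_seen = True
        else:
            term.term_type = TermType.OTHER_FULL_FORM
    return concept
```

In this sketch a concept holding "MRI", "magnetic resonance imaging", "NMRI", "nuclear magnetic resonance imaging" and "MR scan" would be labeled as standard abbreviation, standard full form, other abbreviation, other full form and common term respectively. A production system would need rules for Chinese abbreviations, mixed-case forms and cross-concept ambiguity, none of which this toy heuristic attempts.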

Keywords: abbreviations; ambiguity control; semantic description; integrated thesaurus system

CLC number: G254.2 [Cultural Sciences: Library Science]; R-058 [Medicine and Health]

 
