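Affiliation: [1] Scientific and Technical Documentation and Information Center, Chinese Academy of Agricultural Sciences (中国农业科学院科技文献信息中心)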
Source: 《情报学报》 (Journal of the China Society for Scientific and Technical Information), 1995, No. 5, pp. 329-334 (6 pages)
Abstract: This paper presents SDIC/CASDAIS, an automatic indexing system for Chinese agricultural literature that combines automatic subject indexing with automatic classification indexing. It is a dictionary-based system that uses a subject thesaurus, a pre-match word list, and a stop-word list together. Matching uses a forward longest-match algorithm with character addition and skipping, backtracks over the last two characters, and applies a large set of rules to reduce indexing errors. The system handles the abbreviations and non-standard scientific terms common in agricultural literature and supports dynamic word construction. To cover keywords absent from the word lists, such as variety names, substance names, and place names, SDIC/CASDAIS extracts them by means of characteristic words; its free-word judgment rules can also identify some free words in titles, and word-frequency statistics on these can serve as a basis for updating the word lists. SDIC/CASDAIS indexes 3,000 titles per hour, with an average indexing depth slightly above 4, a subject indexing precision of 98%, and a basic agreement rate of 80% for classification indexing.
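The dictionary-based matching named in the abstract can be illustrated with a minimal sketch of plain forward longest matching against a subject-term list and a stop-word list. This is not the SDIC/CASDAIS implementation: the abstract does not specify the character addition and skipping, the last-two-character backtracking, the pre-match word list, or the error-reduction rules, so all of these are omitted. The function name `forward_max_match` and the sample word lists are hypothetical.

```python
def forward_max_match(text, subject_terms, stop_words, max_len=None):
    """Scan text left to right, always taking the longest dictionary entry.

    Returns the matched subject terms in order of appearance. Stop words
    are consumed but not returned; unmatched characters are skipped.
    """
    subject_terms = set(subject_terms)
    vocab = subject_terms | set(stop_words)
    if max_len is None:
        # Longest entry bounds the window we need to try at each position.
        max_len = max((len(w) for w in vocab), default=1)

    hits = []
    i = 0
    while i < len(text):
        matched = None
        # Try the longest candidate first, then shrink one character at a time.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if candidate in vocab:
                matched = candidate
                break
        if matched is None:
            i += 1  # no entry starts here; move on one character
            continue
        if matched in subject_terms:
            hits.append(matched)
        i += len(matched)
    return hits


# Hypothetical word lists and title, for illustration only.
terms = {"小麦", "小麦锈病", "防治"}
stops = {"的", "研究"}
print(forward_max_match("小麦锈病的防治研究", terms, stops))
# ['小麦锈病', '防治']
```

Trying the longest candidate first is what makes "小麦锈病" win over its prefix "小麦"; a shortest-first scan would index the more general term and leave "锈病" unmatched.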