Source: Journal of Sichuan University (Natural Science Edition), 2009, No. 3, pp. 613-617 (5 pages)
Abstract: By analysing the syntactic and semantic characteristics of Chinese organization names, each name is divided into a front part (the leading words) and a feature word, and a statistics-based recognition method is proposed. Using training data from a mature corpus, the method computes, for each candidate organization name, the credibility of its feature word, the credibility of the first word of the front part, and the credibility of the middle words of the front part. These are combined into an overall word-formation credibility, which is compared with a given threshold to decide whether the candidate is an organization name. In an open test, the method achieved a recall of 85.57% and a precision of 94.37%.
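The scoring pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual formulas, so everything specific here is an assumption: each credibility is estimated as a relative frequency (how often a word appears in that position of a labelled organization name, divided by how often it appears in the corpus overall), the three scores are combined by a weighted sum, and the names `train`, `credibility`, `is_organization`, the weights, and the threshold are all hypothetical.

```python
from collections import Counter

def train(org_names, corpus_tokens):
    """Count word positions in labelled organization names.

    org_names: list of token lists, each one a labelled organization name.
    corpus_tokens: flat list of every token in the training corpus.
    """
    total = Counter(corpus_tokens)
    feature, first, middle = Counter(), Counter(), Counter()
    for tokens in org_names:
        feature[tokens[-1]] += 1      # last token is treated as the feature word
        front = tokens[:-1]           # everything before it is the front part
        if front:
            first[front[0]] += 1
            middle.update(front[1:])
    return total, feature, first, middle

def _ratio(position_counts, total, token):
    # Assumed credibility estimate: relative frequency of the token in this position.
    return position_counts[token] / total[token] if total[token] else 0.0

def credibility(tokens, model, weights=(0.5, 0.3, 0.2)):
    """Combine the three credibilities with an assumed weighted sum."""
    total, feature, first, middle = model
    front = tokens[:-1]
    c_feature = _ratio(feature, total, tokens[-1])
    c_first = _ratio(first, total, front[0]) if front else 0.0
    mid_tokens = front[1:]
    if mid_tokens:
        c_middle = sum(_ratio(middle, total, t) for t in mid_tokens) / len(mid_tokens)
    else:
        c_middle = c_first            # no middle words: fall back to the first-word score
    w_feature, w_first, w_middle = weights
    return w_feature * c_feature + w_first * c_first + w_middle * c_middle

def is_organization(tokens, model, threshold=0.6):
    # The threshold value is illustrative, not taken from the paper.
    return credibility(tokens, model) >= threshold
```

A candidate is passed in as a list of already-segmented words; word segmentation and candidate extraction are outside the scope of this sketch.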
Classification number: TP391.1 [Automation and Computer Technology: Computer Application Technology]