检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:张玥杰[1] 朱靖波[1] 张跃[1] 姚天顺[1]
机构地区:[1]东北大学信息科学与工程学院计算机系,沈阳110006
出 处:《中文信息学报》2000年第1期13-21,共9页Journal of Chinese Information Processing
基 金:国家自然科学基金!(编号:69675019) ;国家教委博士点专项基金
摘 要:本文提出一种以DOP技术作为基本框架,同时利用基于相似的概率评估技术,实现汉语句法分析的方法。其中,对于输入语句,首先需要经过词汇层与词性层两层初选。然后,基于已构建知识源,获取输入语句的片段组合形式。最后,对输入语句与初选结果进行相似性评估,完成输入语句的组合分析过程。为论证方法有效性,基于包含1 000 个语句的真实汉语语料构建知识源,并采用包含100 个语句的真实汉语语料作为测试集。实验表明,句法分析的各项指标都比较令人满意,可有效地实现汉语句法分析。This paper presents a kind of Chinese parsing method which takes the DOP technique as the basic frame and utilizes the similarity based probabilityestimate technique. In the implementation, every input sentence must by preprocessed through the initial selection in word level and part of speech level. Then the fragment combination forms of the input sentence are acquired based on the constructed knowledge source which includes treebank, fragment bank and fragment combination bank. Finally, the similarity estimate between the input sentence and the initial selection result is proceeded by using the similarity based probability estimate technique. So the combination parsing process of the input sentence can be completed successfully. To prove the efficiency of the proposed method, the knowledge source is constructed based on the real world Chinese corpus which involves 1 000 Chinese sentences, and the other real world Chinese corpus which includes 100 Chinese sentences is used as the test set. The experiment result shows that every test parameter is satisfactory and the parsing process can be implemented efficiently.
关 键 词:DOP 汉语 句法分析 相似性评估 树库 片段库
分 类 号:H146.3[语言文字—汉语] TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222