检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:LIANG Yinghong ZHAO Tiejun YAO Jianmin YU Hao
机构地区:[1]MOE-MS Key Laboratory of Natural Language Processing and Speech, Harbin Institute of Technology, Harbin 150001, China [2]School of Information and Computer Engineering, North East Forestry University, Harbin 150080, China
出 处:《Chinese Journal of Electronics》2006年第3期422-426,共5页电子学报(英文版)
基 金:This work is supported by the National Natural Science Foundation of China (No.60375019, 60373101); the International Cooperate Project of Sino-Ireland (No.CI-2003-03).
摘 要:The traditional English text chunking approach identifies phrases by using only one model and phrases with the same types of features. It has been shown that the limitations of using only one model are that: the use of the same types of features is not suitable for all phrases, and data sparseness may also result. In this paper, a Multi-Agent strategy is proposed and applied in the identification of English phrases. And then, this strategy is rapidly transplanted to Chinese text chunking. This strategy puts phrases into agents according to their sensitive features and identifies different phrases in parallel, where the main features are: first, easy and quick communication between phrases; second, avoidance of data sparseness. Through testing on public corpus (English) and Chinese Penn Treebank (Chinese), F score of English chunking achieves to 95.70% and that of Chinese chunking is 95.25%. These results are higher than the best results that have been reported.
关 键 词:Text chunking Sensitive features Multiagent strategy Communication.
分 类 号:TP31[自动化与计算机技术—计算机软件与理论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.112