Approach for Multiword Expression Identification in Natural Language Processing  

Approach for Multiword Expression Identification in Natural Language Processing

在线阅读下载全文

作  者:Deepak Sharma Prakash R. Devale Akhil K. Khare 

机构地区:[1]Department of lnformation Technology and Bharati Vidyapeeth College of Engineering & Research, Pune 411043, India

出  处:《Computer Technology and Application》2011年第8期663-666,共4页计算机技术与应用(英文版)

摘  要:In this paper, the authors are presenting the approach to extract the multiword expression (MWEs) from monolingual corpora. It both validates and generates multiword candidates. The multiword expression provides a list of candidates which are extracted and filtered according to the number of criteria and a set of standard statistical association measures. The generation of the multiword candidates is based on the surface forms, while the validation consists of series of criteria for removing noise using language independent association measures. For generating corpus count, it provides both a corpus indexation facility. Also, this approach allows easy integration with a machine learning tool for thecreation and application of supervised multiword extraction models if annotated data is available. The authors present the use of multiword in a standard configuration, for extracting MWEs from a corpus of general purpose English.

关 键 词:Multiword candidates association measures surface forms monolingual corpora. 

分 类 号:TP391[自动化与计算机技术—计算机应用技术] X32-65[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象