现状和设想——试论中文信息处理与现代汉语研究  (Cited by: 21)

The-State-of-the-Art and the Related Strategic Considerations ——On the Studies of Chinese Information Processing and Contemporary Chinese Language

Author: Xu Jialu (许嘉璐)[1]

Affiliation: [1] Standing Committee of the National People's Congress, Beijing 100805

Source: Journal of Chinese Information Processing (《中文信息学报》), 2001, No. 2, pp. 1-8 (8 pages)

Abstract: This paper surveys the state of the art in Chinese information processing and the main obstacles it currently faces, and argues that the key problem is that research on contemporary Chinese lags behind. To date, Chinese information processing has relied mainly on statistics over large-scale corpora, using probabilities to define the relationships between words. The fact that the technology has made little headway for years shows that this approach can hardly break through the "bottleneck". For computers to process contemporary Chinese automatically, that is, to become truly "intelligent", human linguistic knowledge must be "taught" to the computer. This requires strengthening research on contemporary Chinese, particularly on semantics, in line with the requirements of computation. The paper introduces three schools of work that are pursuing this direction and have made considerable progress, and points out the shortcomings of each. Drawing on the author's experience leading the national Ninth Five-Year Plan key project "Study on Lexicology of Contemporary Chinese Language for Information Processing" (信息处理用现代汉语词汇研究), it proposes a strategy of "sharing resources nationwide, doing research hand in hand, and tackling key problems jointly".

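Keywords: Chinese information processing; contemporary Chinese language research; strategic considerations; computer processing; Chinese vocabulary; corpus statistics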

Classification: TP391.12 [Automation and Computer Technology: Computer Application Technology]

 
