An Example-Based Chinese-English Machine Translation Strategy (一种基于实例的汉英机器翻译策略)  Cited by: 5

Authors: Hu Guoquan (胡国全)[1], Chen Jiajun (陈家骏)[1], Dai Xinyu (戴新宇)[1], Yin Cunyan (尹存燕)[1]

Affiliation: [1] State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, Jiangsu 210093, China

Source: Computer Engineering and Design (《计算机工程与设计》), 2005, No. 4, pp. 900-903, 906 (5 pages)

Funding: National 863 High-Tech Research and Development Program of China (2001AA117010)

Abstract: This paper presents a Chinese-English machine translation strategy based on the example-based machine translation (EBMT) technique. It focuses on the design of the Chinese-English bilingual corpus and on the algorithm for matching Chinese sentences against that corpus. In view of the characteristics of Chinese, sentences are matched directly at the level of Chinese characters, without word segmentation. Determining the boundaries of matched fragments is one of the main difficulties of EBMT, and a corresponding measure is adopted to address it. The assembly of matched fragments into a complete translation is not studied in depth, because this strategy is meant to serve as one engine in a multi-engine translation system, where it works together with other translation strategies to improve the accuracy of the output. EBMT needs a large bilingual corpus as the basis for translation, and building a large corpus by hand is time-consuming and laborious, so the authors attempt to build the Chinese-English bilingual corpus automatically by computer, including alignment of the bilingual texts and alignment at the word level. Statistical methods, such as co-occurrence frequency, are used to align sentences and words.
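
To make the idea of character-level matching without word segmentation concrete, the following is a minimal sketch only. The abstract does not give the paper's actual matching algorithm or its boundary rule, so this example substitutes Python's standard `difflib.SequenceMatcher` to find contiguous character runs shared by an input sentence and each example sentence. The function names `matched_fragments` and `best_example`, the `min_len` threshold, and the two-sentence toy corpus are all hypothetical and chosen for illustration.

```python
from difflib import SequenceMatcher


def matched_fragments(query, example, min_len=2):
    """Return contiguous character runs shared by query and example.

    Assumption: difflib's block matching stands in for the paper's
    matching algorithm, which the abstract does not describe.
    """
    matcher = SequenceMatcher(None, query, example, autojunk=False)
    return [
        query[block.a:block.a + block.size]
        for block in matcher.get_matching_blocks()
        if block.size >= min_len  # drop very short runs (hypothetical threshold)
    ]


def best_example(query, corpus, min_len=2):
    """Pick the (source, target) pair whose source shares the most characters."""
    def score(pair):
        return sum(len(f) for f in matched_fragments(query, pair[0], min_len))
    return max(corpus, key=score)


# Toy bilingual corpus, invented for this sketch.
corpus = [
    ("我喜欢喝茶", "I like drinking tea"),
    ("他在南京大学学习", "He studies at Nanjing University"),
]

query = "我喜欢喝咖啡"
source, target = best_example(query, corpus)
fragments = matched_fragments(query, source)
```

Because the comparison runs over characters, no segmenter is needed before matching. The cost is that a fragment boundary can fall inside a word, which is the boundary problem the abstract identifies as a difficulty of EBMT.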

Keywords: natural language processing; machine translation; example; EBMT technique

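Classification: TP391.2 [Automation and Computer Technology: Computer Application Technology]; H085 [Automation and Computer Technology: Computer Science and Technology]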

 
