Om:One tool for many (Indian) languages  

Om: One tool for many (Indian) languages

在线阅读下载全文

作  者:GANAPATHIRAJU Madhavi BALAKRISHNAN Mini BALAKRISHNAN N. REDDY Raj 

机构地区:[1]Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA,Supercomputer Education and Research Centre, Indian Institute of Science, Bangalore 560 012, India,Supercomputer Education and Research Centre, Indian Institute of Science, Bangalore 560 012, India,Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA

出  处:《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》2005年第11期1348-1353,共6页浙江大学学报(英文版)A辑(应用物理与工程)

摘  要:Many different languages are spoken in India, each language being the mother tongue of tens of millions of people. While the languages and scripts are distinct from each other, the grammar and the alphabet are similar to a large extent. One common feature is that all the Indian languages are phonetic in nature. In this paper we describe the development of a translit- eration scheme Om which exploits this phonetic nature of the alphabet. Om uses ASCII characters to represent Indian language alphabets, and thus can be read directly in English, by a large number of users who cannot read script in other Indian languages than their mother tongue. It is also useful in computer applications where local language tools such as email and chat are not yet available. Another significant contribution presented in this paper is the development of a text editor for Indian languages that integrates the Om input for many Indian languages into a word processor such as Microsoft WinWord?. The text editor is also developed on Java? platform that can run on Unix machines as well. We propose this transliteration scheme as a possible standard for Indian language transliteration and keyboard entry.Many different languages are spoken in India, each language being the mother tongue of tens of millions of people. While the languages and scripts are distinct from each other, the grammar and the alphabet are similar to a large extent. One common feature is that all the Indian languages are phonetic in nature. In this paper we describe the development of a transliteration scheme Om which exploits this phonetic nature of the alphabet. Om uses ASCII characters to represent Indian language alphabets, and thus can be read directly in English, by a large number of users who cannot read script in other Indian languages than their mother tongue. It is also useful in computer applications where local language tools such as email and chat are not yet available. Another significant contribution presented in this paper is the development of a text editor for Indian languages that integrates the Om input for many Indian languages into a word processor such as Microsoft Wi nWord. The text editor is also developed on Java platform that can run on Unix machines as well. We propose this transliteration scheme as a possible standard for Indian language transliteration and keyboard entry.

关 键 词:Om transliteration Indian language technologies Text editor 

分 类 号:F49[经济管理—产业经济]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象