Cross-Language Transfer Learning-based Lhasa-Tibetan Speech Recognition  

在线阅读下载全文

作  者:Zhijie Wang Yue Zhao Licheng Wu Xiaojun Bi Zhuoma Dawa Qiang Ji 

机构地区:[1]School of Information Engineering,Minzu University of China,Beijing,100081,China [2]School of Chinese Ethnic Minority Languages and Literatures,Minzu University of China,Beijing,100081,China [3]Department of Electrical,Computer,and Systems Engineering,Rensselaer Polytechnic Institute,Troy,NY 12180-3590,USA

出  处:《Computers, Materials & Continua》2022年第10期629-639,共11页计算机、材料和连续体(英文)

基  金:This work was supported by three projects.Zhao Y received the Grant with Nos.61976236 and 2020MDJC06;Bi X J received the Grant with No.20&ZD279.

摘  要:As one of Chinese minority languages,Tibetan speech recognition technology was not researched upon as extensively as Chinese and English were until recently.This,along with the relatively small Tibetan corpus,has resulted in an unsatisfying performance of Tibetan speech recognition based on an end-to-end model.This paper aims to achieve an accurate Tibetan speech recognition using a small amount of Tibetan training data.We demonstrate effective methods of Tibetan end-to-end speech recognition via cross-language transfer learning from three aspects:modeling unit selection,transfer learning method,and source language selection.Experimental results show that the Chinese-Tibetan multi-language learning method using multilanguage character set as the modeling unit yields the best performance on Tibetan Character Error Rate(CER)at 27.3%,which is reduced by 26.1%compared to the language-specific model.And our method also achieves the 2.2%higher accuracy using less amount of data compared with the method using Tibetan multi-dialect transfer learning under the same model structure and data set.

关 键 词:Cross-language transfer learning low-resource language modeling unit Tibetan speech recognition 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术] H214[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象