Detection of time varying pitch in tonal languages: an approach based on ensemble empirical mode decomposition  被引量:5

Detection of time varying pitch in tonal languages: an approach based on ensemble empirical mode decomposition

在线阅读下载全文

作  者:Hong HONG Xiao-hua ZHU Wei-min SU Run-tong GENG Xin-long WANG 

机构地区:[1]School of Electronic Engineering and Optoelectronic Techniques,Nanjing University of Science and Technology,Nanjing 210094,China [2]State Key Laboratory of Modern Acoustics,Institute of Acoustics,Nanjing University,Nanjing 210093,China

出  处:《Journal of Zhejiang University-Science C(Computers and Electronics)》2012年第2期139-145,共7页浙江大学学报C辑(计算机与电子(英文版)

基  金:supported by the National Natural Science Foundation of China (No. 10574070);the State Key Laboratory Foundation of China (No. 9140C240207060C24)

摘  要:A method based on ensemble empirical mode decomposition (EEMD) is proposed for accurately detecting the time varying pitch of speech in tonal languages. Unlike frame-, event-, or subspace-based pitch detectors, the time varying information of pitch within the short duration, which is of crucial importance in speech processing of tonal languages, can be accurately extracted. The Chinese Linguistic Data Consortium (CLDC) database for Mandarin Chinese was employed as standard speech data for the evaluation of the effectiveness of the method. It is shown that the proposed method provides more accurate and reliable results, particularly in estimating the tones of non-monotonically varying pitches like the third one in Mandarin Chinese. Also, it is shown that the new method has strong resistance to noise disturbance.A method based on ensemble empirical mode decomposition (EEMD) is proposed for accurately detecting the time varying pitch of speech in tonal languages. Unlike frame-, event-, or subspace-based pitch detectors, the time varying information of pitch within the short duration, which is of crucial importance in speech processing of tonal languages, can be accurately extracted. The Chinese Linguistic Data Consortium (CLDC) database for Mandarin Chinese was employed as standard speech data for the evaluation of the effectiveness of the method. It is shown that the proposed method provides more accurate and reliable results, particularly in estimating the tones of non-monotonically varying pitches like the third one in Mandarin Chinese. Also, it is shown that the new method has strong resistance to noise disturbance.

关 键 词:Ensemble empirical mode decomposition Time varying pitch Tonal language Noise restraint 

分 类 号:TN911.7[电子电信—通信与信息系统] P315.7[电子电信—信息与通信工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象