Research on Recognition of Whispered Chinese Digits (汉语数字耳语音识别研究)  Cited by: 2

Speech Recognition of Chinese Whispered Speech


Author: Deng Xiuhui (邓秀慧) [1,2]

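Affiliations: [1] School of Computer Engineering, Nanjing Institute of Technology, Nanjing, Jiangsu 211167 [2] College of Computer and Information, Hohai University, Nanjing, Jiangsu 210098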

Source: Audio Engineering (《电声技术》), 2014, No. 7, pp. 47-50 (4 pages)

Funding: National Natural Science Foundation of China (51101086)

Abstract: Whispered speech recognition can serve certain special needs in national security. This paper introduces the physiological and acoustic characteristics of whispered speech: its excitation is a noise source and its formants are shifted, which makes it harder to recognize than normal speech. The dual-threshold method is used for endpoint detection on the speech samples, and the best values of its four parameters, the high and low thresholds of short-time energy and of short-time zero-crossing rate, are found experimentally. The noise robustness of the feature parameters is analyzed in depth, and short-time energy together with first-order and second-order differences is added to the MFCC parameters to improve their noise robustness. The results show that, with a hidden Markov model (HMM), using MFCC and LPCC jointly gives far better recognition than using either parameter alone.
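The dual-threshold endpoint detection named in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the seed-and-extend rule, the frame sizes, and every threshold value below are assumptions chosen for illustration, and the optimal threshold values reported in the paper are not given in the abstract. A frame above a high threshold seeds a segment, and the segment is then extended in both directions while frames stay above a low threshold.

```python
import math

def split_frames(x, frame_len, hop):
    """Split the sample list x into frames of frame_len samples, hop samples apart."""
    return [x[i:i + frame_len] for i in range(0, len(x) - frame_len + 1, hop)]

def short_time_energy(frame):
    """Sum of squared samples in one frame."""
    return sum(s * s for s in frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def detect_endpoints(x, frame_len, hop, e_high, e_low, z_high, z_low):
    """Return (start_frame, end_frame) of the first detected segment, or None.

    Simplified rule (an assumption, not the paper's exact procedure):
    seed on a frame exceeding a high threshold, then grow the segment
    while neighbouring frames exceed a low threshold.
    """
    frames = split_frames(x, frame_len, hop)
    energy = [short_time_energy(f) for f in frames]
    zcr = [zero_crossing_rate(f) for f in frames]

    seed = next((i for i in range(len(frames))
                 if energy[i] > e_high or zcr[i] > z_high), None)
    if seed is None:
        return None

    def active(i):
        return energy[i] > e_low or zcr[i] > z_low

    start = seed
    while start > 0 and active(start - 1):
        start -= 1
    end = seed
    while end + 1 < len(frames) and active(end + 1):
        end += 1
    return start, end

def demo_signal():
    """Synthetic test signal: silence, a quiet onset, a loud tone, silence."""
    x = []
    for n in range(1600):
        if n < 300 or n >= 1200:
            x.append(0.0)
        else:
            amp = 0.2 if n < 400 else 1.0
            x.append(amp * math.sin(2 * math.pi * n / 10))
    return x

segment = detect_endpoints(demo_signal(), frame_len=100, hop=100,
                           e_high=10.0, e_low=1.0, z_high=0.4, z_low=0.3)
print(segment)
```

On this synthetic signal the quiet onset frame falls below the high energy threshold but above the low one, so it is picked up only during the extension step. On real recordings the thresholds would normally be set relative to measured background-noise statistics, since noise-like whispered speech and background noise can both show a high zero-crossing rate.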

Keywords: speech recognition; whispered speech; recognition research

Classification: O429 [Science: Acoustics]

 
