Research on speech validation codes based on formant synthesis and prosody adjustment  Cited by: 4


Authors: Wang Chengliang (汪成亮)[1], Zhang Yuwei (张玉维)[1]

Affiliation: [1] College of Computer Science, Chongqing University, Chongqing 400044, China

Source: Application Research of Computers (《计算机应用研究》), 2011, No. 7, pp. 2458-2461 (4 pages)

Funding: National Natural Science Foundation of China (61004112); China Postdoctoral Science Foundation (20080430750)

Abstract: To improve the effectiveness of speech verification technology, this paper proposes a method for generating random speech validation codes based on formant synthesis, duration modification, and prosody adjustment. The method uses phonemes as the speech synthesis units. During rule-based synthesis it sets random speech-rate parameters and adjusts the connection rules between units to randomize the prosody, so that both speech rate and prosody are uncertain and unpredictable. This effectively lowers the rate at which automatic speech recognition (ASR) recognizes the codes and strengthens their resistance to attack. The synthesized speech validation codes were recognized correctly about 90% of the time by human listeners and 28.8% of the time by ASR software, with a mean opinion score (MOS) of 4. The intelligibility and clarity of the synthesized speech were satisfactory. The experimental results confirm the feasibility of the proposed method.
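The idea described in the abstract (formant-synthesized units joined with randomized duration and pitch) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the vowel units standing in for phonemes, the formant frequencies and bandwidths, the 8 kHz sample rate, the impulse-train source, and the randomization ranges. The function names are hypothetical.

```python
import math
import random

SAMPLE_RATE = 8000  # Hz; assumed for illustration

# Illustrative formant frequencies (Hz) for three vowels; not from the paper.
VOWEL_FORMANTS = {
    "a": (730, 1090, 2440),
    "i": (270, 2290, 3010),
    "u": (300, 870, 2240),
}
BANDWIDTHS = (60, 90, 150)  # assumed formant bandwidths in Hz


def resonator(signal, freq, bandwidth, rate=SAMPLE_RATE):
    """Apply a two-pole digital resonator (one formant) to a list of samples."""
    t = 1.0 / rate
    c = -math.exp(-2.0 * math.pi * bandwidth * t)
    b = 2.0 * math.exp(-math.pi * bandwidth * t) * math.cos(2.0 * math.pi * freq * t)
    a = 1.0 - b - c  # normalizes the gain to 1 at 0 Hz
    out, y1, y2 = [], 0.0, 0.0
    for x in signal:
        y = a * x + b * y1 + c * y2
        out.append(y)
        y2, y1 = y1, y
    return out


def synthesize_unit(vowel, duration_s, pitch_hz):
    """Filter an impulse train at pitch_hz through cascaded formant resonators."""
    n = int(duration_s * SAMPLE_RATE)
    period = max(1, int(SAMPLE_RATE / pitch_hz))
    samples = [1.0 if i % period == 0 else 0.0 for i in range(n)]
    for freq, bw in zip(VOWEL_FORMANTS[vowel], BANDWIDTHS):
        samples = resonator(samples, freq, bw)
    return samples


def random_prosody(rng, base_duration=0.20, base_pitch=120.0):
    """Draw a random duration and pitch for one unit (ranges are assumptions)."""
    duration = base_duration * rng.uniform(0.7, 1.4)
    pitch = base_pitch * rng.uniform(0.85, 1.2)
    return duration, pitch


def generate_code(units, rng):
    """Concatenate units, each with independently randomized prosody."""
    audio, plan = [], []
    for unit in units:
        duration, pitch = random_prosody(rng)
        plan.append((unit, duration, pitch))
        audio.extend(synthesize_unit(unit, duration, pitch))
    return audio, plan
```

Calling `generate_code(["a", "i", "u"], random.Random())` returns the raw samples together with the per-unit duration and pitch that were drawn. Because each unit's parameters are drawn independently, two codes with the same content differ in timing and intonation, which is the property the abstract credits with lowering the ASR recognition rate.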

Keywords: speech synthesis; validation code; formant synthesis; prosody adjustment; duration modification

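Classification: TN912.33 [Electronics and Telecommunications: Communication and Information Systems]; TP309 [Electronics and Telecommunications: Information and Communication Engineering]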

 
