Coding Distortion Compensation for Speaker Identification Based on Model Synthesis

Source: Computer Engineering and Applications (《计算机工程与应用》), 2011, No. 3, pp. 135-138 (4 pages)

Funding: National Key Basic Research Development Program of China (973 Program), No. 2007CB311100

Abstract: Mismatch between the coding conditions of enrollment and test speech is one of the main factors that degrade speaker recognition accuracy. Experiments on a speaker recognition system, using speech coded in different formats at bit rates ranging from 5.15 Kb/s to 128 Kb/s, show that high-bit-rate coding has little effect on the system, whereas low-bit-rate coding causes performance to drop sharply. To address this problem, a speaker model synthesis algorithm based on the universal background model (UBM) is used to synthesize speaker models for the target coding environments, compensating for the distortion introduced by low-bit-rate coding. Experiments on the NIST 2002 corpus in the one-speaker detection task show that the approach significantly improves the recognition rate compared with the uncompensated system.
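The abstract does not give the synthesis formulas, so the following is only a minimal sketch of one commonly described form of UBM-based speaker model synthesis, not necessarily the authors' exact method. It assumes diagonal-covariance GMMs whose components are aligned, for example because the codec-dependent UBMs and the speaker model were all adapted from one root UBM. The names `DiagGMM` and `synthesize_speaker_model` are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DiagGMM:
    """Diagonal-covariance GMM: per-component weight, mean vector, variance vector."""
    weights: List[float]
    means: List[List[float]]
    variances: List[List[float]]


def synthesize_speaker_model(speaker_src: DiagGMM,
                             ubm_src: DiagGMM,
                             ubm_tgt: DiagGMM) -> DiagGMM:
    """Map a speaker model from the source codec condition to the target one.

    Assumption: component k refers to the same acoustic region in all three
    models, so the UBM-to-UBM change can be applied component by component.
    """
    weights, means, variances = [], [], []
    for k in range(len(speaker_src.weights)):
        # Scale the weight by the ratio of target to source UBM weights.
        weights.append(speaker_src.weights[k]
                       * ubm_tgt.weights[k] / ubm_src.weights[k])
        # Shift the mean by the offset between the two UBM means.
        means.append([m + (mt - ms) for m, ms, mt in
                      zip(speaker_src.means[k], ubm_src.means[k], ubm_tgt.means[k])])
        # Scale the variance by the ratio of target to source UBM variances.
        variances.append([v * vt / vs for v, vs, vt in
                          zip(speaker_src.variances[k],
                              ubm_src.variances[k],
                              ubm_tgt.variances[k])])
    # Renormalize so the synthesized weights sum to one.
    total = sum(weights)
    weights = [w / total for w in weights]
    return DiagGMM(weights, means, variances)
```

In this sketch, a speaker enrolled with high-bit-rate speech would be passed as `speaker_src`, with `ubm_src` and `ubm_tgt` trained on speech in the enrollment codec and the low-bit-rate test codec respectively. The returned model would then be scored against test speech in the low-bit-rate condition.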

Keywords: speech coding; speaker recognition; low bit rate; model synthesis

Classification code: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]
