采用经验模态分解的语音与音频通用编码方法  

A Unified Speech and Audio Coding with Empirical Mode Decomposition

在线阅读下载全文

作  者:李晓明[1] 鲍长春[1] 

机构地区:[1]北京工业大学电子信息与控制工程学院语音与音频信号处理研究室,北京100124

出  处:《信号处理》2013年第10期1274-1282,共9页Journal of Signal Processing

基  金:国家自然科学基金资助项目(61072089;61201197);北京市教育委员会科技发展计划重点项目(KZ201110005005)~~

摘  要:为有效解决现有单一模型编码器无法在中低速率对语音和音频信号进行高质量通用编码的问题,本文借助语音与音频信号的谐波特性,建立了一种对语音和音频信号统一编码的方法。首先,本文利用经验模态分解(Empirical Mode Decomposition,EMD)提取输入信号的谐波成分;其次,利用感知匹配追踪算法,并结合正弦参数建模对谐波成分进行参数提取与量化;第三,对于量化谐波后的残差进行抖动格型矢量量化,以提升重建音频的主观听觉质量,并最终实现一套包含24kbps和32kbps码率的宽带语音与音频通用编码器;最后,对所提算法进行了客观PESQ/PEAQ和主观A/B测试,并与ITU-T G.722.1和G.722.2编码器进行了比较,实验结果表明,所提编码器对语音和音频信号的编码质量均优于参考编码器。In this paper,a unified speech and audio coding method that based on Empirical Mode Decomposition (EMD) by exploiting the harmonic structure of input signal was proposed.This coder can achieve a high performance for both speech and audio signals at low and medium bitrates,which cannot be done by the codec with one single analysis model.Prior to the quantization,the EMD was adopted to extract the harmonic components of the input signal,after this,the extracted harmonic signal was modeled and quantized by sinusoidal model and perceptual weighted matching pursuit.For the quantization residual of harmonic signal,the dithered lattice vector quantization was used to improve the subjective quality.Finally,both the objective PESQ/PEAQ results and subjective A/B listening tests show that the proposed coder outperforms the ITU-T G.722.1 and G.722.2 codec.

关 键 词:语音编码 音频编码 经验模态分解 感知匹配追踪 抖动格型矢量量化 

分 类 号:TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象