基于声源方位信息和非线性时频掩蔽的语音盲提取算法 (cited by: 10)

Speech blind extraction algorithm based on sound source azimuth information and nonlinear time-frequency masking

Source: Acta Acustica (《声学学报》), 2013, No. 2, pp. 224-230 (7 pages)

Funding: Supported by the National Natural Science Foundation of China (61071159)

Abstract: For the underdetermined convolutive mixture model of speech signals, a blind speech extraction algorithm based on sound source azimuth information and nonlinear time-frequency masking is proposed. First, time-frequency analysis of the low-frequency band of the mixed speech signals yields instantaneous relative time delays (ITDs), and a potential-function clustering analysis estimates the number of sources and their ITDs. Next, the target source is locked and accurate azimuth information for the target speech is extracted. Finally, exploiting the approximate W-disjoint orthogonality of independent speech signals in the time-frequency domain, the target speech is extracted by nonlinear time-frequency masking based on the target's azimuth information. Simulation results show that the method can lock onto a target of interest at an arbitrary azimuth and extract the target speech effectively; under the experimental conditions of the paper, the signal-to-noise ratio gain (SNRG) averages 9.5 dB.
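The clustering and masking steps named in the abstract can be illustrated with a small sketch. The abstract does not give the form of the potential function or the shape of the nonlinear mask, so the Gaussian kernels, the function names (`potential`, `find_peaks`, `nonlinear_mask`), the `width` parameter, and the synthetic ITD data below are all assumptions made for illustration. This is not the authors' implementation.

```python
import math

def potential(candidates, itd_samples, width):
    """Assumed Gaussian-kernel potential: every instantaneous ITD sample
    votes for the candidate delays that lie close to it."""
    return [
        sum(math.exp(-((c - d) / width) ** 2) for d in itd_samples)
        for c in candidates
    ]

def find_peaks(candidates, values, min_height):
    """Return the candidate delays that are local maxima of the potential
    and at least min_height tall. Each peak is taken as one source."""
    peaks = []
    for i in range(1, len(values) - 1):
        is_local_max = values[i] > values[i - 1] and values[i] >= values[i + 1]
        if is_local_max and values[i] >= min_height:
            peaks.append(candidates[i])
    return peaks

def nonlinear_mask(itd, target_itd, width):
    """Assumed soft mask in [0, 1]: 1 when a time-frequency bin's ITD equals
    the target ITD, decaying smoothly as the bin's ITD moves away from it."""
    return math.exp(-((itd - target_itd) / width) ** 2)

# Synthetic instantaneous ITDs (in ms) for three hypothetical sources.
true_itds = [-0.3, 0.1, 0.4]
itd_samples = [c + 0.004 * k for c in true_itds for k in range(-5, 6)]

# Candidate delays from -0.5 ms to 0.5 ms in 0.01 ms steps.
candidates = [(i - 50) / 100 for i in range(101)]
values = potential(candidates, itd_samples, width=0.03)

# Peaks above half the maximum give the source count and the source ITDs.
peaks = find_peaks(candidates, values, min_height=max(values) / 2)
print("estimated number of sources:", len(peaks))
print("estimated ITDs (ms):", peaks)

# Mask weights for bins near the chosen target and near an interferer.
target = peaks[1]
print("weight at target ITD:", nonlinear_mask(0.1, target, width=0.03))
print("weight at interferer ITD:", nonlinear_mask(0.4, target, width=0.03))
```

In a full system the mask weights would multiply the time-frequency coefficients of the mixture before inverse transformation. Masking is only justified when the sources are approximately W-disjoint orthogonal, that is, when each time-frequency bin is dominated by a single source.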

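Keywords: blind extraction algorithm; speech signal; time-frequency analysis; azimuth information; nonlinear masking; sound source; cluster analysis method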

Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]
