基于声源时延估计的欠定盲分离方法  被引量:1

Underdetermined Blind Separation Based on Sound Source Time-Delay Estimation

在线阅读下载全文

作  者:张华[1] 冯大政[1] 庞继勇[2] 

机构地区:[1]西安电子科技大学雷达信号处理国家重点实验室,西安710071 [2]西安电子科技大学ISN国家重点实验室,西安710071

出  处:《数据采集与处理》2009年第6期703-708,共6页Journal of Data Acquisition and Processing

基  金:国家自然科学基金(60672128;60702057)资助项目;国家高技术研究发展计划"八六三"计划(2007AA01Z288)资助项目

摘  要:提出一种基于声源时延估计的二元时频掩蔽方法,通过三个接收信号实现多于多个语音源信号的欠定盲分离。利用语音信号的W-分离正交性,在时频域估计各个源信号到达接收阵列的相对时延序列;进而基于信号时延序列的估计,采用最大似然算法将时频域划分为与源信号个数相同的互不重叠的时频点集合,每个集合(近似)只包含一个源信号的所有时频分量;再通过二元时频掩蔽依次恢复出各集合所对应的源信号。该方法性能通过主观试听得到了验证,其分段信噪比增益至少为13 dB。较之欠定解混迭估计技术DUET,本文方法得到的分离信号与实际声源信号的相异度降低约3 dB。Based on time-delay estimation, a time-frequency masking method is proposed for underdetermined blind source separation. The method can realize the blind separation more than 3 source signals by using only 3 received array elements. Firstly, relative time-delay sequences of all sources are estimated in time-frequency domain by virtue of the W-disjoint orthogonality of speech signals. Secondly, based on the estimated time-delay sequences, the maximum likelihood method is used to estimate the support domain of each signal. The timefrequency components in each support domain belong to only one signal approximatively, and different support domains are mutually disjoint. Finally, the time-frequency representation of each signal is obtained by the time-frequency masking, and then the time-domain source signals are retrieved. The experiments illustrate that the method is validated by the informal subjective measure, and the gain of segment signal-to-noise ratio is at least 13dB. Compared with the degenerate unmixing estimation technique, the separation performance of the proposed method improves about 3dB measured by signal dissimilarities.

关 键 词:欠定盲分离 时延估计 W-分离正交性 最大似然 时频掩蔽 

分 类 号:TN912.3[电子电信—通信与信息系统]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象