A New Method for the Permutation Problem in Frequency-Domain Blind Separation of Convolutively Mixed Speech  (Cited by: 1)

Approach to Permutation Alignment of Blind Source Separation in Frequency-Domain for Convolutive Mixing Speech


Authors: Wu Wenyan[1], Zhang Liming[1]

Affiliation: [1] Department of Electronic Engineering, Fudan University, Shanghai 200433, China

Source: Journal of Data Acquisition and Processing (《数据采集与处理》), 2008, No. 6, pp. 734-739 (6 pages)

Funding: Supported by the National Natural Science Foundation of China (NSF60571052)

Abstract: In real-world environments, multichannel speech signals are usually mixed convolutively, so instantaneous blind source separation (BSS) methods cannot separate them well, while frequency-domain methods suffer degraded performance because of the permutation ambiguity across frequency bins. This paper uses a time-frequency masking approach to obtain, at each frequency bin, separated signals that are distorted but have a definite order. These signals serve as references and are correlated with the order-ambiguous signals recovered in the frequency domain by independent component analysis (ICA), which resolves the permutation and yields accurate separated speech signals. Experimental results show that the proposed method effectively resolves the permutation ambiguity of frequency-domain BSS and separates convolutively mixed speech with higher accuracy.
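The alignment step described in the abstract can be illustrated with a minimal sketch. This record gives only the abstract, so everything specific here is an assumption rather than the authors' implementation: the function name `align_permutations`, the array layout `(n_bins, n_src, n_frames)`, the use of magnitude envelopes as the quantity being correlated, and the exhaustive search over permutations (practical only for a small number of sources). The reference signals are assumed to come from a time-frequency mask and to share one consistent source order across bins.

```python
import itertools

import numpy as np


def align_permutations(separated, reference):
    """Reorder the sources in every frequency bin to match the reference order.

    separated: complex array (n_bins, n_src, n_frames); source order is
               arbitrary in each bin (e.g. per-bin ICA outputs).
    reference: array of the same shape with a consistent source order,
               possibly distorted (e.g. mask-based estimates).
    Returns (aligned, perms), where aligned[f, i] = separated[f, perms[f, i]].
    """
    n_bins, n_src, _ = separated.shape
    aligned = np.empty_like(separated)
    perms = np.empty((n_bins, n_src), dtype=int)
    for f in range(n_bins):
        # Assumption: correlate magnitude envelopes over time frames.
        ref_env = np.abs(reference[f])
        sep_env = np.abs(separated[f])
        # corr[i, j] = correlation of reference i with separated output j.
        corr = np.corrcoef(ref_env, sep_env)[:n_src, n_src:]
        # Pick the permutation with the largest total correlation.
        best = max(
            itertools.permutations(range(n_src)),
            key=lambda p: sum(corr[i, p[i]] for i in range(n_src)),
        )
        perms[f] = best
        aligned[f] = separated[f, list(best)]
    return aligned, perms
```

Calling `align_permutations(separated, reference)` returns the reordered bins together with the permutation chosen for each bin. For more than a handful of sources, the exhaustive search would need to be replaced by an assignment solver.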

Keywords: blind signal separation; independent component analysis; direction of arrival; time-frequency binary mask

Classification No.: TN912.34 [Electronics and Telecommunications: Communication and Information Systems]

 
