Method for multiple speech source localization based on sub-band steered response power  Cited by: 6


Authors: Ni Zhilian (倪志莲)[1], Cai Weiping (蔡卫平)[1], Zhang Yidian (张怡典)[1]

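Affiliation: [1] School of Electrical Engineering, Jiujiang Vocational and Technical College (九江职业技术学院), Jiujiang, Jiangxi 332007, China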

Source: Computer Engineering and Applications (《计算机工程与应用》), 2013, No. 24, pp. 205-209 (5 pages)

Funding: National Natural Science Foundation of China (No. 60971098)

Abstract: To improve the localization performance of a microphone array when multiple speakers are present, a multiple speech source localization method based on sub-band steered response power is proposed. The speech signal is divided into seven sub-bands in the frequency domain, and the steered response power function with phase transform weighting (SRP-PHAT) is computed in each sub-band. Searching the source space for the maximum of each function yields an initial estimate of the source location. Because speech signals are sparse in frequency, these initial estimates cover the locations of multiple sources, and agglomerative clustering is applied to them to produce the final source location estimates. Simulations and experiments show that, with two speakers and a 10 dB signal-to-noise ratio under moderate reverberation (the Chinese abstract reads "relatively strong reverberation"), the proposed algorithm increases the localization correct rate by about 4% and reduces the localization extra rate by about 7% compared with the conventional algorithm.
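The pipeline summarized in the abstract (split into seven sub-bands, maximize SRP-PHAT per sub-band, then cluster the per-band estimates) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not give the band edges, the search grid, or the clustering criterion, so the equal-width bands, the candidate list, the centroid-distance merge rule with threshold `max_dist`, the speed of sound `C`, and all function names are assumptions. The steered power is computed as the squared magnitude of the sum of phase-normalized, delay-compensated channels, which equals the pairwise PHAT-weighted sum up to an additive constant.

```python
import cmath
import math

C = 343.0  # assumed speed of sound in m/s


def split_bands(n_bins, n_bands=7):
    """Split bin indices 0..n_bins into equal-width (start, stop) ranges (assumed layout)."""
    edges = [round(i * n_bins / n_bands) for i in range(n_bands + 1)]
    return [(edges[i], edges[i + 1]) for i in range(n_bands)]


def srp_phat_subband(spectra, freqs, mics, candidates, band):
    """Return the candidate position with the largest SRP-PHAT value in one sub-band.

    spectra[m][k] is the complex spectrum of microphone m at freqs[k] (Hz).
    """
    best, best_power = None, -math.inf
    for pos in candidates:
        delays = [math.dist(pos, mic) / C for mic in mics]
        power = 0.0
        for k in range(*band):
            steered = 0j
            for m, tau in enumerate(delays):
                x = spectra[m][k]
                if abs(x) > 1e-12:
                    # PHAT: keep only the phase, then undo the propagation delay.
                    steered += x / abs(x) * cmath.exp(2j * math.pi * freqs[k] * tau)
            power += abs(steered) ** 2
        if power > best_power:
            best, best_power = pos, power
    return best


def centroid(cluster):
    return tuple(sum(coord) / len(cluster) for coord in zip(*cluster))


def agglomerate(points, max_dist):
    """Merge the two closest clusters until the closest centroids are farther than max_dist."""
    clusters = [[p] for p in points]
    while len(clusters) > 1:
        d, i, j = min(
            (math.dist(centroid(a), centroid(b)), i, j)
            for i, a in enumerate(clusters)
            for j, b in enumerate(clusters)
            if i < j
        )
        if d > max_dist:
            break
        clusters[i].extend(clusters.pop(j))
    return [centroid(c) for c in clusters]


def localize(spectra, freqs, mics, candidates, n_bands=7, max_dist=0.3):
    """One initial estimate per sub-band, then cluster them into source positions."""
    initial = [
        srp_phat_subband(spectra, freqs, mics, candidates, band)
        for band in split_bands(len(freqs), n_bands)
    ]
    return agglomerate(initial, max_dist)
```

Here `spectra` would come from a short-time Fourier transform of one frame per microphone, and `candidates` from a grid over the room. A practical version would likely also discard clusters supported by too few sub-bands, which this sketch does not attempt.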

Keywords: microphone array; multiple sound source localization; sub-band steered response power; clustering

Classification code: TN912-3 [Electronics and Telecommunications: Communication and Information Systems]

 
