检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:解元 邹涛 孙为军 谢胜利[2] XIE Yuan;ZOU Tao;SUN Weijun;XIE Shengli(School of Mechanical and Electrical Engineering,Guangzhou University,Guangzhou 510006,China;Key Laboratory of Intelligent Information Processing and System Integration of Internet of Things,Ministry of Education,Guangdong University of Technology,Guangzhou 510006,China)
机构地区:[1]广州大学机械与电气工程学院,广东广州510006 [2]广东工业大学物联网智能信息处理与系统集成教育部重点实验室,广东广州510006
出 处:《通信学报》2024年第11期15-26,共12页Journal on Communications
基 金:广州市基础与应用基础研究基金资助项目(No.SL2022A04J00289);国家自然科学基金资助项目(No.62003095,No.52171331);广东省基础与应用基础研究基金资助项目(No.2023A1515011311);广州市市校联合实验室基金资助项目(No.2023A03J0120)。
摘 要:为了解决带混响和噪声场景下的语音增强问题,构建了一个集成多通道线性预测模型和空间相干模型的语音增强模型,设计了一种基于混合混响模型的多通道语音增强算法。该算法将后期混响分为2个分量,分别用多通道线性预测模型和空间相干模型来建模,为优化模型参数,利用卡尔曼滤波器实施更新模型参数,并用多项式矩阵特征值分解进行空间、时间和频率解相关,实现去混响去噪声。实验结果表明,所提算法可以实现高低混响带噪声环境下的语音增强,相比于流行的语音增强算法,其增强效果更优越,其中语音质量客观评价(PESQ)值和短时客观可懂度(STOI)值最高分别提高了30%和20%。To solve the speech enhancement problem in reverberation and noise scenarios,a new speech enhancement model was constructed integrating multichannel linear prediction model and spatial coherence model,and then a multi‐channel speech enhancement algorithm based on a hybrid reverberation model was designed.The post-reverberation was divided into two components,which were modeled using a multichannel linear prediction model and a spatial coherence model,respectively.To optimize the model parameters,a Kalman filter was used to update the model parameters and polynomial matrix eigenvalue decomposition was used for spatial,temporal,and frequency decorrelation to achieve re‐verberation and noise reduction.Experimental results show that the proposed algorithm can enhance speech in high and low-reverberation noise environments,and its enhancement effect is superior to popular speech enhancement algorithms,the performance indicators of speech enhancement,perceptual evaluation of speech quality score(PESQ)value and short-time objective intelligibility(STOI)value,have increased by 30%and 20%,respectively.
关 键 词:多通道语音增强 卡尔曼滤波器 多项式矩阵特征值分解
分 类 号:TP18[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.13