自适应秩约束逆矩阵近似分解及其在语音增强中的应用  

Adaptive rank constrained inverse matrix approximation decomposition method and its application in speech enhancement

在线阅读下载全文

作  者:王强进 吴占涛[1,2,3] 李宝庆 杨宇[1,2] Wang Qiangjin;Wu Zhantao;Li Baoqing;Yang Yu(College of Mechanical&Vehicle Engineering,Hunan University,Changsha 410000,China;Hunan Provincial Key Laboratory of Equipment Service Quality Assurance,Hunan University,Changsha 410000,China;Hunan Provincial Key Laboratory of Construction Machinery Intelligence Technology Based on the Internet of Things,Hunan University,Changsha 410000,China)

机构地区:[1]湖南大学机械与运载工程学院,长沙410000 [2]湖南大学装备服役质量保障湖南省重点实验室,长沙410000 [3]湖南大学基于物联网的工程机械智能化技术湖南省重点实验室,长沙410000

出  处:《计算机应用研究》2024年第11期3389-3393,共5页Application Research of Computers

基  金:国家自然科学基金资助项目(51975193);湖南省普通高等学校教学改革研究项目(HNJG-2022-0520);湖南大学本科教育教学改革专项(2024)。

摘  要:针对低秩约束和稀疏矩阵分解(constrained low-rank and sparse matrix decomposition, CLSMD)方法中硬阈值可能导致降噪后的语音信号分量丢失或出现孤立噪声问题,提出了一种自适应秩约束逆矩阵近似(adaptive rank constrained inverse matrix approximation, ARCIMA)分解方法。该方法首先采用能量阈值法初步估计低秩矩阵秩值,然后从语音信号子空间矩阵的结构特性出发,采用修正双边随机投影(modified bilateral random projections, MBRP)方法求解代表纯净语音信号的低秩矩阵,降低使用SVD方法的计算量,并通过Tikhonov正则化优化方法改善迭代求解过程中解的病态性。实验结果表明,该方法相比经典方法在多种噪声环境下取得了更好的PESQ得分,并且增强语音的时域波形也更接近原始信号的波形。该方法去噪性能在低信噪比噪声条件下具有优势。This paper proposed a new matrix decomposition method ARCIMA to address issues in the CLSMD approach,where hard thresholding could lead to loss of speech signal components or isolated noise problems.Initially,the energy thres-hold method estimated the rank of the low-rank matrix.Then,considering the structural characteristics of the speech signal subspace matrix,the MBRP method solved the low-rank matrix representing the clean speech signal,reducing the computational load compared to the SVD method.Tikhonov regularization optimized the solution’s stability during iterative solving.Experimental results show that this method achieves better PESQ scores in various noisy environments compared to classical methods,and the enhanced speech waveform is closer to the original speech waveform.The method demonstrates superior denoising performance under low signal-to-noise ratio conditions.

关 键 词:自适应秩约束逆矩阵近似 修正双边随机投影 语音增强 

分 类 号:TP912.35[自动化与计算机技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象