面向内容的语音信号压缩感知研究  被引量:8

Content-based Compressive Sensing for Speech Signal

在线阅读下载全文

作  者:高畅[1] 李海峰[1] 马琳[1] 

机构地区:[1]哈尔滨工业大学计算机学院,黑龙江哈尔滨150001

出  处:《信号处理》2012年第6期851-858,共8页Journal of Signal Processing

基  金:语言语音教育部-微软重点实验室开放基金资助项目(HIT.KLOF.2011XXX);中央高校基本科研业务费专项资金(HIT.NSRIF.2012047);国家自然科学基金项目(61171186)的支持

摘  要:压缩感知理论依据信号的稀疏性质进行压缩测量,将信号的获取方式从对信号的采样上升为对信息的感知,是信号处理领域的一场革命。本文提出一种基于非确定基字典(Uncertainty Basis Dictionary,UBD)对语音信号进行稀疏表示的方法,将压缩感知理论应用于对语音信号稀疏表示的压缩,并提出了基于求解线性规划问题的方法重构语音信号的算法。通过语音识别、话者识别和情感识别实验,从面向内容分析的角度,研究这种基于压缩感知理论的信息感知方法是否保留了语音信号的主要内容。实验结果表明,语音识别、话者识别和情感识别的准确率,与目前这些领域研究方法得到的结果基本一致,说明基于压缩感知理论的信息感知方法能够很好地获取语音信号的语义、话者和情感方面的信息。Compressive sensing theory compress measurements using sparsity of signal,changes the method of signal obtaining from signal sampling to information sensing,and is a revolution of signal processing.The speech signal is sparse represented based on Uncertainty Basis Dictionary proposed in this paper,the sparse representation of speech signal is compressed by compressive sensing theory,and proposes an speech signal reconstruction algorithm based on the method of solving linear programming problem.Through the experiments of audio,speaker and emotion recognition,we research that this information sensing method based on compressive sensing theory weather preserves the main content from the angle of content-based analysis.Experiment results show that the precision of audio,speaker and emotion recognition is general the same with methods in these research domain,and proves that it can acquire the audio,speaker and emotion information of speech signal using the information sensing method based on compressive sensing theory.

关 键 词:压缩感知 语音信号 稀疏表示 线性规划 信息感知 

分 类 号:TP391.42[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象