卷积神经网络在异常声音识别中的研究被引量：19

Research on Abnormal Audio Event Detection Based on Convolutional Neural Networks

作　　者：胡涛张超[1] 程炳吴小培[1] HU Tao ,ZHANG Chao ,CHENG Bing ,WU Xiao-pei(Key Lab of Intelligent Computing and Signal Processing, Anhui University, Hefei, Anhui 230039, Chin)

机构地区：[1]安徽大学计算智能与信号处理教育部重点实验室,安徽合肥230039

出　　处：《信号处理》2018年第3期357-367,共11页Journal of Signal Processing

基　　金：安徽省科技攻关专项(1501b042205)

摘　　要：卷积神经网络(CNNs)已广泛应用于语音识别领域中以改善传统声学模型存在的鲁棒性弱、实时性差、识别性能低等缺点。本文对卷积神经网络在异常声音识别任务中的适用性及其识别性能进行了研究,针对日常常见的6种不同异常声音样本,分析了不同声音特征的维度对卷积神经网络识别性能的影响,还将卷积神经网络分别与高斯混合模型、BP神经网络进行比较。实验结果表明,无噪声条件下,一维特征在卷积神经网络中的平均识别率比二维特征相对提升了2.91%,且误差收敛速度更快,但在有噪声条件下,二维特征的平均识别率比一维特征相对提升了3.41%。同时卷积神经网络比其他两种识别模型在对噪声的鲁棒性和误差收敛速度等方面均有明显的优势。Convolution neural networks （CNNs） have been widely used in the field of speech recognition to make up the deficiency of traditional acoustic models, such as weak robustness, poor real-time and low recognition performance. In this paper, the applicability and recognition performance of abnormal sound recognition based on CNNs were analyzed. Applied on 6 common abnormal sounds, we explored the dimension of sound signal features how to influence the performance of CNNs architecture, as reference methods, the Gaussian Mixture Model （GMM） and Back Propagation neural networks were employed to compare with CNNs algorithm. The experimental results reveled that 1D features produce higher error convergence rate and average accuracies with the relative increase of 2. 91% in the noiseless environment. Nevertheless in noisy context, 2D features perform better, the relative increase reaches 3.41%. Meanwhile, CNNs method has distinct advantage in the terms of noise robustness and error convergence speed over other two approaches.

关键词：卷积神经网络异常声音识别鲁棒性声音特征维度

分类号：TN912.34[电子电信—通信与信息系统]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

卷积神经网络在异常声音识别中的研究被引量：19

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

卷积神经网络在异常声音识别中的研究 被引量：19

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

卷积神经网络在异常声音识别中的研究被引量：19