检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:胡涛 张超[1] 程炳 吴小培[1] HU Tao ,ZHANG Chao ,CHENG Bing ,WU Xiao-pei(Key Lab of Intelligent Computing and Signal Processing, Anhui University, Hefei, Anhui 230039, Chin)
机构地区:[1]安徽大学计算智能与信号处理教育部重点实验室,安徽合肥230039
出 处:《信号处理》2018年第3期357-367,共11页Journal of Signal Processing
基 金:安徽省科技攻关专项(1501b042205)
摘 要:卷积神经网络(CNNs)已广泛应用于语音识别领域中以改善传统声学模型存在的鲁棒性弱、实时性差、识别性能低等缺点。本文对卷积神经网络在异常声音识别任务中的适用性及其识别性能进行了研究,针对日常常见的6种不同异常声音样本,分析了不同声音特征的维度对卷积神经网络识别性能的影响,还将卷积神经网络分别与高斯混合模型、BP神经网络进行比较。实验结果表明,无噪声条件下,一维特征在卷积神经网络中的平均识别率比二维特征相对提升了2.91%,且误差收敛速度更快,但在有噪声条件下,二维特征的平均识别率比一维特征相对提升了3.41%。同时卷积神经网络比其他两种识别模型在对噪声的鲁棒性和误差收敛速度等方面均有明显的优势。Convolution neural networks (CNNs) have been widely used in the field of speech recognition to make up the deficiency of traditional acoustic models, such as weak robustness, poor real-time and low recognition performance. In this paper, the applicability and recognition performance of abnormal sound recognition based on CNNs were analyzed. Applied on 6 common abnormal sounds, we explored the dimension of sound signal features how to influence the performance of CNNs architecture, as reference methods, the Gaussian Mixture Model (GMM) and Back Propagation neural networks were employed to compare with CNNs algorithm. The experimental results reveled that 1D features produce higher error convergence rate and average accuracies with the relative increase of 2. 91% in the noiseless environment. Nevertheless in noisy context, 2D features perform better, the relative increase reaches 3.41%. Meanwhile, CNNs method has distinct advantage in the terms of noise robustness and error convergence speed over other two approaches.
关 键 词:卷积神经网络 异常声音识别 鲁棒性 声音特征维度
分 类 号:TN912.34[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.185