检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:林云[1] 徐怀韬 王森 张思成 庄龙 LIN Yun;XU Huaitao;WANG Sen;ZHANG Sicheng;ZHUANG Long(College of Information and Communication Engineering,Harbin Engineering University,Harbin 150001,China;School of Integrated Circuits,Anhui University,Hefei 230039,China)
机构地区:[1]哈尔滨工程大学信息与通信工程学院,黑龙江哈尔滨150001 [2]安徽大学集成电路学院,安徽合肥230039
出 处:《通信学报》2023年第3期105-116,共12页Journal on Communications
基 金:国家自然科学基金资助项目(No.62201172);中央高校基本科研业务费专项资金资助项目(No.3072022CF0804,No.3072022CF0601)。
摘 要:针对通信语音干扰效果客观评估问题,提出了基于多测度与多模态融合的2种评估方法。首先,利用端点检测算法以及动态时间弯折算法对受扰语音数据进行预处理。然后,提取数据中的语音内容并与标准语音进行测度计算得到5种测度,将5种测度融合后利用随机森林模型进行质量等级评估。最后,结合多模态融合技术,设计了基于残差结构的神经网络模型,融合受扰语音数据的图域、测度域特征并进行质量等级评估。实验结果表明,2种方法的评估准确率均达到了90%以上。其中,多模态评估方法与现有的研究方法相比,准确率提升了约3.269%,证明所提方法具有更优的性能。In view of the objective assessment problem of the effect of communication speech interference,methods based on multi-measurements and multimodal fusion were proposed.First,the interfered speech was preprocessed by the endpoint detection algorithm and time warping algorithm.Then,the content of speech was extracted and performed measurement calculated with the standard speech to obtain five kinds of measure.After the fusion of five measures,random forest model was used to assessed the quality level.Finally,a neural network model based on residual structure was designed combined multimodal fusion technique,which fused the graph domain and measure domain features of the interfered speech data and performed quality level assessment.Experimental results show that the accuracy of two methods have reached more than 90%.Among them,the multimodal assessment method improves the accuracy by about 3.269%compared with the existing research methods,which proves that it has a better performance.
关 键 词:语音质量评估 语音信号处理 多模态融合 深度神经网络
分 类 号:TN912.3[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.7