检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:孙杰[1,2] 吾守尔·斯拉木[1] 热依曼·吐尔逊[1] 张晶晶[1] SUN Jie;WUSHOUER Silamu;REYIMAN Turson;ZHANG Jingjing(College of Information Science and Engineering Xinjiang University,Urumqi 830046;Department of Physics,Changji University,Changji 831100)
机构地区:[1]新疆大学信息与工程学院,乌鲁木齐830046 [2]昌吉学院物理系,昌吉831100
出 处:《声学学报》2019年第6期1083-1092,共10页Acta Acustica
基 金:新疆维吾尔自治区重点实验室项目(2015KL013);973计划子课题(2014CB340506,213-61590);国家自然科学基金项目(61433012,U1435215,U1603262)资助
摘 要:根据语音识别和声纹识别等语音应用研究的实际需要,首次对和田方言的声学特性和识别进行研究。首先选取和田方言语音进行人工多层级标注,对元音的共振峰、时长和音强进行统计分析,描绘出和田方言主体格局及男性和女性的发音特点。然后运用方差分析和非参数分析法对维吾尔语3种方言的共振峰样本进行检验,结果表明3种方言的男性元音、女性元音及整体元音的共振峰分布模式存在显著差异。最后,分别构建基于GMM-UBM(Gaussian Mixture Model-Universal Background Model)、DNN-UBM(Deep Neural Networks-Universal Background Model)和LSTM-UBM(Long Short Term MemoryUniversal Background Model)维吾尔语方言识别模型,对基于梅尔频率倒谱系数及其与共振峰频率组合做输入特征提取的方言i-vector区分性进行对比实验。实验结果表明融入共振峰系数的组合特征可以增加方言的辨识度,且LSTM-UBM模型较GMM-UBM和DNN-UBM能提取到更具区分性的方言i-vector。According to meet the need of speech recognition and speaker recognition in Hotan area,we have completed the acoustic analysis and model of Hotan dialect for the first time.At first,by choosing and annotation the sentences of Hotan dialect,we conduct the acoustic statistical analysis of vowel formant frequency,duration and intensity.Based on that,the main pattern of Hotan dialect vowels,vowels pronunciation spoken by male and female are described.Then we have built the based GMM-UBM,DNN-UBM and LSTM-UBM dialect accent recognition model respectively,which based on to compare the formait patter,of three of Uygur dialects,and find some significalt differences betwee them.At last we compared the dialect discrimination of i-vectors between using MFCC coefficient with and without formant frequency as input feature respectively.It shows that the recognition rate using combination features is better than a single feature.
关 键 词:和田方言 维吾尔语 共振峰频率 声学分析 区分性 发音特点 声纹识别 语音识别
分 类 号:H215[语言文字—少数民族语言] TN912.3[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.15