Dialect Identification Model Based on Feature Selection

Authors: AI Hu (艾虎); LI Fei (李菲)

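Affiliations: [1] Department of Criminal Technology, Guizhou Police College, Guiyang 550005, China; [2] College of Foreign Languages, Guizhou Normal University, Guiyang 550025, China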

Source: Information Technology (《信息技术》), 2024, Issue 10, pp. 102-110, 119 (10 pages)

Funding: Innovation Group Project of the Guizhou Provincial Department of Education (黔教合KY字[2021]023).

Abstract: This study aims to select the smallest set of relevant feature variables from speech samples while still allowing a Random Forest (RF) based identification model for Guizhou Chinese dialects to reach the required accuracy. Using Python 3.6, the Random Forest based Sort Difference Backward Elimination (SDBE) algorithm was applied to select features from Chinese dialect speech samples collected from three city-county groups in Guizhou Province. SDBE was then compared with other advanced feature selection methods, and finally the Random Forest classification model was improved. The results show that SDBE selected the eight most relevant Mel-frequency cepstral coefficients (MFCCs) from 39 feature variables, significantly outperforming the feature selection methods it was compared with. The improved Random Forest model reached a classification accuracy of 96.64%. Together, the SDBE algorithm and the improved Random Forest model significantly improved the performance of the dialect identification model.
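The abstract does not spell out the steps of SDBE, so the following is only a minimal sketch of generic importance-ranked backward elimination, not the authors' algorithm. The function name `backward_eliminate`, the callables `rank_fn` and `score_fn`, the threshold `min_score`, and the toy weights are all hypothetical and chosen for illustration; no real speech data or published results are involved.

```python
from typing import Callable, List, Sequence, Tuple


def backward_eliminate(
    features: Sequence[str],
    rank_fn: Callable[[List[str]], List[float]],
    score_fn: Callable[[List[str]], float],
    min_score: float,
) -> Tuple[List[str], float]:
    """Repeatedly drop the least important feature while the score
    of the remaining subset stays at or above min_score.

    Generic backward elimination only; this is NOT the SDBE
    algorithm from the paper, whose details the abstract omits.
    """
    current = list(features)
    best_score = score_fn(current)
    if best_score < min_score:
        raise ValueError("full feature set does not reach min_score")

    while len(current) > 1:
        # Importance of each feature in the current subset.
        importances = rank_fn(current)
        weakest = min(range(len(current)), key=importances.__getitem__)
        candidate = current[:weakest] + current[weakest + 1:]

        # Stop as soon as removing the weakest feature costs too much.
        score = score_fn(candidate)
        if score < min_score:
            break
        current, best_score = candidate, score

    return current, best_score


# Toy stand-ins (hypothetical): fixed weights play the role of both
# the importance ranking and the model score.
TOY_WEIGHTS = {
    "mfcc_1": 0.40,
    "mfcc_2": 0.30,
    "mfcc_3": 0.20,
    "mfcc_4": 0.06,
    "mfcc_5": 0.04,
}


def toy_rank(subset: List[str]) -> List[float]:
    return [TOY_WEIGHTS[name] for name in subset]


def toy_score(subset: List[str]) -> float:
    return sum(TOY_WEIGHTS[name] for name in subset)


selected, score = backward_eliminate(
    list(TOY_WEIGHTS), toy_rank, toy_score, min_score=0.85
)
print(selected, round(score, 2))
```

In a real experiment, one plausible choice (an assumption, not something the abstract states) would be for `rank_fn` to return the `feature_importances_` of a scikit-learn `RandomForestClassifier` fitted on the current feature columns, and for `score_fn` to return a cross-validated accuracy for that subset.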

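Keywords: Chinese dialect identification; Mel-frequency cepstral coefficients; feature selection; random forest; backward elimination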

Classification code: TP391.42 [Automation and Computer Technology: Computer Application Technology]
