Improved K-nearest neighbor algorithm and its application in learning and warning  Cited by: 4


Authors: ZONG Xiaoping[1], TAO Zeze (College of Electronic Information Engineering, Hebei University, Baoding 071002, China)

Affiliation: [1] College of Electronic Information Engineering, Hebei University, Baoding 071002, Hebei, China

Source: Journal of Hebei University (Natural Science Edition), 2020, No. 2, pp. 193-199 (7 pages)

Funding: Hebei Province Higher Education Teaching Reform Research and Practice Project (2016GJJG016).

Abstract: As big data plays an increasingly prominent role in education, large volumes of data are being applied to teaching research, teaching evaluation, and behavior prediction. Educational data such as students' grades, behavioral records, and records of interaction with teachers have begun to show their value. To address the problem of low course pass rates, an improved K-nearest neighbor (K-NN) algorithm is applied to learning early warning. First, grid search combined with cross-validation is used to select the model parameters. Second, during decision tree construction, the Gini gain is used to determine a weight coefficient for each feature, and features are selected according to these weights. The weight coefficients are then introduced into the distance calculation, so that each feature is constrained by its weight. Experiments on one public dataset and one real-world dataset show that the improved K-NN algorithm significantly outperforms the traditional K-NN algorithm.
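The pipeline outlined in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation. It assumes scikit-learn, uses the library's bundled breast-cancer dataset as a stand-in for the paper's datasets (which are not given here), takes the Gini importances of a fitted decision tree as the feature weights, keeps only features with nonzero weight, and uses a weighted Euclidean distance d(x, y) = sqrt(sum_j w_j (x_j - y_j)^2). The threshold, the range of K, and the 5-fold setting are illustrative choices.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Stand-in data (assumption): not the datasets used in the paper.
X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Step 1: fit a Gini-criterion decision tree and read off its normalized
# Gini importances, used here as the per-feature weight coefficients.
tree = DecisionTreeClassifier(criterion="gini", random_state=0)
tree.fit(X_train, y_train)
weights = tree.feature_importances_

# Step 2: feature selection by weight. Dropping zero-weight features is an
# illustrative threshold, not a value taken from the paper.
keep = weights > 0

# Step 3: standardize, then scale each feature by sqrt(w_j). Plain Euclidean
# distance on the scaled data equals sqrt(sum_j w_j * (x_j - y_j)^2).
# For brevity the scaler is fitted once on the whole training split rather
# than inside each cross-validation fold.
scaler = StandardScaler().fit(X_train[:, keep])
scale = np.sqrt(weights[keep])


def to_weighted_space(data):
    """Select, standardize, and weight the features of `data`."""
    return scaler.transform(data[:, keep]) * scale


# Step 4: grid search over K with 5-fold cross-validation.
search = GridSearchCV(
    KNeighborsClassifier(),
    param_grid={"n_neighbors": list(range(1, 16))},
    cv=5,
)
search.fit(to_weighted_space(X_train), y_train)

best_k = search.best_params_["n_neighbors"]
test_accuracy = search.score(to_weighted_space(X_test), y_test)
print(f"best K = {best_k}, test accuracy = {test_accuracy:.3f}")
```

Scaling the inputs by the square root of each weight keeps the standard `KNeighborsClassifier` usable without a custom metric. A baseline for comparison would be the same grid search run on the standardized but unweighted features.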

Keywords: educational data mining; grid search; K-nearest neighbor; cross-validation; Gini gain

Classification: TP399 [Automation and Computer Technology: Computer Application Technology]

 
