检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:宗晓萍[1] 陶泽泽 ZONG Xiaoping;TAO Zeze(College of Electronic Information Engineering,Hebei University,Baoding 071002,China)
机构地区:[1]河北大学电子信息工程学院,河北保定071002
出 处:《河北大学学报(自然科学版)》2020年第2期193-199,共7页Journal of Hebei University(Natural Science Edition)
基 金:河北省高等教育教学改革研究与实践项目(2016GJJG016)。
摘 要:随着大数据在教育中的作用日益凸显,大量的数据被应用到教学研究、教学评估和行为预测.学生的成绩、行为记录、与老师的互动记录等教育数据,都已经开始发挥价值.为了解决课程的低通过率问题,将改进的K-近邻算法应用到学习预警中,首先利用网格搜索和交叉验证相结合的方法对模型参数进行优选,其次在构建决策树过程中,利用基尼增益确定特征的权重系数并且根据权重系数进行特征选择,在计算距离时引入权重系数,使每个特征收到权重系数的约束.实验表明,在一个公开的数据集和一个真实的数据集上,改进后的K-近邻算法显著优于传统的K-NN.With the increasing role of big data in education,a large amount of data is applied to teaching research,teaching evaluation and behavior prediction.Education data,such as students’grades,behavioral records,and interaction with teachers,have begun to show their value.In order to solve the problem of low pass rate in the course,improved K-nearest neighbor algorithm is applied to study the early warning.The grid search and cross validation method of combining the parameter optimization of the model was used first.Second in the process of constructing a decision tree,the Gini gain is used to determine the characteristics of the weight coefficient and according to the weight coefficient of feature selection,weight coefficient was introduced when calculating the distance,enables each feature received weight coefficient constraint.Experiments show that the improved K-nearest neighbor algorithm is significantly better than the traditional K-NN algorithm in both a public data set and a real data set.
关 键 词:教育数据挖掘 网格搜索 K-近邻 交叉验证 基尼增益
分 类 号:TP399[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.186