方差正则化的分类模型选择准则  被引量:1

Variance-Regularized Classification Model Selection Criterion

在线阅读下载全文

作  者:房立超 王钰 杨杏丽[1] 李济洪 FANG Lichao;WANG Yu;YANG Xingli;LI Jihong(School of Mathematical Sciences,Shanxi University,Taiyuan 030006,China;School of Modern Educational Technology,Shanxi University,Taiyuan 030006,China;School of Software,Shanxi University,Taiyuan 030006,China)

机构地区:[1]山西大学数学科学学院,太原030006 [2]山西大学现代教育技术学院,太原030006 [3]山西大学软件学院,太原030006

出  处:《计算机科学与探索》2019年第3期457-467,共11页Journal of Frontiers of Computer Science and Technology

基  金:国家自然科学基金61503228;山西省自然科学基金201601D011046;山西省回国留学人员科研资助项目2015-014~~

摘  要:在传统的机器学习中,模型选择常常是直接基于某个性能度量指标的估计本身进行,没有考虑估计的方差,但是这样的忽略极有可能导致错误模型的选择。于是考虑在分类模型选择研究中添加方差的信息的方法,以提高所选模型的泛化能力,即将泛化误差性能度量指标的组块3×2交叉验证估计的方差估计作为正则化项添加到传统模型选择准则中,提出了一种新的方差正则化的分类模型选择准则。模拟和真实数据实验验证了在分类模型选择问题中,提出的模型选择准则相比传统方法选到正确分类模型的概率更大,验证了方差在模型选择中的重要性以及提出的模型选择准则的有效性。进一步,理论上证明了在二分类问题的模型选择中,该模型选择准则具有选择的一致性。In traditional machine learning,model selection is always directly performed based on the estimation of one performance measure index,without considering the variance of the estimation.However,this neglection may probably lead to the selection of a wrong model.Therefore,a method of adding the information of variance into the study of classification model selection is considered in order to improve the generalization ability of the selected model,that is,the variance estimation of the block 3×2 cross-validation estimation of the generalization error is added as a regularization term into the traditional model selection criterion,and a new variance-regularized classification model selection criterion is proposed.The simulated and real data experiments show that the proposed model selection criterion has a higher probability to select the correct classification model in the classification model selection problem compared to the traditional methods.The importance of variance in model selection and the effectiveness of the proposed model selection criteria are also validated.Furthermore,the consistency in selection of the proposed criterions is theoretically proven in the model selection task of two-class classification problem.

关 键 词:模型选择 泛化误差 组块3×2交叉验证 方差正则化 

分 类 号:TP181[自动化与计算机技术—控制理论与控制工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象