基于神经网络集成方法预测膜蛋白类型  

Prediction of Membrane Protein Types Based on Neural Network Ensemble Strategy

在线阅读下载全文

作  者:方海洋[1] 赵静[1] 汪益川[2] 宗福兴[3] 

机构地区:[1]后勤工程学院基础部,重庆401311 [2]后勤工程学院科研部,重庆401311 [3]后勤工程学院后勤信息与军事物流工程系,重庆401311

出  处:《后勤工程学院学报》2013年第5期64-70,共7页Journal of Logistical Engineering University

基  金:国家自然科学基金项目(10971227;81260672;81230090)

摘  要:由Chou等人提出的预测膜蛋白分类的机器学习算法在近年来不断改进,使得预测膜蛋白类型的准确率越来越高。但是由于膜蛋白类分布不均衡而导致少数类的预测准确率非常低,使用神经网络集成方法能解决此问题。该方法中Bagging算法通过对多数类欠采样和少数类过采样来解决膜蛋白训练数据集不均衡问题。此外,用神经网络集成方法对已训练数据集和独立数据集进行分类测试,得出神经网络集成方法预测效果优于单个最好神经网络。该方法为解决蛋白质分类预测问题提供了一种新的策略,特别是训练数据集不均衡时,该方法的优势更加明显。As a continuous effort to develop machine learning algorithms to predict membrane protein types that was initiated by Chou and Elrod, this study focuses on dealing with the problem of imbalanced training set of membrane protein types with the neural network ensemble. Bagging algorithm of the neural network ensemble has the advantage of dealing with imbalanced training set of membrane protein types by over-sampling minority classes and under-sampling majority classes. Furthermore, the perfor- mance of the neural network ensemble is found to be superior to the single best model from the results obtained through resubstitu-tion and independent dataset tests. The current approach represents a new strategy to deal with the problems of protein attribute pre-diction, especially when the training set is imbalanced, and hence is quite promising in the area of proteomies.

关 键 词:膜蛋白类型 神经网络集成 BAGGING 投票策略 

分 类 号:O29[理学—应用数学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象