基于次生特征提取方法预测蛋白质同源寡聚体  

Prediction of Protein Homo-Oligomer Types With a Novel Approach of Secondary Feature Extraction

在线阅读下载全文

作  者:李启鹏[1,2] 张绍武[1] 潘泉[1] 陈伟[1] 

机构地区:[1]西北工业大学自动化学院,西安710072 [2]西北工业大学机电学院,西安710072

出  处:《北京生物医学工程》2010年第1期16-22,共7页Beijing Biomedical Engineering

基  金:国家自然科学基金(60775012;60634030);西北工业大学科技创新项目(KC02)资助

摘  要:寡聚蛋白质相对于单体蛋白质具有许多优势,广泛地参与多种生命活动。本文提出次生特征提取方法,使用支持向量机作为分类器,采用"一对一"的多类分类策略,基于蛋白质一级序列提取特征方法,对四类同源寡聚体进行分类研究。结果表明,在Jackknife检验下,基于次生特征和氨基酸组成成分特征构成的特征集,加权情况下,其总分类精度最高达到了78.41%,比氨基酸组成成分特征提高13.09%,比参考文献最好特征集BG提高了6.86%,比最好原生特征集CM1提高了5.53%。此结果说明次生特征提取方法对于蛋白质同源寡聚体分类是一种非常有效的特征提取方法。Protein homo-oligomers play an important role in various life processes. The secondary feature extraction method was proposed and used for predicting protein homo-oligomers. Processing primary features by statistical methods to increase the distance among primary features, secondary feature can be obtained. The support vector machine ( SVM ) was used as base classifier. The 78.41% total accuracy was arrived in jackknife test in the weighted factor conditions, which was 13.09% ,6.86% and 5.53% higher than those of conventional amino acid composition methods, that of the reference feature set BG and that of the best primary feature set CM1 in same condition respectively. The experimental results showed that the secondary feature extraction method is effective to increase the distance among primary features and improved the classification prediction performance.

关 键 词:同源寡聚体 支持向量机 特征提取 原生特征 次生特征 

分 类 号:Q617[生物学—生物物理学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象