检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:向妍[1] 陈渊[1] 谭泗桥[2] 袁哲明[1,3]
机构地区:[1]湖南农业大学植物病虫害生物学与防控湖南省重点实验室,长沙410128 [2]湖南农业大学信息科学技术学院,长沙410128 [3]湖南农业大学,湖南省作物种质创新与资源利用重点实验室,长沙410128
出 处:《生物化学与生物物理进展》2016年第7期691-698,共8页Progress In Biochemistry and Biophysics
基 金:高等学校博士学科点专项科研基金(20124320110002);湖南省自然科学基金(14JJ2082);长沙市科技计划项目(K1406018-21)资助
摘 要:糖基化是蛋白质翻译后的主要修饰,O-糖基化的固定模式未知,高精度识别O-糖基化位点是机器学习面临的挑战性问题.以迄今最大的人O-糖基化位点Steentoft数据集为基础,本文首次提出了基于位置的卡方差表特征χ^2pos,融合伪氨基酸序列进化信息Pse PSSM以及无方向的k间隔氨基酸对组分Undirected-CKSAAP表征序列,构建5个正负样本均衡的支持向量机分类器,经加权投票,独立测试准确率、Matthew相关系数及ROC曲线下面积,分别达到了89.62%、0.79、0.96,明显优于文献报道结果.χ^2pos、Pse PSSM与Undirected-CKSAAP三种特征的融合在蛋白质糖基化、磷酸化等位点预测中有广泛应用前景.Glycosylation is a major modification process in post-translational modification of protein.Accurate prediction of O-linked glycosylation sites is a big challenging faced by machine-learning,for the fixed-model of O-linked glycosylation is not yet known.In this paper,on the basis of the largest-ever Steentoft database up to now,a new feature——chi-square score difference table method based on position(χ^2-pos) was first proposed,which combined with pseudo position-specific scoring matrix(Pse PSSM) and undirected composition of k-spaced amino acid pairs(Undirected-CKSAAP) were used to present protein sequences.Then 5 support vector machines models were constructed with the same proportion of positive and negative samples.At last,by weighted voting,our results showed that the prediction accuracy,Matthew's correlation coefficient and area under ROC curve reached89.62%,0.79 and 0.96 respectively.They were superior to the literature report.It also demonstrated that the combination of three different features χ^2-pos,Pse PSSM and Undirected-CKSAAP has extensive application prospect in protein sites prediction such as glycosylation and phosphorylation.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.15