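Affiliation: [1] College of Life Science and Bioengineering, Beijing University of Technology, Beijing 100124, China
Source: Chinese Journal of Bioinformatics (《生物信息学》), 2010, No. 1, pp. 1-6 (6 pages)
Funding: National Natural Science Foundation of China (30570427); Beijing Natural Science Foundation (4063035)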
Abstract: Protein fold recognition is an important part of protein structure research. The Rossmann fold (doubly wound fold) is a common, structurally typical fold among α/β proteins. A training set of 79 representative Rossmann-fold proteins, drawn from 22 families and sharing less than 25% sequence identity with each other, was grouped by hierarchical clustering using RMSD as the measure, and a profile hidden Markov model (profile-HMM) based on structural alignment was built for each cluster. On a test set of 9,505 proteins from Astral 1.65 with less than 95% sequence identity, the overall sensitivity was 93.9%, the specificity was 82.1%, and the Matthews correlation coefficient (MCC) was 0.876. The results show that for a fold type with too many members to be covered by a single unified model, building a separate model for each cluster can achieve high recognition accuracy.