检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]华侨大学工业生物技术研究所,福建泉州362021
出 处:《生物工程学报》2006年第2期293-298,共6页Chinese Journal of Biotechnology
基 金:国家自然科学基金资助项目(No.20276026);国务院侨办科研基金资助项目(No.05Q0018)。~~
摘 要:通过分析3216条嗜热蛋白和4007条常温蛋白的二肽组成,结果发现,在嗜热蛋白中存在更多EE,EK,KE,VE,EI,KI,EV,KK,VK和IE等二肽,更少AA,LL,LA,AL,QA,QL,AQ,LT,TL和EQ等二肽。在此基础上发展了一种识别嗜热和常温蛋白的统计学方法,通过对两组共853个蛋白序列进行识别,该方法识别平均正确率分别可达89.0%和89.6%。同时探讨了一些特定二肽对识别效果的影响。In this work, the dipeptide composition of 3216 thermophilic and 4007 mesophilic protein sequences was systematically analyzed. We found that the thermophilic proteins contained more dipeptides such as EE, EK, KE, VE, EI, KI, EV, KK,VK and IE, whereas less dipeptides such as AA,LL,LA,AL, QA,QL, AQ,LT,TL and EQ. Based on this information, a statistical method for discriminating thermophilic and mesophilic proteins was developed. Our approach correctly picked up the thermophilic proteins with the accuracy of 94.0% and 89%, respectively, for the testing sets of 382 and 73 thermophilic proteins. And for the testing 325 and 73 mesophilic proteins, the accuracy was 85.2 % and 89 %, respectively. The influence of specific dipeptides on discrimination was also discussed.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.150