检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:陈少华 胡秀珍[1] 胡慧敏 姚雨倩 CHEN Shaohua;HU Xiuzhen;HU Huimin;YAO Yuqian(School of Sciences,Inner Mongolia University of Technology,Hohhot O10051,China)
出 处:《内蒙古大学学报(自然科学版)》2024年第2期183-192,共10页Journal of Inner Mongolia University:Natural Science Edition
基 金:国家自然科学基金项目(61961032)。
摘 要:SO_(4)^(2-)和PO_(4)^(3-)配体与蛋白质相结合在生命活动中起着重要的作用,因此,准确预测蛋白质-酸根离子配体结合残基具有重要意义。前人对酸根离子配体结合位点的研究多数是在片段水平上进行的,而极少考虑单残基水平,这可能导致信息的缺失。因此,在片段和单残基水平两个方面提取特征,可以避免信息丢失。在片段水平上使用前人对氨基酸、二级结构、相对溶剂可及性和亲疏水提取的组分信息和位点保守信息作为基础特征,在此基础上引入了单残基水平上的氨基酸、氨基酸的酸碱极性、能量及亲疏水的倾向性因子;结合残基左右残基对信息和9个正交因子为新的特征,使用欠采样和随机森林相融合的算法(U-RF)进行五交叉检验和独立检验,得到了好于前人的预测结果。The SO_(4)^(2-) and PO_(4)^(3-) play crucial roles in binding with proteins,making accurate prediction of protein-anion binding residues essential.Previous research on anion binding sites has primarily focused on the fragment level,while neglected the single residue level.This potentially leads to information loss.Therefore,features were extracted at both fragment and single residue levels to avoid information loss.At the fragment level,this study utilized amino acid,secondary structure,relative solvent accessibility,and hydrophilic-hydrophobic extracted from previous studies as foundational features,and introduced single residue-level amino acid,amino acid acid-base polarity,energy,and hydrophilic-hydrophobic propensity factors,Combining neighboring residue pair information and 9 orthogonal factors as new features.An algorithm(U-RF) combining undersampling and random forest was performed,and a promising prediction result was verified by five-fold cross-validation and independent testing.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.49