检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:张迪萌 李凤敏[1] ZHANG Dimeng;LI Fengmin(College of Science,Inner Mongolia Agricultural University,Hohhot O10018,China)
出 处:《内蒙古大学学报(自然科学版)》2024年第3期281-291,共11页Journal of Inner Mongolia University:Natural Science Edition
基 金:内蒙古自然科学基金项目(2019MS03015)。
摘 要:CRISPR-Cas系统是一种存在于细菌和古细菌中的获得性免疫系统,作为基因编辑工具在癌症治疗等方面被广泛研究,但CRISPR-Cas基因编辑技术存在基因脱靶效应。研究发现,Anti-CRISPR蛋白是一种可以调节CRISPR-Cas系统功能的蛋白质,在不破坏靶向基因编辑的情况下,可以减少脱靶效应等不良影响,从而提高基因编辑技术的效率和安全性。因此,研究Anti-CRISPR蛋白对于理解CRISPR-Cas系统的功能和细菌-病毒相互作用具有重要意义。本文构建了Anti-CRISPR蛋白数据集,提取氨基酸组分、氨基酸二肽组分、g-gap二肽组成、蛋白质二级结构、平均化学位移和蛋白质骨架6种特征参数,利用支持向量机对Anti-CRISPRs蛋白进行预测,在Jackknife检验下,单特征参数最高预测成功率为93.50%;对维度过高的氨基酸二肽组分和g-gap二肽组成分别进行降维处理,得到g-gap二肽组成在g=3、维数是121维时预测成功率最高,为95.10%,进一步研究发现有16种g-gap二肽组合对应11种氨基酸与Anti-CRISPR蛋白预测相关度较大;最后对特征参数进行融合,融合后最高预测成功率为96.07%。The CRISPR-Cas system is a natural immune system found in bacteria and archaea.It has been extensively studied as a gene editing tool in various fields,including cancer treatment.However,the CRISPR-Cas gene editing technology is associated with off-target effects.Based on research findings,Anti-CRISPR proteins are capable of modulating the functionality of the CRISPRCas system.These proteins can reduce off-target effects and other adverse impacts without compromising targeted gene editing,thereby improving the efficiency and safety of gene editing techniques.Therefore,studying Anti-CRISPR proteins is of significant importance for understanding the functionality of the CRISPR-Cas system and the bacterial-viral interactions.In this study,an Anti-CRISPR protein dataset was constructed,and six features,including amino acid composition,dipeptide composition,g-gap dipeptide composition,protein secondary structure,auto-covariance average chemical shift and protein blocks were extracted.Support vector machine(SVM)was employed for the prediction of Anti-CRISPR proteins.The highest accuracy of individual parameter is 93.50% with Jackknife test.Dimensionality reduction is performed on the high-dimensional dipeptide composition and g-gap dipeptide composition,and the highest accuracy of 95.10% is obtained when g is set to 3.Further research discovers that 16 g-gap dipeptide compositions correspond to 1l amino acids with high relevance to the prediction of Anti-CRISPR proteins.Finally,the highest accuracy of combined features is 96.07% with Jackknife test.
关 键 词:Anti-CRISPR蛋白 特征信息 降维 预测
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.49