Authors: 徐周波 (XU Zhou-bo) [1], 李萍 (LI Ping), 刘华东 (LIU Hua-dong) [1], 李珍 (LI Zhen) (Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin 541004, China)
Affiliation: [1] Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin 541004, Guangxi, China
Source: Computer Engineering & Science (《计算机工程与科学》), 2021, No. 6, pp. 1052-1059 (8 pages)
Funding: National Natural Science Foundation of China (61762027, U1501252); Guangxi Natural Science Foundation (2017GXNSFAA198172).
Abstract: Protein complexes are the basis for studying cell structure and biochemical mechanisms, and identifying them accurately has become an active research topic in recent years. Traditional algorithms that search for protein complexes using structural information suffer from low sensitivity and F-measure, while existing supervised learning algorithms rely on hand-crafted features that do not adequately reflect the true information in the graph. To address these problems, the graph2vec-SVM identification algorithm is proposed. The algorithm treats a protein complex as a dense subgraph and takes the modularity of the subgraph into account, uses graph2vec to convert graph information into vectors, and then applies an SVM classifier to identify protein complexes, which improves the sensitivity and F-measure of protein complex identification. Compared with four popular unsupervised learning algorithms (ClusterONE, CMC, HC-PIN and COACH) and three supervised learning algorithms (SCI-BN, SCI-SVM and RM), the algorithm shows good performance on three metrics: precision, sensitivity and F-measure.
Keywords: protein complex; graph2vec; SVM; protein interaction network
Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]