Authors: ZHANG Qian[1], SUN Yijia, BAI Lin[2,3], LI Taoshen
Affiliations: [1] The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi 530021, China; [2] School of Computer, Electronics and Information, Guangxi University, Nanning, Guangxi 530004, China; [3] Guangxi Colleges and Universities Key Laboratory of Parallel and Distributed Computing Technology, Nanning, Guangxi 530004, China
Source: Guangxi Sciences (《广西科学》), 2019, No. 3, pp. 283-290 (8 pages)
Funding: Supported by the Guangxi Natural Science Foundation (2018GXNSFAA138085)
Abstract: Detecting the homologs of a protein from its amino acid chain, and from them predicting the protein's function, is a major challenge in bioinformatics and a foundation for much biomedical research, with significant scientific value and broad application demand. The difficulties are (1) how to learn protein feature information that is effective and useful for homologous protein prediction, and (2) how to make better use of that feature information to detect and recognize homologous proteins. To address these difficulties, this paper proposes a homologous protein detection and recognition model based on a hybrid deep learning architecture (HDLMPHP). A unified "pipelined" deep learning architecture combines protein feature learning with detection and recognition into a single whole, improving the effectiveness of homologous protein detection and recognition. Multiple parallel deep convolutional neural networks learn the various attributes of proteins in order to obtain rich high-level correlation features between the protein under test and the target protein; a fully connected multilayer RBM structure then fuses and refines these into global correlation features. Through this unified deep network connection, guided by the detection and recognition task, the model learns the most effective and comprehensive protein feature information for homologous protein prediction. The model's performance and efficiency were evaluated on the standard dataset SCOPe. The results show that the proposed model effectively learns task-oriented protein feature data, improves the accuracy and recall of homologous protein detection and recognition, and outperforms existing models and algorithms.
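Keywords: hybrid deep learning; homologous protein; deep convolutional neural network; protein feature extraction; deep learning model; machine learning algorithm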
Classification: TP391 [Automation and Computer Technology: Computer Application Technology]