检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:Bingxin Zhou Yang Tan Yutong Hu Lirong Zheng Bozitao Zhong Liang Hong
机构地区:[1]Institute of Natural Sciences,Shanghai Jiao Tong University,Shanghai,China [2]Shanghai National Center for Applied Mathematics(SJTU center),Shanghai Jiao Tong University,Shanghai,China [3]School of Information Science and Engineering,East China University of Science and Technology,Shanghai,China [4]Shanghai Artificial Intelligence Laboratory,Shanghai,China [5]School of Electronic Information and Electrical Engineering,Shanghai Jiao Tong University,Shanghai,China [6]Department of Cell and Developmental Biology,University of Michigan Medical School,Ann Arbor,Michigan,USA [7]School of Physics and Astronomy,Shanghai Jiao Tong University,Shanghai,China [8]Zhangjiang Institute for Advanced Study,Shanghai Jiao Tong University,Shanghai,China
出 处:《mLife》2024年第4期477-491,共15页微生物(英文)
基 金:supported by the National Natural Science Foundation of China(11974239 and 62302291);Innovation Program of Shanghai Municipal Education Commission(2019‐01‐07‐00‐02‐E00076);Shanghai Jiao Tong University Scientific and Technological Innovation Funds(21×010200843)。
摘 要:Advances in deep learning have significantly aided protein engineering in addressing challenges in industrial production,healthcare,and environmental sustainability.This review frames frequently researched problems in protein understanding and engineering from the perspective of deep learning.It provides a thorough discussion of representation methods for protein sequences and structures,along with general encoding pipelines that support both pre‐training and supervised learning tasks.We summarize state‐of‐the‐art protein language models,geometric deep learning techniques,and the combination of distinct approaches to learning from multi‐modal biological data.Additionally,we outline common downstream tasks and relevant benchmark datasets for training and evaluating deep learning models,focusing on satisfying the particular needs of protein engineering applications,such as identifying mutation sites and predicting properties for candidates'virtual screening.This review offers biologists the latest tools for assisting their engineering projects while providing a clear and comprehensive guide for computer scientists to develop more powerful solutions by standardizing problem formulation and consolidating data resources.Future research can foresee a deeper integration of the communities of biology and computer science,unleashing the full potential of deep learning in protein engineering and driving new scientific breakthroughs.
关 键 词:artificial intelligence geometric deep learning protein engineering protein language model synthetic biology
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.26