机构地区:[1]State Key Laboratory of Stem Cell and Reproductive Biology,Institute of Zoology,Chinese Academy of Sciences,Beijing,China [2]Beijing Key Laboratory of Mobile Computing and Pervasive Device,Institute of Computing Technology,Chinese Academy of Sciences,Beijing,China [3]University of Chinese Academy of Sciences,Beijing,China [4]State Key Laboratory of Multimodal Artificial Intelligence Systems,Institute of Automation,Chinese Academy of Sciences,Beijing,China [5]School of Artificial Intelligence,University of Chinese Academy of Sciences,Beijing,China [6]Institute for Stem Cell and Regenerative Medicine,Chinese Academy of Sciences,Beijing,China [7]Beijing Institute for Stem Cell and Regenerative Medicine,Beijing,China [8]Research Center for Ubiquitous Computing Systems,Institute of Computing Technology,Chinese Academy of Sciences,Beijing,China [9]Computer Network Information Center,Chinese Academy of Sciences,Beijing,China [10]Institute of Automation,Chinese Academy of Sciences,Beijing,China [11]CEMS,NCMIS,HCMS,MDIS,RCSDS,Academy of Mathematics and Systems Science,Chinese Academy of Sciences,Beijing,China [12]Institute of Zoology,Chinese Academy of Sciences,Beijing,China [13]Institute of Computing Technology,Chinese Academy of Sciences,Beijing,China [14]Academy of Mathematics and Systems Science,Chinese Academy of Sciences,Beijing,China
出 处:《Cell Research》2024年第12期830-845,共16页细胞研究(英文版)
基 金:This work was also supported by CAS Project for Young Scientists in Basic Research(YSBR-076 and YSBR-034);the National Natural Science Foundation of China(31971289,32341013,91954201,62202455,and 32341019);the Informatization Plan of Chinese Academy of Sciences(CAS-WX2021SF-0101).
摘 要:Deciphering universal gene regulatory mechanisms in diverse organisms holds great potential for advancing our knowledge of fundamental life processes and facilitating clinical applications.However,the traditional research paradigm primarily focuses on individual model organisms and does not integrate various cell types across species.Recent breakthroughs in single-cell sequencing and deep learning techniques present an unprecedented opportunity to address this challenge.In this study,we built an extensive dataset of over 120 million human and mouse single-cell transcriptomes.After data preprocessing,we obtained 101,768,420 single-cell transcriptomes and developed a knowledge-informed cross-species foundation model,named GeneCompass.During pre-training,GeneCompass effectively integrated four types of prior biological knowledge to enhance our understanding of gene regulatory mechanisms in a self-supervised manner.By fine-tuning for multiple downstream tasks,GeneCompass outperformed state-of-the-art models in diverse applications for a single species and unlocked new realms of cross-species biological investigations.We also employed GeneCompass to search for key factors associated with cell fate transition and showed that the predicted candidate genes could successfully induce the differentiation of human embryonic stem cells into the gonadal fate.Overall,GeneCompass demonstrates the advantages of using artificial intelligence technology to decipher universal gene regulatory mechanisms and shows tremendous potential for accelerating the discovery of critical cell fate regulators and candidate drug targets.
关 键 词:KNOWLEDGE MECHANISMS FOUNDATION
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...