检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:Mateusz Chilinski Anup Kumar Halder Dariusz Plewczynski
机构地区:[1]Faculty of Mathematics and Information Sciences,Warsaw University of Technology,00-662 Warsaw,Poland [2]Laboratory of Functional and Structural Genomics Centre of New Technologies University of Warsaw,02-097 Warsaw,Poland
出 处:《Quantitative Biology》2023年第2期155-162,共8页定量生物学(英文版)
基 金:supported by National Science Centre,Poland(Nos.2019/35/O/ST6/02484 and 2020/37/B/NZ2/03757);Foundation for Polish Science,co-financed by the European Union under the European Regional Development Fund(TEAM to DP);The work has been co-supported by European Commission Horizon 2020 Marie Skodowska-Curie ITN Enhpathy grant“Molecular Basis of Human enhanceropathies”and National Institute of Health USA 4DNucleome grant 1U54DK107967-01“Nucleome Positioning System for Spatiotemporal Genome Organization and Regulation”:Research was co-funded by Warsaw University of Technology within the Excellence Initiative:Research University(IDUB)programme.Computations were performed thanks to the Laboratory of Bioinformatics and Computational Genomics,Faculty of Mathematics and Information Science,Warsaw University of Technology using the Artificial Intelligence HPC platform financed by Polish Ministry of Science and Higher Education(No.7054/IA/SP/2020 of 2020-08-28).
摘 要:Background:With the development of rapid and cheap sequencing techniques,the cost of whole-genome sequencing(WGS)has dropped significantly.However,the complexity of the human genome is not limited to the pure sequenceand additional experiments are required to learn the human genome's influence on complex traits.One of the most exciting aspects for scientists nowadays is the spatial organisation of the genome,which can be discovered using spatial experiments(e.g.,Hi-C,ChIA-PET).The information about the spatial contacts helps in the analysis and brings new insights into our understanding of the disease developments.Methods:We have used an ensemble of deep learning with classical machine learning algorithms.The deep learning network we used was DNABERT,which utilises the BERT language model(based on transformers)for the genomic function.The classical machine learning models included support vector machines(SVMs),random forests(RFs),and K-nearest neighbor(KNN).The whole approach was wrapped together as deep hybrid learning(DHL).Results:We found that the DNABERT can be used to predict the ChIA-PET experiments with high precision.Additionally,the DHL approach has increased the metrics on CTCF and RNAPII sets.Conclusions:DHL approach should be taken into consideration for the models utilising the power of deep learning.While straightforward in the concept,it can improve the results significantly.
关 键 词:deep learning 3D genomics transformers spatial organisation of nucleus ChIA-PET DNA-Seq
分 类 号:TP181[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.12.34.36