检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:杨品 任振华[1] 袁增强 YANG Pin;REN Zhenhua;YUAN Zengqiang(School of Basic Medical Sciences,Anhui Medical University,Hefei,230032;The Brain Science Center,Beijing Institute of Basic Medical Sciences,Beijing,100850)
机构地区:[1]安徽医科大学基础医学院,合肥230032 [2]北京基础医学研究所脑科学中心,北京100850
出 处:《基因组学与应用生物学》2024年第7期1196-1213,共18页Genomics and Applied Biology
基 金:国家自然科学基金项目(81930029)资助。
摘 要:多模态组学数据的联合运用在揭示细胞异质性和解析细胞命运调控机制方面具有重要意义,目前已有多种方法被开发,用于处理不同组学模态数据的整合。本研究通过对应用于不同整合任务的多种数据整合方法进行性能评测,为相关领域的研究提供有益参考。使用6个联合测序数据集对16种单细胞多模态配对数据整合方法在2类整合任务上的性能进行测试,再通过4个模拟数据集和1个真实数据集对6种空间转录组反卷积方法的性能进行评估。在RNA和ATAC配对数据整合任务中,MOFA+、 SCOIT、 Cobolt分别在PBMC、 BMMC、 SNARE数据集上取得最优表现,SCOIT在3个数据集的汇总得分中均排名前3, MMDVAE、 DAE在基于AE的融合算法中表现突出。在RNA和蛋白质配对数据整合任务中,Cobolt、 MOFA+、 Seurat分别在P5_CITE、 BM_CITE、 COVID中取得最优表现,totalVI在3个数据集的汇总得分中排名靠前,基于AE的融合算法中以efMMDVAE、 lfMMDVAE的表现最好。在空间转录组反卷积方法评测中,Cell2location和SPACEL在模拟数据和真实数据中的性能表现均优于其他方法的,其中Cell2location在真实数据集中表现最佳,正确地推断了两类心肌细胞在心室的比例。此外,本研究发现在配对数据整合任务中,不同方法对数据的适应性不同。SCOIT和totalVI分别是RNA与ATAC、 RNA与蛋白质数据整合中表现稳定优异的方法。Seurat、 MOFA+易受数据影响。The joint application of multimodal omics data plays a significant role in revealing cellular heterogeneity and elucidating the mechanisms regulating cell fate.At present,a variety of methods have been developed for the integration of multi-omics modalities.This study conducted performance evaluations on several data integration methods applied to different integration tasks,providing a useful reference for research in related fields.Initially,the performance of 16 single-cell multi-modal paired data integration methods was tested on 6 joint sequencing datasets for 2 integration tasks.Subsequently,the performance of 6 spatial transcriptomic deconvolution methods was assessed using four simulated datasets and one real dataset.For RNA and ATAC paired integration task,MOFA+,SCOIT,and Cobolt each achieved optimal performance on PBMC,BMMC,and SNARE datasets respectively,with SCOIT ranking in the top three in the aggregate scores across all three datasets.MMDVAE and DAE are prominent among the AE-based fusion algorithms.In RNA and protein paired integration task,Cobolt,MOFA+,and Seurat respectively attained optimal performance on P5_CITE,BM_CITE,and COVID datasets,with totalVI ranking prominently in aggregate scores for all three datasets.Among the fusion algorithms based on AE,efMMDVAE and lfMMDVAE perform best.During the evaluation of spatial transcriptomic deconvolution methods,Cell2location and SPACEL outperformed other methods in both simulated and real datasets,with Cell2location demonstrating the best performance in the real dataset by accurately inferring the proportions of two types of cardiomyocytes in the ventricles.Moreover different methods exhibit varying adaptabilities to data in paired data integration tasks. SCOIT and totalVI respectively emerged asstable and excellent performers in RNA with ATAC and RNA with protein data integrations. Seurat and MOFA+ are sensitive to theinfluence of data.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.220.1.197