Authors: LIN Bingqing (林炳清); ZHUANG Zefan (庄泽帆) (School of Mathematics and Statistics, Shenzhen University, Shenzhen, 518060, China)
Source: Chinese Journal of Applied Probability and Statistics (《应用概率统计》), 2023, No. 6, pp. 813-831 (19 pages)
Abstract: With the widespread use of next-generation sequencing technology, single-cell RNA data has gradually become a mainstream object of research. However, obtaining single-cell RNA data directly from organisms is often costly, so obtaining such data simply and quickly is an important problem. To meet the needs of comparative experiments, a simulation method for single-cell RNA data usually has to do more than produce simulated data whose statistics are close to those of the original data: it must also retain the genes and cell samples of the original data in the simulated data. Here we introduce a data-based simulation method. While retaining the genes and cell samples of the original data, it simulates single-cell RNA data at low cost and keeps the simulation results similar to the original data in most characteristics. Extensive numerical experiments show that the proposed method outperforms other simulation methods in terms of the dispersion of gene expression, the proportion of zero expression, and expression outliers, and that its output is closer to real data.
Keywords: bioinformatics; single-cell RNA data; nonparametric method; simulated data
Classification number: O212.7 [Science: Probability Theory and Mathematical Statistics]