基于图协同过滤的单细胞RNA测序数据填补  

Imputation of scRNA-seq Data Based on Graph Collaborative Filtering

在线阅读下载全文

作  者:李雪枫 

机构地区:[1]中国地质大学(武汉)数学与物理学院,湖北 武汉

出  处:《应用数学进展》2024年第4期1800-1809,共10页Advances in Applied Mathematics

摘  要:单细胞RNA测序(Single-cell RNA Sequencing, scRNA-seq)技术能以单细胞的分辨率分析转录组数据,在生物学研究中展现出广泛的应用前景。然而技术问题会导致scRNA-seq数据存在部分基因表达缺失的情况,称之为零膨胀事件。这种情况严重阻碍了下游分析,故需要对scRNA-seq数据进行填补。本文提出了一种基于图协同过滤的单细胞RNA测序数据填补算法,为scRNA-seq分析提供了一个深度学习框架。它通过结构邻居对比的图协同过滤方法提取细胞特征表示和基因特征表示,并将两者的内积应用于零膨胀负二项分布自编码器来填补scRNA-seq数据。仿真实验结果验证了该算法在仿真数据集上的填补能力,且通过下游聚类分析实验表明该算法在公共真实数据集上细胞聚类的性能。Single-cell RNA sequencing (scRNA-seq) technology can analyze transcriptome data at the single-cell level and is widely used in biology. However, technical issues can lead to missing gene expression in scRNA-seq data, which is called zero-inflation event. This situation seriously hinders downstream analysis, so it is necessary to impute the scRNA-seq data. This article proposes an imputation algorithm of scRNA-seq data based on graph collaborative filtering, providing a deep learning framework for scRNA-seq analysis. It extracts cell feature representations and gene feature representations through the graph collaborative filtering method of comparing structural neighbors, and applies the inner product of the two to the zero-inflated negative binomial distribution autoencoder to impute scRNA-seq data. The simulation experiment results have verified the imputation ability of the algorithm on the simulation dataset, and downstream clustering analysis experiments have shown the performance of the algorithm on cell clustering on public real datasets.

关 键 词:单细胞RNA测序 填补 图协同过滤 零膨胀负二项分布 

分 类 号:TP3[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象