ZINBMM: a general mixture model for simultaneous clustering and gene selection using single-cell transcriptomic data

Abstract Clustering is a critical component of single-cell RNA sequencing (scRNA-seq) data analysis and can help reveal cell types and infer cell lineages. Despite considerable successes, there are few methods tailored to investigating cluster-specific genes contributing to cell heterogeneity, which...

Full description

Saved in:
Bibliographic Details
Published in:Genome Biology Vol. 24; no. 1; pp. 1 - 28
Main Authors: Li, Yang, Wu, Mingcong, Ma, Shuangge, Wu, Mengyun
Format: Journal Article
Language:English
Published: London BioMed Central 11-09-2023
BMC
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Clustering is a critical component of single-cell RNA sequencing (scRNA-seq) data analysis and can help reveal cell types and infer cell lineages. Despite considerable successes, there are few methods tailored to investigating cluster-specific genes contributing to cell heterogeneity, which can promote biological understanding of cell heterogeneity. In this study, we propose a zero-inflated negative binomial mixture model (ZINBMM) that simultaneously achieves effective scRNA-seq data clustering and gene selection. ZINBMM conducts a systemic analysis on raw counts, accommodating both batch effects and dropout events. Simulations and the analysis of five scRNA-seq datasets demonstrate the practical applicability of ZINBMM.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1474-760X
1474-7596
1474-760X
DOI:10.1186/s13059-023-03046-0