Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data
Abstract
Gene set scoring (GSS) is routinely used in gene expression analysis of bulk and single-cell RNA-seq (scRNA-seq) data, where it helps to decipher single-cell heterogeneity and cell-type-specific variability by incorporating prior knowledge from functional gene sets. Single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) is a powerful technique for interrogating chromatin-based gene regulation in single cells, and genes or gene sets with dynamic regulatory potential can be regarded as cell-type-specific markers, as in scRNA-seq. However, few GSS tools have been designed specifically for scATAC-seq, and the applicability and performance of RNA-seq GSS tools on scATAC-seq data remain to be investigated. We systematically benchmarked ten GSS tools: four bulk RNA-seq tools, five scRNA-seq tools, and one scATAC-seq method. First, using matched scATAC-seq and scRNA-seq datasets, we found that the performance of GSS tools on scATAC-seq data is comparable to that on scRNA-seq data, suggesting that they are applicable to scATAC-seq. We then extensively evaluated the performance of the different GSS tools using up to ten scATAC-seq datasets. We also evaluated the impact of gene activity conversion, dropout imputation, and gene set collections on GSS results. The results show that dropout imputation can significantly improve the performance of almost all GSS tools, whereas the impact of gene activity conversion methods and gene set collections depends more on the GSS tool or dataset. Finally, we provide practical guidelines for choosing appropriate pre-processing methods and GSS tools in different scenarios.

