Between Cluster Analysis: Supervised Dimensionality Reduction for Trajectory Inference
Abstract
Motivation
Single-cell RNA sequencing (scRNA-seq) measures the transcriptional state of individual cells, enabling more precise characterization of cell types, cell states, and developmental trajectories. Because scRNA-seq data are high-dimensional, a standard first step in scRNA-seq analysis is dimensionality reduction. Principal component analysis (PCA) and many other commonly used dimensionality reduction techniques are unsupervised, meaning that they do not incorporate any prior knowledge of the data being analyzed. In contrast, nearly all trajectory inference methods are supervised, relying on information such as a clustering of cells into cell types/states.
Results
We introduce Between Cluster Analysis (BCA), a supervised linear dimensionality reduction technique that uses the cluster labels of cells as prior information and computes an embedding that maximizes the between-cluster variance. We show on both simulated and real data that BCA improves trajectory inference compared to other dimensionality reduction methods, including Linear Discriminant Analysis (LDA), another supervised linear dimensionality reduction method. Additionally, we observe that many commonly used metrics for evaluating trajectory inference assess only the ordering of cell types and not the identification or ordering of intermediate cell states. We propose an alternative measure that evaluates how well trajectory inference methods preserve intermediate cells, especially when the ordering of these intermediate cells is unknown.
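The idea of a linear embedding that maximizes between-cluster variance can be illustrated with a minimal sketch. This is not the authors' implementation (see the repository under Availability for that). It assumes the between-cluster scatter is the cluster-size-weighted scatter of cluster centroids around the grand mean, and that the embedding is given by the leading eigenvectors of that scatter matrix. The function name `between_cluster_embedding` and its parameters are hypothetical. Unlike LDA, this sketch does not normalize by the within-cluster scatter.

```python
import numpy as np

def between_cluster_embedding(X, labels, n_components=2):
    """Illustrative sketch: project cells onto the directions that
    maximize the (size-weighted) variance of the cluster centroids.

    X       : (n_cells, n_genes) expression matrix (assumed already normalized)
    labels  : (n_cells,) cluster label per cell
    Returns : (embedding, components)
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    grand_mean = X.mean(axis=0)

    # Row k of M is sqrt(n_k) * (centroid_k - grand_mean), so that
    # M.T @ M equals the between-cluster scatter matrix S_B.
    M = np.vstack([
        np.sqrt((labels == c).sum()) * (X[labels == c].mean(axis=0) - grand_mean)
        for c in np.unique(labels)
    ])

    # Right singular vectors of M are the eigenvectors of S_B,
    # ordered by decreasing between-cluster variance.
    _, _, vt = np.linalg.svd(M, full_matrices=False)
    components = vt[:n_components]

    # Linear projection of every cell, including cells between centroids.
    embedding = (X - grand_mean) @ components.T
    return embedding, components
```

Because the centered centroids of K clusters span at most K - 1 dimensions, only the first K - 1 components carry between-cluster variance in this sketch. Every cell is projected with the same linear map, so cells lying between centroids keep their relative positions along those directions.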
Availability
Code is available at https://github.com/raphael-group/BCA
Supplementary information
Supplementary data are available at Bioinformatics online.
Oxford University Press (OUP)

