Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Subject clustering by IF-PCA and several recent methods

View through CrossRef
Subject clustering (i.e., the use of measured features to cluster subjects, such as patients or cells, into multiple groups) is a problem of significant interest. In recent years, many approaches have been proposed, among which unsupervised deep learning (UDL) has received much attention. Two interesting questions are 1) how to combine the strengths of UDL and other approaches and 2) how these approaches compare to each other. We combine the variational auto-encoder (VAE), a popular UDL approach, with the recent idea of influential feature-principal component analysis (IF-PCA) and propose IF-VAE as a new method for subject clustering. We study IF-VAE and compare it with several other methods (including IF-PCA, VAE, Seurat, and SC3) on 10 gene microarray data sets and eight single-cell RNA-seq data sets. We find that IF-VAE shows significant improvement over VAE, but still underperforms compared to IF-PCA. We also find that IF-PCA is quite competitive, slightly outperforming Seurat and SC3 over the eight single-cell data sets. IF-PCA is conceptually simple and permits delicate analysis. We demonstrate that IF-PCA is capable of achieving phase transition in a rare/weak model. Comparatively, Seurat and SC3 are more complex and theoretically difficult to analyze (for these reasons, their optimality remains unclear).
Title: Subject clustering by IF-PCA and several recent methods
Description:
Subject clustering (i.
e.
, the use of measured features to cluster subjects, such as patients or cells, into multiple groups) is a problem of significant interest.
In recent years, many approaches have been proposed, among which unsupervised deep learning (UDL) has received much attention.
Two interesting questions are 1) how to combine the strengths of UDL and other approaches and 2) how these approaches compare to each other.
We combine the variational auto-encoder (VAE), a popular UDL approach, with the recent idea of influential feature-principal component analysis (IF-PCA) and propose IF-VAE as a new method for subject clustering.
We study IF-VAE and compare it with several other methods (including IF-PCA, VAE, Seurat, and SC3) on 10 gene microarray data sets and eight single-cell RNA-seq data sets.
We find that IF-VAE shows significant improvement over VAE, but still underperforms compared to IF-PCA.
We also find that IF-PCA is quite competitive, slightly outperforming Seurat and SC3 over the eight single-cell data sets.
IF-PCA is conceptually simple and permits delicate analysis.
We demonstrate that IF-PCA is capable of achieving phase transition in a rare/weak model.
Comparatively, Seurat and SC3 are more complex and theoretically difficult to analyze (for these reasons, their optimality remains unclear).

Related Results

The Kernel Rough K-Means Algorithm
The Kernel Rough K-Means Algorithm
Background: Clustering is one of the most important data mining methods. The k-means (c-means ) and its derivative methods are the hotspot in the field of clustering research in re...
Abstract 2708: Toward improved cancer classification using PCA + tSNE dimensionality reduction on bulk RNA-seq data
Abstract 2708: Toward improved cancer classification using PCA + tSNE dimensionality reduction on bulk RNA-seq data
Abstract Intro: Minor variations in cancer type can have a major impact on therapeutic effectiveness and on the course of drug research and development. In order to ...
Racial variation in the reliability of prostate cancer indicators in men undergoing subsequent prostate biopsy.
Racial variation in the reliability of prostate cancer indicators in men undergoing subsequent prostate biopsy.
115 Background: Many men with an initial negative prostate biopsy have a persistently elevated prostate specific antigen (PSA) prompting physicians to perform repeat biopsies. Afr...
Association between PSA density and pathologically significant prostate cancer: The impact of prostate volume
Association between PSA density and pathologically significant prostate cancer: The impact of prostate volume
AbstractBackgroundThe early diagnosis of prostate cancer (PCa) is mainly based on prostate‐specific antigen (PSA) blood levels and digital rectal examination. However, this approac...
Abstract 1464: Zaniya Mark
Abstract 1464: Zaniya Mark
Abstract Prostate cancer (PCa) is one of the most common types of cancers diagnosed in American men. Moreover, PCa malignancy disproportionally strikes more on Afric...
Family history is significantly associated with prostate cancer and its early onset in Chinese population
Family history is significantly associated with prostate cancer and its early onset in Chinese population
AbstractBackgroundFamily history (FH) of prostate cancer (PCa) in Chinese population is poorly understood. The objective of this study is to evaluate the association between FH and...

Back to Top