Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Conditional t-SNE: more informative t-SNE embeddings

View through CrossRef
AbstractDimensionality reduction and manifold learning methods such as t-distributed stochastic neighbor embedding (t-SNE) are frequently used to map high-dimensional data into a two-dimensional space to visualize and explore that data. Going beyond the specifics of t-SNE, there are two substantial limitations of any such approach: (1) not all information can be captured in a single two-dimensional embedding, and (2) to well-informed users, the salient structure of such an embedding is often already known, preventing that any real new insights can be obtained. Currently, it is not known how to extract the remaining information in a similarly effective manner. We introduce conditional t-SNE (ct-SNE), a generalization of t-SNE that discounts prior information in the form of labels. This enables obtaining more informative and more relevant embeddings. To achieve this, we propose a conditioned version of the t-SNE objective, obtaining an elegant method with a single integrated objective. We show how to efficiently optimize the objective and study the effects of the extra parameter that ct-SNE has over t-SNE. Qualitative and quantitative empirical results on synthetic and real data show ct-SNE is scalable, effective, and achieves its goal: it allows complementary structure to be captured in the embedding and provided new insights into real data.
Title: Conditional t-SNE: more informative t-SNE embeddings
Description:
AbstractDimensionality reduction and manifold learning methods such as t-distributed stochastic neighbor embedding (t-SNE) are frequently used to map high-dimensional data into a two-dimensional space to visualize and explore that data.
Going beyond the specifics of t-SNE, there are two substantial limitations of any such approach: (1) not all information can be captured in a single two-dimensional embedding, and (2) to well-informed users, the salient structure of such an embedding is often already known, preventing that any real new insights can be obtained.
Currently, it is not known how to extract the remaining information in a similarly effective manner.
We introduce conditional t-SNE (ct-SNE), a generalization of t-SNE that discounts prior information in the form of labels.
This enables obtaining more informative and more relevant embeddings.
To achieve this, we propose a conditioned version of the t-SNE objective, obtaining an elegant method with a single integrated objective.
We show how to efficiently optimize the objective and study the effects of the extra parameter that ct-SNE has over t-SNE.
Qualitative and quantitative empirical results on synthetic and real data show ct-SNE is scalable, effective, and achieves its goal: it allows complementary structure to be captured in the embedding and provided new insights into real data.

Related Results

Helium stars exploding in circumstellar material and the origin of Type Ibn supernovae
Helium stars exploding in circumstellar material and the origin of Type Ibn supernovae
Type Ibn supernovae (SNe) are a mysterious class of transients whose spectra exhibit persistently narrow He I lines, and whose bolometric light curves are typically fast evolving a...
Analyses des propriétés locales des galaxies hôtes des Supernovae de type Ia dans la collaboration The Nearby Supernova Factory
Analyses des propriétés locales des galaxies hôtes des Supernovae de type Ia dans la collaboration The Nearby Supernova Factory
Les supernovae de type Ia (SNe Ia) sont de puissants indicateurs de distance cosmologique. Elles sont à l'origine de la découverte de l'énergie noire dans l'univers et restent aujo...
Exploring Word Embeddings for Text Classification: A Comparative Analysis
Exploring Word Embeddings for Text Classification: A Comparative Analysis
For language tasks like text classification and sequence labeling, word embeddings are essential for providing input characteristics in deep models. There have been many word embed...
Exploring the Privacy-Preserving Properties of Word Embeddings: Algorithmic Validation Study (Preprint)
Exploring the Privacy-Preserving Properties of Word Embeddings: Algorithmic Validation Study (Preprint)
BACKGROUND Word embeddings are dense numeric vectors used to represent language in neural networks. Until recently, there had been no publicly released embe...
Conditional Constructions in Yemsa
Conditional Constructions in Yemsa
Introduction. The main objective of this study is to produce a comprehensive description of Yemsa conditional constructions. The existing studies do not describe conditional clause...
Pleiotropy and the evolutionary stability of plastic phenotypes: a geometric framework
Pleiotropy and the evolutionary stability of plastic phenotypes: a geometric framework
Phenotypic plasticity allows organisms to express different traits in response to different environmental or genetic conditions. Understanding the evolution of conditional phenotyp...
When Word Embeddings Become Endangered
When Word Embeddings Become Endangered
Big languages such as English and Finnish have many natural language processing (NLP) resources and models, but this is not the case for low-resourced and endangered languages as s...
Progenitors of Type IIb Supernovae. I. Evolutionary Pathways and Rates
Progenitors of Type IIb Supernovae. I. Evolutionary Pathways and Rates
Abstract Type IIb supernovae (SNe) are important candidates to understand mechanisms that drive the stripping of stripped-envelope (SE) supernova (SN) progenitors. W...

Back to Top