Javascript must be enabled to continue!

Conditional t-SNE: more informative t-SNE embeddings

AbstractDimensionality reduction and manifold learning methods such as t-distributed stochastic neighbor embedding (t-SNE) are frequently used to map high-dimensional data into a two-dimensional space to visualize and explore that data. Going beyond the specifics of t-SNE, there are two substantial limitations of any such approach: (1) not all information can be captured in a single two-dimensional embedding, and (2) to well-informed users, the salient structure of such an embedding is often already known, preventing that any real new insights can be obtained. Currently, it is not known how to extract the remaining information in a similarly effective manner. We introduce conditional t-SNE (ct-SNE), a generalization of t-SNE that discounts prior information in the form of labels. This enables obtaining more informative and more relevant embeddings. To achieve this, we propose a conditioned version of the t-SNE objective, obtaining an elegant method with a single integrated objective. We show how to efficiently optimize the objective and study the effects of the extra parameter that ct-SNE has over t-SNE. Qualitative and quantitative empirical results on synthetic and real data show ct-SNE is scalable, effective, and achieves its goal: it allows complementary structure to be captured in the embedding and provided new insights into real data.

Springer Science and Business Media LLC

Bo Kang Darío García García Jefrey Lijffijt Raúl Santos-Rodríguez Tijl De Bie

Machine Learning

2020

Title: Conditional t-SNE: more informative t-SNE embeddings

Description:

Going beyond the specifics of t-SNE, there are two substantial limitations of any such approach: (1) not all information can be captured in a single two-dimensional embedding, and (2) to well-informed users, the salient structure of such an embedding is often already known, preventing that any real new insights can be obtained.

Currently, it is not known how to extract the remaining information in a similarly effective manner.

We introduce conditional t-SNE (ct-SNE), a generalization of t-SNE that discounts prior information in the form of labels.

This enables obtaining more informative and more relevant embeddings.

To achieve this, we propose a conditioned version of the t-SNE objective, obtaining an elegant method with a single integrated objective.

We show how to efficiently optimize the objective and study the effects of the extra parameter that ct-SNE has over t-SNE.

Qualitative and quantitative empirical results on synthetic and real data show ct-SNE is scalable, effective, and achieves its goal: it allows complementary structure to be captured in the embedding and provided new insights into real data.

Back

Type Ibn supernovae (SNe) are a mysterious class of transients whose spectra exhibit persistently narrow He I lines, and whose bolometric light curves are typically fast evolving a...

Analyses des propriétés locales des galaxies hôtes des Supernovae de type Ia dans la collaboration The Nearby Supernova Factory

Les supernovae de type Ia (SNe Ia) sont de puissants indicateurs de distance cosmologique. Elles sont à l'origine de la découverte de l'énergie noire dans l'univers et restent aujo...

Exploring Word Embeddings for Text Classification: A Comparative Analysis

For language tasks like text classification and sequence labeling, word embeddings are essential for providing input characteristics in deep models. There have been many word embed...

Exploring the Privacy-Preserving Properties of Word Embeddings: Algorithmic Validation Study (Preprint)

BACKGROUND Word embeddings are dense numeric vectors used to represent language in neural networks. Until recently, there had been no publicly released embe...

Conditional Constructions in Yemsa

Introduction. The main objective of this study is to produce a comprehensive description of Yemsa conditional constructions. The existing studies do not describe conditional clause...

Pleiotropy and the evolutionary stability of plastic phenotypes: a geometric framework

Phenotypic plasticity allows organisms to express different traits in response to different environmental or genetic conditions. Understanding the evolution of conditional phenotyp...

When Word Embeddings Become Endangered

Big languages such as English and Finnish have many natural language processing (NLP) resources and models, but this is not the case for low-resourced and endangered languages as s...

Progenitors of Type IIb Supernovae. I. Evolutionary Pathways and Rates

Abstract Type IIb supernovae (SNe) are important candidates to understand mechanisms that drive the stripping of stripped-envelope (SE) supernova (SN) progenitors. W...

Email:
Password:

Email:

Conditional t-SNE: more informative t-SNE embeddings

Related Results