Javascript must be enabled to continue!
Explaining Projections of High-Dimensional Data
View through CrossRef
Visualization techniques and methods are often a key aid for scientists who aim to form, refine, or invalidate hypotheses about underlying phenomena based on multidimensional datasets. Among such techniques, dimensionality reduction techniques, also called projections, offer significant advantages with respect to other visualization techniques in terms of their computational and visual scalability in both the number of samples and dimensions, and have hence become one of the most used visualizations of multidimensional data. However, projections create images which are hard, if not impossible, to interpret in detail without additional visual help.
In this thesis, we study how to enrich existing projection techniques by so called interactive visual explanatory mechanisms.
Our first contribution is an extension of an existing family of local explanation techniques to characterize the data in terms of local dimension correlation and intrinsic dimensionality. We show that our extensions, which are simple to implement, efficient to compute, and applicable to any projection method, can significantly contribute -- when combined with existing explanations -- to a better understanding of the visualized dataset.
Our second contribution studies the usage of projections that create 3D scatterplots as opposed to the traditional 2D ones that most existing projection methods employ. We show that 3D projections offer only minimal improvements with respect to existing quality metrics used to measure projections. However, when annotated with our explanations, and since interactively viewable from multiple viewpoints, 3D projections create a stronger involvement of the user in exploring the depicted data. To further study how the insights obtained from a projection depend on the chosen viewpoints, we propose quality metrics to characterize the structures visible from any given viewpoint. We also propose an interactive tool to guide users in finding good-quality viewpoints. We conduct a user study that shows that our quality metrics agree with viewpoints perceived as useful by users both when guided by, and when not having, our interactive tool.
Our final contribution explains multidimensional projections from the perspective of their computational stability. For this, we propose to use a variant of sensitivity analysis -- a well-known technique in signal processing but, to our knowledge, not having been used yet in assessing projections. We show that a recent deep-learning technique (NNP), which excels in computational speed, simplicity of use, genericity, quality, and out-of-sample ability, also meets the stability requirement as it exhibits only small output changes for significantly large changes of its input data for a range of perturbations.
Title: Explaining Projections of High-Dimensional Data
Description:
Visualization techniques and methods are often a key aid for scientists who aim to form, refine, or invalidate hypotheses about underlying phenomena based on multidimensional datasets.
Among such techniques, dimensionality reduction techniques, also called projections, offer significant advantages with respect to other visualization techniques in terms of their computational and visual scalability in both the number of samples and dimensions, and have hence become one of the most used visualizations of multidimensional data.
However, projections create images which are hard, if not impossible, to interpret in detail without additional visual help.
In this thesis, we study how to enrich existing projection techniques by so called interactive visual explanatory mechanisms.
Our first contribution is an extension of an existing family of local explanation techniques to characterize the data in terms of local dimension correlation and intrinsic dimensionality.
We show that our extensions, which are simple to implement, efficient to compute, and applicable to any projection method, can significantly contribute -- when combined with existing explanations -- to a better understanding of the visualized dataset.
Our second contribution studies the usage of projections that create 3D scatterplots as opposed to the traditional 2D ones that most existing projection methods employ.
We show that 3D projections offer only minimal improvements with respect to existing quality metrics used to measure projections.
However, when annotated with our explanations, and since interactively viewable from multiple viewpoints, 3D projections create a stronger involvement of the user in exploring the depicted data.
To further study how the insights obtained from a projection depend on the chosen viewpoints, we propose quality metrics to characterize the structures visible from any given viewpoint.
We also propose an interactive tool to guide users in finding good-quality viewpoints.
We conduct a user study that shows that our quality metrics agree with viewpoints perceived as useful by users both when guided by, and when not having, our interactive tool.
Our final contribution explains multidimensional projections from the perspective of their computational stability.
For this, we propose to use a variant of sensitivity analysis -- a well-known technique in signal processing but, to our knowledge, not having been used yet in assessing projections.
We show that a recent deep-learning technique (NNP), which excels in computational speed, simplicity of use, genericity, quality, and out-of-sample ability, also meets the stability requirement as it exhibits only small output changes for significantly large changes of its input data for a range of perturbations.
Related Results
Future flood frequency curve of the Arno River (Central Italy) by using bias-corrected convection-permitting model projections in a semi-distributed hydrological model
Future flood frequency curve of the Arno River (Central Italy) by using bias-corrected convection-permitting model projections in a semi-distributed hydrological model
Understanding how climate change affects the frequency and magnitude of floods is essential for adaptation strategies. Usually, the impact of climate change on extreme weather and ...
THREE-DIMENSIONAL HOLOGRAPHIC OPTICAL ELEMENTS BASED ON NEW MICROSYSTEMS
THREE-DIMENSIONAL HOLOGRAPHIC OPTICAL ELEMENTS BASED ON NEW MICROSYSTEMS
The origination and improvement of holographic methods, as well as technical equipment for their implementation [1–3] revived interest in light diffraction in three-dimensional per...
Evaluating the Consistency between Statistically Downscaled and Global Dynamical Model Climate Change Projections
Evaluating the Consistency between Statistically Downscaled and Global Dynamical Model Climate Change Projections
Abstract
The consistency between rainfall projections obtained from direct climate model output and statistical downscaling is evaluated. Results are averaged across...
High Dimensional Computing on Arabic Language Classification
High Dimensional Computing on Arabic Language Classification
Abstract
The brain circuit is enormous regarding quantities of neurons and neuro-transmitters, proposing that huge circuits are the main entity to the brain-core processing...
Two-dimensional function photonic crystal
Two-dimensional function photonic crystal
Photonic crystal is a kind of periodic optical nanostructure consisting of two or more materials with different dielectric constants, which has attracted great deal of attention be...
Estimating and projecting subacute care demand: findings from a review of international methods
Estimating and projecting subacute care demand: findings from a review of international methods
A review of projection methodologies used to
project sub-acute inpatient activity in various international
health care jurisdictions was undertaken
as part of a project to develop ...
Radiation doses and estimated risk from angiographic projections during coronary angiography performed using novel flat detector
Radiation doses and estimated risk from angiographic projections during coronary angiography performed using novel flat detector
Coronary angiography (CA) procedure uses various angiographic projections to elicit detailed information of the coronary arteries with some steep projections involving high radiati...
Four-week forecasts of COVID-19 epidemic trajectories in South Africa, Chile, Peru and Brazil: a model evaluation
Four-week forecasts of COVID-19 epidemic trajectories in South Africa, Chile, Peru and Brazil: a model evaluation
ABSTRACTIntroductionFrom the beginning of the COVID-19 pandemic, epidemiological models have been used in a number of ways to aid governments and organizations in efficient plannin...


