Parallax: Visualizing and Understanding the Semantics of Embedding Spaces via Algebraic Formulae

05/28/2019
by   Piero Molino, et al.
0

Embeddings are a fundamental component of many modern machine learning and natural language processing models. Understanding them and visualizing them is essential for gathering insights about the information they capture and the behavior of the models. State of the art in analyzing embeddings consists in projecting them in two-dimensional planes without any interpretable semantics associated to the axes of the projection, which makes detailed analyses and comparison among multiple sets of embeddings challenging. In this work, we propose to use explicit axes defined as algebraic formulae over embeddings to project them into a lower dimensional, but semantically meaningful subspace, as a simple yet effective analysis and visualization methodology. This methodology assigns an interpretable semantics to the measures of variability and the axes of visualizations, allowing for both comparisons among different sets of embeddings and fine-grained inspection of the embedding spaces. We demonstrate the power of the proposed methodology through a series of case studies that make use of visualizations constructed around the underlying methodology and through a user study. The results show how the methodology is effective at providing more profound insights than classical projection methods and how it is widely applicable to many other use cases.

READ FULL TEXT

page 13

page 18

page 20

page 22

page 23

page 25

research
12/10/2019

Embedding Comparator: Visualizing Differences in Global Structure and Local Neighborhoods via Small Multiples

Embeddings – mappings from high-dimensional discrete input to lower-dime...
research
11/05/2019

embComp: Visual Interactive Comparison of Vector Embeddings

This work introduces embComp, a novel approach for comparing two embeddi...
research
09/23/2022

Incorporation of Human Knowledge into Data Embeddings to Improve Pattern Significance and Interpretability

Embedding is a common technique for analyzing multi-dimensional data. Ho...
research
08/18/2015

Learning Meta-Embeddings by Using Ensembles of Embedding Sets

Word embeddings -- distributed representations of words -- in deep learn...
research
02/05/2022

Emblaze: Illuminating Machine Learning Representations through Interactive Comparison of Embedding Spaces

Modern machine learning techniques commonly rely on complex, high-dimens...
research
08/01/2022

RISeer: Inspecting the Status and Dynamics of Regional Industrial Structure via Visual Analytics

Restructuring the regional industrial structure (RIS) has the potential ...
research
06/21/2022

Boosting Performance Optimization with Interactive Data Movement Visualization

Optimizing application performance in today's hardware architecture land...

Please sign up or login with your details

Forgot password? Click here to reset