NCVis: Noise Contrastive Approach for Scalable Visualization

01/30/2020
by   Aleksandr Artemenkov, et al.
14

Modern methods for data visualization via dimensionality reduction, such as t-SNE, usually have performance issues that prohibit their application to large amounts of high-dimensional data. In this work, we propose NCVis – a high-performance dimensionality reduction method built on a sound statistical basis of noise contrastive estimation. We show that NCVis outperforms state-of-the-art techniques in terms of speed while preserving the representation quality of other methods. In particular, the proposed approach successfully proceeds a large dataset of more than 1 million news headlines in several minutes and presents the underlying structure in a human-readable way. Moreover, it provides results consistent with classical methods like t-SNE on more straightforward datasets like images of hand-written digits. We believe that the broader usage of such software can significantly simplify the large-scale data analysis and lower the entry barrier to this area.

READ FULL TEXT
research
03/10/2021

A Local Similarity-Preserving Framework for Nonlinear Dimensionality Reduction with Neural Networks

Real-world data usually have high dimensionality and it is important to ...
research
01/03/2022

Scalable semi-supervised dimensionality reduction with GPU-accelerated EmbedSOM

Dimensionality reduction methods have found vast application as visualiz...
research
05/10/2019

Supporting Analysis of Dimensionality Reduction Results with Contrastive Learning

Dimensionality reduction (DR) is frequently used for analyzing and visua...
research
02/12/2019

High dimensionality: The latest challenge to data analysis

The advent of modern technology, permitting the measurement of thousands...
research
11/28/2021

Dimensionality Reduction of Longitudinal 'Omics Data using Modern Tensor Factorization

Precision medicine is a clinical approach for disease prevention, detect...
research
07/05/2021

An Analytical Survey on Recent Trends in High Dimensional Data Visualization

Data visualization is the process by which data of any size or dimension...
research
01/15/2020

ShapeVis: High-dimensional Data Visualization at Scale

We present ShapeVis, a scalable visualization technique for point cloud ...

Please sign up or login with your details

Forgot password? Click here to reset