Universal Dependency Analysis

10/28/2015
by Hoang-Vu Nguyen, et al.

Most data is multi-dimensional. Discovering whether any subset of dimensions, or subspace, of such data is significantly correlated is a core task in data mining. To do so, we require a measure that quantifies how correlated a subspace is. For practical use, such a measure should be universal in the sense that it captures correlation in subspaces of any dimensionality and allows correlation scores to be compared meaningfully across different subspaces, regardless of how many dimensions they have and what specific statistical properties their dimensions possess. Further, the measure should ideally capture both linear and non-linear correlations non-parametrically and efficiently. In this paper, we propose UDS, a multivariate correlation measure that fulfills all of these desiderata. In short, we define UDS based on cumulative entropy and propose a principled normalization scheme that brings its scores across different subspaces to the same domain, enabling universal correlation assessment. UDS is purely non-parametric: we make no assumptions about data distributions or types of correlation. To compute it on empirical data, we introduce an efficient and non-parametric method. Extensive experiments show that UDS outperforms the state of the art.
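
For readers unfamiliar with cumulative entropy, the quantity the abstract builds on, the following is a minimal sketch of its standard empirical estimator for a single dimension. It is not the UDS measure itself, and it omits the conditional terms and the normalization scheme the paper proposes; the function name `cumulative_entropy` is a hypothetical choice for illustration. The estimator replaces the distribution function in h(X) = -∫ P(X ≤ x) log P(X ≤ x) dx with the empirical one, which is constant between consecutive sorted samples.

```python
import math


def cumulative_entropy(values):
    """Empirical cumulative entropy of a 1-D sample (illustrative sketch).

    Uses the empirical CDF, which equals i/n on the gap between the
    i-th and (i+1)-th smallest samples (1-indexed).
    """
    xs = sorted(values)
    n = len(xs)
    if n < 2:
        return 0.0  # a single point has no spread
    total = 0.0
    for i in range(1, n):
        p = i / n                 # empirical CDF on the gap [xs[i-1], xs[i])
        gap = xs[i] - xs[i - 1]   # width of that gap
        total += gap * p * math.log(p)
    return -total                 # p * log(p) <= 0, so the result is >= 0


if __name__ == "__main__":
    print(cumulative_entropy([0.0, 1.0]))
    print(cumulative_entropy([3.0, 3.0, 3.0]))
```

Unlike Shannon differential entropy, this estimate needs no density estimation or binning, is never negative, and scales with the data (doubling every value doubles the result), which is one reason a normalization step is needed before scores from different subspaces can be compared.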


