Dependency detection with similarity constraints

01/31/2011
by   Leo Lahti, et al.
0

Unsupervised two-view learning, or detection of dependencies between two paired data sets, is typically done by some variant of canonical correlation analysis (CCA). CCA searches for a linear projection for each view, such that the correlations between the projections are maximized. The solution is invariant to any linear transformation of either or both of the views; for tasks with small sample size such flexibility implies overfitting, which is even worse for more flexible nonparametric or kernel-based dependency discovery methods. We develop variants which reduce the degrees of freedom by assuming constraints on similarity of the projections in the two views. A particular example is provided by a cancer gene discovery application where chromosomal distance affects the dependencies between gene copy number and activity levels. Similarity constraints are shown to improve detection performance of known cancer genes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/16/2015

Nonparametric Canonical Correlation Analysis

Canonical correlation analysis (CCA) is a classical representation learn...
research
02/24/2018

Correlating Cellular Features with Gene Expression using CCA

To understand the biology of cancer, joint analysis of multiple data mod...
research
12/21/2018

Canonical Correlation Analysis for Misaligned Satellite Image Change Detection

Canonical correlation analysis (CCA) is a statistical learning method th...
research
11/19/2015

An Information Retrieval Approach to Finding Dependent Subspaces of Multiple Views

Finding relationships between multiple views of data is essential both f...
research
06/01/2020

Bayesian Sparse Factor Analysis with Kernelized Observations

Latent variable models for multi-view learning attempt to find low-dimen...
research
03/24/2021

The Complexity of Dependency Detection and Discovery in Relational Databases

Multi-column dependencies in relational databases come associated with t...
research
09/05/2022

Explaining the optimistic performance evaluation of newly proposed methods: a cross-design validation experiment

The constant development of new data analysis methods in many fields of ...

Please sign up or login with your details

Forgot password? Click here to reset