Unsupervised Heterogeneous Coupling Learning for Categorical Representation

07/21/2020
by   Chengzhang Zhu, et al.
0

Complex categorical data is often hierarchically coupled with heterogeneous relationships between attributes and attribute values and the couplings between objects. Such value-to-object couplings are heterogeneous with complementary and inconsistent interactions and distributions. Limited research exists on unlabeled categorical data representations, ignores the heterogeneous and hierarchical couplings, underestimates data characteristics and complexities, and overuses redundant information, etc. The deep representation learning of unlabeled categorical data is challenging, overseeing such value-to-object couplings, complementarity and inconsistency, and requiring large data, disentanglement, and high computational power. This work introduces a shallow but powerful UNsupervised heTerogeneous couplIng lEarning (UNTIE) approach for representing coupled categorical data by untying the interactions between couplings and revealing heterogeneous distributions embedded in each type of couplings. UNTIE is efficiently optimized w.r.t. a kernel k-means objective function for unsupervised representation learning of heterogeneous and hierarchical value-to-object couplings. Theoretical analysis shows that UNTIE can represent categorical data with maximal separability while effectively represent heterogeneous couplings and disclose their roles in categorical data. The UNTIE-learned representations make significant performance improvement against the state-of-the-art categorical representations and deep representation models on 25 categorical data sets with diversified characteristics.

READ FULL TEXT

page 3

page 5

page 6

page 7

page 10

page 13

page 14

page 16

research
05/25/2022

NECA: Network-Embedded Deep Representation Learning for Categorical Data

We propose NECA, a deep representation learning method for categorical d...
research
10/07/2020

FairMixRep : Self-supervised Robust Representation Learning for Heterogeneous Data with Fairness constraints

Representation Learning in a heterogeneous space with mixed variables of...
research
07/01/2020

Coupling Learning of Complex Interactions

Complex applications such as big data analytics involve different forms ...
research
06/13/2021

Linear representation of categorical values

We propose a binary representation of categorical values using a linear ...
research
03/26/2021

Categorical Representation Learning: Morphism is All You Need

We provide a construction for categorical representation learning and in...
research
12/25/2022

Evaluating Alternative Glyph Design for Showing Large-Magnitude-Range Quantum Spins

We present experimental results to explore a form of bivariate glyphs fo...
research
11/29/2017

Learning Interesting Categorical Attributes for Refined Data Exploration

This work proposes and evaluates a novel approach to determine interesti...

Please sign up or login with your details

Forgot password? Click here to reset