Multi-label Causal Variable Discovery: Learning Common Causal Variables and Label-specific Causal Variables

11/09/2020
by   Xingyu Wu, et al.

Causal variables in the Markov boundary (MB) have been widely applied in single-label tasks. However, little research has addressed causal variable discovery in multi-label data, owing to the complex causal relationships involved. Since some variables in a multi-label scenario may carry causal information about multiple labels, this paper investigates multi-label causal variable discovery, together with the problem of distinguishing common causal variables shared by multiple labels from label-specific causal variables, each associated with a single label. Considering that multiple MBs can exist under a non-positive joint probability distribution, we explore the relationship between common causal variables and the equivalent information phenomenon, and find that equivalent information influences the solutions through different mechanisms depending on whether label causality exists. By analyzing these mechanisms, we derive a theoretical property of common causal variables, on which we base a discovery and distinguishing algorithm that identifies the two types of variables. As in the single-label problem, causal variables for multiple labels also have broad application prospects. To demonstrate this, we apply the proposed causal mechanism to multi-label feature selection and present an interpretable algorithm, which is proved to achieve minimal redundancy and maximum relevance. Extensive experiments demonstrate the efficacy of these contributions.
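To make the distinction between common and label-specific causal variables concrete, the following is a minimal sketch of the definitions only, not the paper's algorithm. It assumes one Markov boundary per label has already been discovered and is given as a set of variable names; the function name `split_causal_variables` and the toy labels and variables are hypothetical. It also ignores the equivalent information phenomenon discussed above: under a non-positive distribution a label may have several MBs, so a plain set comparison like this one could miss common variables.

```python
from collections import Counter


def split_causal_variables(markov_boundaries):
    """Split per-label variable sets into common and label-specific parts.

    markov_boundaries: dict mapping a label name to a set of variable names.
    Assumption (not from the paper): exactly one MB per label is supplied.
    """
    # Count in how many labels' boundaries each variable appears.
    counts = Counter(v for mb in markov_boundaries.values() for v in set(mb))
    # A variable is treated as "common" if it appears for at least two labels.
    common = {v for v, n in counts.items() if n >= 2}
    # Whatever remains in a label's boundary is specific to that label.
    specific = {label: set(mb) - common
                for label, mb in markov_boundaries.items()}
    return common, specific


# Toy input: hypothetical labels Y1..Y3 and variables X1..X5.
mbs = {"Y1": {"X1", "X2", "X3"}, "Y2": {"X2", "X4"}, "Y3": {"X5"}}
common, specific = split_causal_variables(mbs)
print("common:", sorted(common))
for label in sorted(specific):
    print(label, "specific:", sorted(specific[label]))
```

In this toy input, `X2` appears in the boundaries of both `Y1` and `Y2`, so it is the only variable classified as common; every other variable stays with its own label.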


