Causal Feature Selection with Dimension Reduction for Interpretable Text Classification

10/09/2020
by Guohou Shan, et al.

Text features that are correlated with class labels but do not directly cause them are sometimes useful for prediction, but they may not be insightful. As an alternative to traditional correlation-based feature selection, causal inference could reveal more principled, meaningful relationships between text features and labels. To help researchers gain insight into text data, e.g. for social science applications, in this paper we investigate a class of matching-based causal inference methods for text feature selection. Features used in document classification are often high-dimensional; however, existing causal feature selection methods use Propensity Score Matching (PSM), which is known to be less effective in high-dimensional spaces. We propose a new causal feature selection framework that combines dimension reduction with causal inference to improve text feature selection. Experiments on both synthetic and real-world data demonstrate the promise of our methods in improving classification and enhancing interpretability.
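The general idea of pairing dimension reduction with matching can be illustrated with a small sketch. This is not the authors' framework: it assumes a toy synthetic corpus, swaps in a first-principal-component projection (via power iteration) for the dimension reduction step, and uses 1-nearest-neighbor matching on that reduced score in place of propensity score matching. All names (`make_data`, `first_pc_scores`, `matched_effect`, `naive_effect`) and all parameters are hypothetical.

```python
import random


def first_pc_scores(rows, iters=30):
    """Score each row on the first principal component, found by power iteration."""
    d = len(rows[0])
    means = [sum(r[j] for r in rows) / len(rows) for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    v = [1.0 / d ** 0.5] * d  # arbitrary unit-length starting direction
    for _ in range(iters):
        proj = [sum(c[j] * v[j] for j in range(d)) for c in centered]
        w = [sum(p * c[j] for p, c in zip(proj, centered)) for j in range(d)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    return [sum(c[j] * v[j] for j in range(d)) for c in centered]


def naive_effect(X, y, feature):
    """Correlation-style baseline: difference in mean label with/without the word."""
    on = [y[i] for i, row in enumerate(X) if row[feature] == 1]
    off = [y[i] for i, row in enumerate(X) if row[feature] == 0]
    return sum(on) / len(on) - sum(off) / len(off)


def matched_effect(X, y, feature):
    """Effect on the treated, using 1-NN matching on the reduced covariates."""
    # Covariates are all other words; reduce them to a single score.
    covariates = [[v for j, v in enumerate(row) if j != feature] for row in X]
    score = first_pc_scores(covariates)
    treated = [i for i, row in enumerate(X) if row[feature] == 1]
    control = [i for i, row in enumerate(X) if row[feature] == 0]
    diffs = []
    for i in treated:
        # Match with replacement to the control document with the closest score.
        j = min(control, key=lambda c: abs(score[c] - score[i]))
        diffs.append(y[i] - y[j])
    return sum(diffs) / len(diffs)


def make_data(n=1000, seed=0):
    """Toy corpus (an assumption): a hidden topic drives words 1..9 and the label;
    word 0 is independent of the topic and affects the label directly."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        topic = int(rng.random() < 0.5)
        p_word = 0.8 if topic else 0.2
        row = [int(rng.random() < 0.5)]                       # word 0: causal
        row += [int(rng.random() < p_word) for _ in range(9)]  # words 1..9: topical
        p_label = 0.05 + 0.5 * row[0] + 0.4 * topic
        X.append(row)
        y.append(int(rng.random() < p_label))
    return X, y


X, y = make_data()
for name, f in [("causal word 0", 0), ("topical word 1", 1)]:
    print(f"{name}: naive={naive_effect(X, y, f):+.3f} "
          f"matched={matched_effect(X, y, f):+.3f}")
```

In this toy setup, word 1 is correlated with the label only through the hidden topic, so matching on a low-dimensional summary of the remaining words should shrink its estimated effect, while the estimate for word 0 should stay large. A single principal component is enough here only because the synthetic covariates have one dominant direction; real text would likely need a richer reduction.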
