Efficient Classification with Counterfactual Reasoning and Active Learning

07/25/2022
by Azhar Mohammed, et al.

Data augmentation is one of the most successful techniques for improving the classification accuracy of machine learning models in computer vision. Applying it to tabular data, however, is challenging because it is hard to generate synthetic samples with labels. In this paper, we propose an efficient classifier with a novel data augmentation technique for tabular data. Our method, called CCRAL, combines causal reasoning, which learns counterfactual samples for the original training samples, with active learning, which selects useful counterfactual samples based on a region of uncertainty. In this way, the method aims to maximize the model's generalization to unseen test data. We validate our method analytically and compare it with standard baselines. Our experimental results show that CCRAL achieves significantly better accuracy and AUC than the baselines across several real-world tabular datasets. Data and source code are available at: https://github.com/nphdang/CCRAL.
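
The two-stage idea described above (generate candidate counterfactual samples, then keep only those that fall in a region of uncertainty) can be illustrated with a small sketch. This is not the authors' implementation: the nearest-opposite-class interpolation used to produce candidates, the fixed linear model in `predict_proba`, and the `margin` threshold are all hypothetical stand-ins chosen only to make the selection step concrete.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(x, w=(1.0, 1.0), b=-2.0):
    # Hypothetical, already-fitted linear classifier returning P(y = 1 | x).
    # The weights are illustrative assumptions, not values from the paper.
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def counterfactual_candidates(X, Y, t=0.75):
    # Stand-in generator (an assumption, not the CCRAL procedure):
    # move each sample a fraction t toward its nearest neighbor of the
    # opposite class and assign it that neighbor's label.
    candidates = []
    for x, y in zip(X, Y):
        opposite = [(xi, yi) for xi, yi in zip(X, Y) if yi != y]
        if not opposite:
            continue
        x_near, y_near = min(opposite, key=lambda p: math.dist(x, p[0]))
        x_cf = tuple(a + t * (c - a) for a, c in zip(x, x_near))
        candidates.append((x_cf, y_near))
    return candidates


def select_uncertain(candidates, proba, margin=0.2):
    # Uncertainty-based selection: keep candidates whose predicted
    # probability lies within `margin` of the 0.5 decision boundary.
    return [(x, y) for x, y in candidates if abs(proba(x) - 0.5) <= margin]


# Toy two-feature dataset (made up for illustration).
X = [(0.0, 0.0), (0.5, 0.5), (2.0, 2.0), (3.0, 3.0)]
Y = [0, 0, 1, 1]

candidates = counterfactual_candidates(X, Y)
selected = select_uncertain(candidates, predict_proba)

# Augmented training set: original samples plus the selected candidates.
X_aug = X + [x for x, _ in selected]
Y_aug = Y + [y for _, y in selected]
```

In this sketch the classifier would then be retrained on `X_aug` and `Y_aug`; candidates far from the decision boundary are discarded on the assumption that they add little information.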

