DIFER: Differentiable Automated Feature Engineering

10/17/2020
by   Guanghui Zhu, et al.
0

Feature engineering, a crucial step of machine learning, aims to extract useful features from raw data to improve data quality. In recent years, great efforts have been devoted to Automated Feature Engineering (AutoFE) to replace expensive human labor. However, existing methods are computationally demanding due to treating AutoFE as a coarse-grained black-box optimization problem over a discrete space. In this work, we propose an efficient gradient-based method called DIFER to perform differentiable automated feature engineering in a continuous vector space. DIFER selects potential features based on evolutionary algorithm and leverages an encoder-predictor-decoder controller to optimize existing features. We map features into the continuous vector space via the encoder, optimize the embedding along the gradient direction induced by the predicted score, and recover better features from the optimized embedding by the decoder. Extensive experiments on classification and regression datasets demonstrate that DIFER can significantly improve the performance of various machine learning algorithms and outperform current state-of-the-art AutoFE methods in terms of both efficiency and performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/21/2019

Techniques for Automated Machine Learning

Automated machine learning (AutoML) aims to find optimal machine learnin...
research
05/19/2021

Mill.jl and JsonGrinder.jl: automated differentiable feature extraction for learning from raw JSON data

Learning from raw data input, thus limiting the need for manual feature ...
research
02/26/2023

Data-Centric AI: Deep Generative Differentiable Feature Selection via Discrete Subsetting as Continuous Embedding Space Optimization

Feature Selection (FS), such as filter, wrapper, and embedded methods, a...
research
06/18/2020

Neural Architecture Optimization with Graph VAE

Due to their high computational efficiency on a continuous space, gradie...
research
12/26/2022

Toward Efficient Automated Feature Engineering

Automated Feature Engineering (AFE) refers to automatically generate and...
research
10/22/2021

Learning Text-Image Joint Embedding for Efficient Cross-Modal Retrieval with Deep Feature Engineering

This paper introduces a two-phase deep feature engineering framework for...
research
07/31/2018

Leveraging Knowledge Graph Embedding Techniques for Industry 4.0 Use Cases

Industry is evolving towards Industry 4.0, which holds the promise of in...

Please sign up or login with your details

Forgot password? Click here to reset