Optimizing Feature Set for Click-Through Rate Prediction

01/26/2023
by   Fuyuan Lyu, et al.
0

Click-through prediction (CTR) models transform features into latent vectors and enumerate possible feature interactions to improve performance based on the input feature set. Therefore, when selecting an optimal feature set, we should consider the influence of both feature and its interaction. However, most previous works focus on either feature field selection or only select feature interaction based on the fixed feature set to produce the feature set. The former restricts search space to the feature field, which is too coarse to determine subtle features. They also do not filter useless feature interactions, leading to higher computation costs and degraded model performance. The latter identifies useful feature interaction from all available features, resulting in many redundant features in the feature set. In this paper, we propose a novel method named OptFS to address these problems. To unify the selection of feature and its interaction, we decompose the selection of each feature interaction into the selection of two correlated features. Such a decomposition makes the model end-to-end trainable given various feature interaction operations. By adopting feature-level search space, we set a learnable gate to determine whether each feature should be within the feature set. Because of the large-scale search space, we develop a learning-by-continuation training scheme to learn such gates. Hence, OptFS generates the feature set only containing features which improve the final prediction results. Experimentally, we evaluate OptFS on three public datasets, demonstrating OptFS can optimize feature sets which enhance the model performance and further reduce both the storage and computational cost.

READ FULL TEXT
research
03/25/2020

AutoFIS: Automatic Feature Interaction Selection in Factorization Models for Click-Through Rate Prediction

Learning effective feature interactions is crucial for click-through rat...
research
11/05/2021

AIM: Automatic Interaction Machine for Click-Through Rate Prediction

Feature embedding learning and feature interaction modeling are two cruc...
research
12/13/2019

Neural Network Surgery with Sets

The cost to train machine learning models has been increasing exponentia...
research
11/10/2022

A metaheuristic multi-objective interaction-aware feature selection method

Multi-objective feature selection is one of the most significant issues ...
research
12/16/2020

AutoDis: Automatic Discretization for Embedding Numerical Features in CTR Prediction

Learning sophisticated feature interactions is crucial for Click-Through...
research
08/09/2022

OptEmbed: Learning Optimal Embedding Table for Click-through Rate Prediction

Learning embedding table plays a fundamental role in Click-through rate(...
research
11/11/2020

CAN: Revisiting Feature Co-Action for Click-Through Rate Prediction

Inspired by the success of deep learning, recent industrial Click-Throug...

Please sign up or login with your details

Forgot password? Click here to reset