FuxiCTR: An Open Benchmark for Click-Through Rate Prediction

09/12/2020
by   Jieming Zhu, et al.
6

In many applications, such as recommender systems, online advertising, and product search, click-through rate (CTR) prediction is a critical task, because its accuracy has a direct impact on both platform revenue and user experience. In recent years, with the prevalence of deep learning, CTR prediction has been widely studied in both academia and industry, resulting in an abundance of deep CTR models. Unfortunately, there is still a lack of a standardized benchmark and uniform evaluation protocols for CTR prediction. This leads to the non-reproducible and even inconsistent experimental results among these studies. In this paper, we present an open benchmark (namely FuxiCTR) for reproducible research and provide a rigorous comparison of different models for CTR prediction. Specifically, we ran over 4,600 experiments for a total of more than 12,000 GPU hours in a uniform framework to re-evaluate 24 existing models on two widely-used datasets, Criteo and Avazu. Surprisingly, our experiments show that many models have smaller differences than expected and sometimes are even inconsistent with what reported in the literature. We believe that our benchmark could not only allow researchers to gauge the effectiveness of new models conveniently, but also share some good practices to fairly compare with the state of the arts. We will release all the code and benchmark settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2018

An empirical study of public data quality problems in cross project defect prediction

Background: Two public defect data, including Jureczko and NASA datasets...
research
09/12/2019

How robust is MovieLens? A dataset analysis for recommender systems

Research publication requires public datasets. In recommender systems, s...
research
07/01/2019

An Open Source AutoML Benchmark

In recent years, an active field of research has developed around automa...
research
01/07/2021

User Response Prediction in Online Advertising

Online advertising, as the vast market, has gained significant attention...
research
08/24/2022

Next-Year Bankruptcy Prediction from Textual Data: Benchmark and Baselines

Models for bankruptcy prediction are useful in several real-world scenar...
research
06/26/2020

AutoRec: An Automated Recommender System

Realistic recommender systems are often required to adapt to ever-changi...
research
06/08/2022

DebiasBench: Benchmark for Fair Comparison of Debiasing in Image Classification

Image classifiers often rely overly on peripheral attributes that have a...

Please sign up or login with your details

Forgot password? Click here to reset