Towards Data Quality Assessment in Online Advertising

11/30/2017
by   Sahin Cem Geyik, et al.
0

In online advertising, our aim is to match the advertisers with the most relevant users to optimize the campaign performance. In the pursuit of achieving this goal, multiple data sources provided by the advertisers or third-party data providers are utilized to choose the set of users according to the advertisers' targeting criteria. In this paper, we present a framework that can be applied to assess the quality of such data sources in large scale. This framework efficiently evaluates the similarity of a specific data source categorization to that of the ground truth, especially for those cases when the ground truth is accessible only in aggregate, and the user-level information is anonymized or unavailable due to privacy reasons. We propose multiple methodologies within this framework, present some preliminary assessment results, and evaluate how the methodologies compare to each other. We also present two use cases where we can utilize the data quality assessment results: the first use case is targeting specific user categories, and the second one is forecasting the desirable audiences we can reach for an online advertising campaign with pre-set targeting criteria.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/29/2020

A Scalable Framework for Quality Assessment of RDF Datasets

Over the last years, Linked Data has grown continuously. Today, we count...
research
06/23/2016

Gender and Interest Targeting for Sponsored Post Advertising at Tumblr

As one of the leading platforms for creative content, Tumblr offers adve...
research
01/10/2020

Subjective Annotation for a Frame Interpolation Benchmark using Artifact Amplification

Current benchmarks for optical flow algorithms evaluate the estimation e...
research
01/31/2022

Eris: Measuring discord among multidimensional data sources

Data integration is a classical problem in databases, typically decompos...
research
02/24/2015

Multi-Touch Attribution Based Budget Allocation in Online Advertising

Budget allocation in online advertising deals with distributing the camp...
research
12/29/2018

Cross-Device Tracking: Systematic Method to Detect and Measure CDT

Online advertising, the backbone of the free Web, has transformed the ma...
research
06/15/2020

Algebraic Ground Truth Inference: Non-Parametric Estimation of Sample Errors by AI Algorithms

Binary classification is widely used in ML production systems. Monitorin...

Please sign up or login with your details

Forgot password? Click here to reset