M2FN: Multi-step Modality Fusion for Advertisement Image Assessment

by Kyung-Wha Park, et al.

Assessing advertisements, specifically on the basis of user preferences and ad quality, is crucial to the marketing industry. Although recent studies have attempted to use deep neural networks for this purpose, they have not utilized image-related auxiliary attributes, such as the embedded text frequently found in ad images. We therefore investigated the influence of these attributes on ad image preferences. First, we analyzed large-scale real-world ad log data and, based on our findings, proposed a novel multi-step modality fusion network (M2FN) that identifies the advertising images most likely to match user preferences. Our method utilizes auxiliary attributes at multiple steps in the network, including conditional batch normalization-based low-level fusion and attention-based high-level fusion. We verified M2FN on the AVA dataset, which is widely used for aesthetic image assessment, and then demonstrated that M2FN achieves state-of-the-art performance in preference prediction on a real-world ad dataset with rich auxiliary attributes.
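The two fusion steps named in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: the function names, the projection weights `w_gamma` and `w_beta`, the use of flat feature vectors instead of convolutional feature maps, and the dot-product scoring are all assumptions made for illustration. The first function applies batch normalization whose scale and shift are predicted from an auxiliary attribute vector (the general idea of conditional batch normalization); the second pools image region features with softmax attention weights driven by an auxiliary embedding.

```python
import math


def conditional_batch_norm(x, aux, w_gamma, w_beta, eps=1e-5):
    """Normalize x per channel over the batch, then scale/shift per sample.

    x:       [B][C] batch of feature vectors (hypothetical image features)
    aux:     [B][A] batch of auxiliary attribute vectors
    w_gamma: [A][C] hypothetical weights predicting the scale offset
    w_beta:  [A][C] hypothetical weights predicting the shift
    """
    batch, channels = len(x), len(x[0])
    out = [[0.0] * channels for _ in range(batch)]
    for c in range(channels):
        col = [row[c] for row in x]
        mean = sum(col) / batch
        var = sum((v - mean) ** 2 for v in col) / batch
        for b in range(batch):
            # Scale and shift depend on the auxiliary attributes of sample b.
            gamma = 1.0 + sum(a * w[c] for a, w in zip(aux[b], w_gamma))
            beta = sum(a * w[c] for a, w in zip(aux[b], w_beta))
            out[b][c] = gamma * (x[b][c] - mean) / math.sqrt(var + eps) + beta
    return out


def attention_fusion(regions, query):
    """Pool region features with softmax weights from dot-product scores.

    regions: [N][D] image region features
    query:   [D] auxiliary embedding used as the attention query (assumed)
    """
    scores = [sum(r_d * q_d for r_d, q_d in zip(r, query)) for r in regions]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    fused = [
        sum(w * r[d] for w, r in zip(weights, regions))
        for d in range(len(query))
    ]
    return fused, weights
```

With all-zero `w_gamma` and `w_beta`, the first function reduces to plain batch normalization, which makes it easy to see that the auxiliary attributes only modulate the normalized features rather than replace them.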




Which Ads to Show? Advertisement Image Assessment with Auxiliary Information via Multi-step Modality Fusion

Assessing aesthetic preference is a fundamental task related to human co...

Exploring Online Ad Images Using a Deep Convolutional Neural Network Approach

Online advertising is a huge, rapidly growing advertising market in toda...

Deep Spatio-Temporal Neural Networks for Click-Through Rate Prediction

Click-through rate (CTR) prediction is a critical task in online adverti...

Image Matters: Jointly Train Advertising CTR Model with Image Representation of Ad and User Behavior

Click Through Rate(CTR) prediction is vital for online advertising syste...

Multi-Manifold Learning for Large-scale Targeted Advertising System

Messenger advertisements (ads) give direct and personal user experience ...

The New Modality: Emoji Challenges in Prediction, Anticipation, and Retrieval

Over the past decade, emoji have emerged as a new and widespread form of...