Which Ads to Show? Advertisement Image Assessment with Auxiliary Information via Multi-step Modality Fusion

10/06/2019
by   Kyung-Wha Park, et al.
0

Assessing aesthetic preference is a fundamental task related to human cognition. It can also contribute to various practical applications such as image creation for online advertisements. Despite crucial influences of image quality, auxiliary information of ad images such as tags and target subjects can also determine image preference. Existing studies mainly focus on images and thus are less useful for advertisement scenarios where rich auxiliary data are available. Here we propose a modality fusion-based neural network that evaluates the aesthetic preference of images with auxiliary information. Our method fully utilizes auxiliary data by introducing multi-step modality fusion using both conditional batch normalization-based low-level and attention-based high-level fusion mechanisms, inspired by the findings from statistical analyses on real advertisement data. Our approach achieved state-of-the-art performance on the AVA dataset, a widely used dataset for aesthetic assessment. Besides, the proposed method is evaluated on large-scale real-world advertisement image data with rich auxiliary attributes, providing promising preference prediction results. Through extensive experiments, we investigate how image and auxiliary information together influence click-through rate.

READ FULL TEXT

page 3

page 7

research
01/31/2021

M2FN: Multi-step Modality Fusion for Advertisement Image Assessment

Assessing advertisements, specifically on the basis of user preferences ...
research
12/18/2019

A Cross-Modal Image Fusion Theory Guided by Human Visual Characteristics

The characteristics of feature selection, nonlinear combination and mult...
research
11/28/2022

Pitfalls of Conditional Batch Normalization for Contextual Multi-Modal Learning

Humans have perfected the art of learning from multiple modalities throu...
research
09/23/2021

Cross Attention-guided Dense Network for Images Fusion

In recent years, various applications in computer vision have achieved s...
research
12/23/2019

Cross-Modal Image Fusion Theory Guided by Subjective Visual Attention

The human visual perception system has very strong robustness and contex...
research
08/15/2020

Cross-Modality Multi-Atlas Segmentation Using Deep Neural Networks

Both image registration and label fusion in the multi-atlas segmentation...
research
12/24/2016

PixelCNN Models with Auxiliary Variables for Natural Image Modeling

We study probabilistic models of natural images and extend the autoregre...

Please sign up or login with your details

Forgot password? Click here to reset