VisualTextRank: Unsupervised Graph-based Content Extraction for Automating Ad Text to Image Search

08/05/2021
by Shaunak Mishra, et al.

Numerous online stock image libraries offer high-quality yet copyright-free images for use in marketing campaigns. To assist advertisers in navigating such third-party libraries, we study the problem of automatically fetching relevant ad images given the ad text (via a short textual query for images). Motivated by our observations in logged data on ad image search queries (given ad text), we formulate a keyword extraction problem, where a keyword extracted from the ad text (or its augmented version) serves as the ad image query. In this context, we propose VisualTextRank: an unsupervised method to (i) augment input ad text using semantically similar ads, and (ii) extract the image query from the augmented ad text. VisualTextRank builds on prior work on graph-based content extraction (biased TextRank in particular) by leveraging both the text and image of similar ads for better keyword extraction, and using advertiser-category-specific biasing with sentence-BERT embeddings. Using data collected from the Verizon Media Native (Yahoo Gemini) ad platform's stock image search feature for onboarding advertisers, we demonstrate the superiority of VisualTextRank compared to competitive keyword extraction baselines (including an 11% accuracy lift over biased TextRank). For the case when the stock image library is restricted to English queries, we show the effectiveness of VisualTextRank on multilingual ads (translated to English) while leveraging semantically similar English ads. Online tests with a simplified version of VisualTextRank led to a 28.7% increase in the usage of stock image search, and a 41.6% increase in the advertiser onboarding rate in the Verizon Media Native ad platform.
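The two steps named above (augmenting the ad text with similar ads, then ranking words on a graph with a bias) can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a plain word co-occurrence graph, and it replaces the sentence-BERT category similarity with hand-set bias weights. The function names (`cooccurrence_graph`, `biased_textrank`, `extract_query`), the toy tokens, and the bias values are all hypothetical.

```python
from collections import defaultdict


def cooccurrence_graph(texts, window=2):
    """Build an undirected graph linking words that occur within `window` positions.

    `texts` is a list of token lists; edges never cross text boundaries.
    """
    graph = defaultdict(set)
    for tokens in texts:
        for i, word in enumerate(tokens):
            for other in tokens[i + 1 : i + 1 + window]:
                if other != word:
                    graph[word].add(other)
                    graph[other].add(word)
    return dict(graph)


def biased_textrank(graph, bias=None, damping=0.85, iterations=50):
    """Run PageRank whose restart distribution follows `bias` (uniform if None)."""
    nodes = list(graph)
    weights = {n: (1.0 if bias is None else bias.get(n, 0.0)) for n in nodes}
    total = sum(weights.values())
    if total == 0:  # fall back to a uniform restart if no node carries bias
        weights = {n: 1.0 for n in nodes}
        total = float(len(nodes))
    restart = {n: weights[n] / total for n in nodes}
    score = dict(restart)
    for _ in range(iterations):
        score = {
            n: (1 - damping) * restart[n]
            + damping * sum(score[m] / len(graph[m]) for m in graph[n])
            for n in nodes
        }
    return score


def extract_query(ad_tokens, similar_ad_tokens, bias):
    """Augment the ad text with tokens from similar ads, then return the top word."""
    graph = cooccurrence_graph([ad_tokens, similar_ad_tokens])
    scores = biased_textrank(graph, bias)
    return max(scores, key=scores.get), scores


# Toy inputs (hypothetical). `similar_ad_tokens` stands in for text and image
# tags of semantically similar ads; `bias` stands in for the similarity between
# each word and the advertiser category, which the paper computes with
# sentence-BERT embeddings.
ad_tokens = ["book", "cheap", "flights", "today"]
similar_ad_tokens = ["flights", "travel", "airplane", "deals"]
bias = {"travel": 0.9, "airplane": 0.8, "flights": 0.6, "deals": 0.2,
        "book": 0.1, "cheap": 0.1, "today": 0.1}

query, scores = extract_query(ad_tokens, similar_ad_tokens, bias)
print(query)
```

The bias only changes where the random walk restarts, so a word that is both well connected in the augmented text and close to the advertiser category tends to rank highest. Note that in this sketch a word such as "travel" can be returned even though it appears only in the similar ads, which mirrors the idea of extracting the query from the augmented ad text.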


