BreakingNews: Article Annotation by Image and Text Processing

03/23/2016
by Arnau Ramisa, et al.

Building on recent deep neural network architectures, current approaches at the intersection of computer vision and natural language processing have achieved unprecedented breakthroughs in tasks such as automatic captioning and image retrieval. Most of these learning methods, though, rely on large training sets of images paired with human annotations that specifically describe the visual content. In this paper we go a step further and explore the more complex cases where textual descriptions are only loosely related to the images. We focus on the particular domain of news articles, in which the textual content often expresses connotative and ambiguous relations that are only suggested by, but cannot be directly inferred from, the images. We introduce new deep learning methods that address source detection, popularity prediction, article illustration and geolocation of articles. We propose an adaptive CNN architecture that shares most of its structure across all the tasks and is suitable for multitask and transfer learning. Deep Canonical Correlation Analysis is deployed for article illustration, and a new loss function based on Great Circle Distance is proposed for geolocation. Furthermore, we present BreakingNews, a novel dataset of approximately 100K news articles that include images, text and captions, enriched with heterogeneous metadata (such as GPS coordinates and popularity metrics). We show that this dataset is appropriate for exploring all of the aforementioned problems, for which we provide baseline performance using various deep learning architectures and different representations of the textual and visual features. We report very promising results and bring to light several limitations of the current state of the art in this kind of domain, which we hope will help spur progress in the field.
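To make the quantity behind the geolocation loss concrete, here is a minimal sketch of a great-circle distance between two GPS coordinates, averaged over a batch. The abstract does not give the exact form of the proposed loss, so this is an illustration only. The haversine formula, the Earth radius constant, and the function names `great_circle_distance_km` and `mean_gcd_loss` are all assumptions of this sketch, not the authors' implementation.

```python
import math

# Mean Earth radius in kilometres (an assumption of this sketch).
EARTH_RADIUS_KM = 6371.0


def great_circle_distance_km(lat1, lon1, lat2, lon2):
    """Haversine great-circle distance between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    # Clamp to 1.0 to guard against rounding slightly above the asin domain.
    return 2 * EARTH_RADIUS_KM * math.asin(min(1.0, math.sqrt(a)))


def mean_gcd_loss(predicted, target):
    """Mean great-circle distance (km) over paired lists of (lat, lon) tuples.

    Hypothetical helper: one plausible way to turn the distance into a
    scalar training signal, not necessarily the loss used in the paper.
    """
    if len(predicted) != len(target) or not predicted:
        raise ValueError("predicted and target must be non-empty and equal in length")
    total = sum(
        great_circle_distance_km(p[0], p[1], t[0], t[1])
        for p, t in zip(predicted, target)
    )
    return total / len(predicted)
```

Unlike a plain Euclidean error on latitude and longitude, a distance of this kind accounts for the curvature of the Earth and for the wrap-around of longitude at the antimeridian.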


