Sort Story: Sorting Jumbled Images and Captions into Stories

06/23/2016
by Harsh Agrawal, et al.

Temporal common sense has applications in AI tasks such as QA, multi-document summarization, and human-AI communication. We propose the task of sequencing: given a jumbled set of aligned image-caption pairs that belong to a story, sort them so that the output sequence forms a coherent story. We present multiple approaches, based on unary (position) and pairwise (order) predictions and their ensemble-based combinations, that achieve strong results on this task. We use both text-based and image-based features, which yield complementary improvements. Using qualitative examples, we demonstrate that our models have learnt interesting aspects of temporal common sense.
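
To make the idea of combining unary (position) and pairwise (order) predictions concrete, here is a minimal sketch in Python. It is not the authors' implementation: the function names, the `weight` parameter, the linear combination of the two terms, and the exhaustive search over permutations are all assumptions chosen for illustration, and the scores are made-up numbers rather than model outputs. It assumes `unary[i][p]` scores item `i` at position `p`, and `pairwise[i][j]` scores item `i` appearing before item `j`.

```python
from itertools import permutations


def sequence_score(order, unary, pairwise, weight=0.5):
    """Score a candidate ordering, where order[p] is the item placed at position p.

    `weight` (hypothetical) balances the unary and pairwise terms.
    """
    # Unary term: how well each item fits the position it was assigned.
    unary_term = sum(unary[item][pos] for pos, item in enumerate(order))
    # Pairwise term: for every pair, how plausible it is that the earlier
    # item in this ordering really precedes the later one.
    pairwise_term = sum(
        pairwise[order[a]][order[b]]
        for a in range(len(order))
        for b in range(a + 1, len(order))
    )
    return weight * unary_term + (1 - weight) * pairwise_term


def best_order(unary, pairwise, weight=0.5):
    """Exhaustively search all orderings; only feasible for short stories."""
    n = len(unary)
    return max(
        permutations(range(n)),
        key=lambda order: sequence_score(order, unary, pairwise, weight),
    )


# Illustrative (made-up) scores for three image-caption pairs.
unary = [
    [0.1, 0.8, 0.1],  # item 0 most likely belongs in the middle
    [0.2, 0.2, 0.6],  # item 1 most likely belongs at the end
    [0.7, 0.2, 0.1],  # item 2 most likely belongs at the start
]
pairwise = [
    [0.0, 0.9, 0.2],
    [0.1, 0.0, 0.3],
    [0.8, 0.7, 0.0],
]

predicted = best_order(unary, pairwise)
print(predicted)
```

Exhaustive search keeps the sketch short and exact for a handful of items; a longer sequence would need a different search strategy, since the number of permutations grows factorially.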


