Contextual Media Retrieval Using Natural Language Queries

02/16/2016
by   Sreyasi Nag Chowdhury, et al.
0

The widespread integration of cameras in hand-held and head-worn devices as well as the ability to share content online enables a large and diverse visual capture of the world that millions of users build up collectively every day. We envision these images as well as associated meta information, such as GPS coordinates and timestamps, to form a collective visual memory that can be queried while automatically taking the ever-changing context of mobile users into account. As a first step towards this vision, in this work we present Xplore-M-Ego: a novel media retrieval system that allows users to query a dynamic database of images and videos using spatio-temporal natural language queries. We evaluate our system using a new dataset of real user queries as well as through a usability study. One key finding is that there is a considerable amount of inter-user variability, for example in the resolution of spatial relations in natural language utterances. We show that our retrieval system can cope with this variability using personalisation through an online learning-based retrieval formulation.

READ FULL TEXT

page 1

page 6

research
02/13/2023

Dataset of Natural Language Queries for E-Commerce

Shopping online is more and more frequent in our everyday life. For e-co...
research
05/30/2019

The Fashion IQ Dataset: Retrieving Images by Combining Side Information and Relative Natural Language Feedback

We contribute a new dataset and a novel method for natural language base...
research
08/30/2023

Text-to-OverpassQL: A Natural Language Interface for Complex Geodata Querying of OpenStreetMap

We present Text-to-OverpassQL, a task designed to facilitate a natural l...
research
07/02/2017

Where to Play: Retrieval of Video Segments using Natural-Language Queries

In this paper, we propose a new approach for retrieval of video segments...
research
11/08/2022

Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation

People capture photos and videos to relive and share memories of persona...
research
09/05/2017

Cross-Media Similarity Evaluation for Web Image Retrieval in the Wild

In order to retrieve unlabeled images by textual queries, cross-media si...
research
08/01/2019

A Natural-language-based Visual Query Approach of Uncertain Human Trajectories

Visual querying is essential for interactively exploring massive traject...

Please sign up or login with your details

Forgot password? Click here to reset