Measuring Machine Intelligence Through Visual Question Answering

08/31/2016
by   C. Lawrence Zitnick, et al.
0

As machines have become more intelligent, there has been a renewed interest in methods for measuring their intelligence. A common approach is to propose tasks for which a human excels, but one which machines find difficult. However, an ideal task should also be easy to evaluate and not be easily gameable. We begin with a case study exploring the recently popular task of image captioning and its limitations as a task for measuring machine intelligence. An alternative and more promising task is Visual Question Answering that tests a machine's ability to reason about language and vision. We describe a dataset unprecedented in size created for the task that contains over 760,000 human generated questions about images. Using around 10 million human generated answers, machines may be easily evaluated.

READ FULL TEXT

page 2

page 3

page 4

page 5

research
03/29/2018

Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering

Human conversation is a complex mechanism with subtle nuances. It is hen...
research
09/21/2016

The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA)

Visual Question Answering (VQA) task has showcased a new stage of intera...
research
11/19/2015

Dynamic Adaptive Network Intelligence

Accurate representational learning of both the explicit and implicit rel...
research
04/26/2023

HeySQuAD: A Spoken Question Answering Dataset

Human-spoken questions are critical to evaluating the performance of spo...
research
02/24/2022

Measuring CLEVRness: Blackbox testing of Visual Reasoning Models

How can we measure the reasoning capabilities of intelligence systems? V...
research
01/14/2015

Hard to Cheat: A Turing Test based on Answering Questions about Images

Progress in language and image understanding by machines has sparkled th...
research
11/14/2022

What Images are More Memorable to Machines?

This paper studies the problem of measuring and predicting how memorable...

Please sign up or login with your details

Forgot password? Click here to reset