Visual Intelligence through Human Interaction

11/12/2021
by   Ranjay Krishna, et al.
14

Over the last decade, Computer Vision, the branch of Artificial Intelligence aimed at understanding the visual world, has evolved from simply recognizing objects in images to describing pictures, answering questions about images, aiding robots maneuver around physical spaces and even generating novel visual content. As these tasks and applications have modernized, so too has the reliance on more data, either for model training or for evaluation. In this chapter, we demonstrate that novel interaction strategies can enable new forms of data collection and evaluation for Computer Vision. First, we present a crowdsourcing interface for speeding up paid data collection by an order of magnitude, feeding the data-hungry nature of modern vision models. Second, we explore a method to increase volunteer contributions using automated social interventions. Third, we develop a system to ensure human evaluation of generative vision models are reliable, affordable and grounded in psychophysics theory. We conclude with future opportunities for Human-Computer Interaction to aid Computer Vision.

READ FULL TEXT

page 10

page 22

page 36

research
11/07/2016

Crowdsourcing in Computer Vision

Computer vision systems require large amounts of manually annotated data...
research
11/19/2021

Sketch-based Creativity Support Tools using Deep Learning

Sketching is a natural and effective visual communication medium commonl...
research
07/11/2019

MeetUp! A Corpus of Joint Activity Dialogues in a Visual Environment

Building computer systems that can converse about their visual environme...
research
04/12/2016

Attributes as Semantic Units between Natural Language and Visual Recognition

Impressive progress has been made in the fields of computer vision and n...
research
05/03/2018

InceptB: A CNN Based Classification Approach for Recognizing Traditional Bengali Games

Sports activities are an integral part of our day to day life. Introduci...
research
08/16/2023

Flickr Africa: Examining Geo-Diversity in Large-Scale, Human-Centric Visual Data

Biases in large-scale image datasets are known to influence the performa...
research
12/18/2018

DeepLens: Towards a Visual Data Management System

Advances in deep learning have greatly widened the scope of automatic co...

Please sign up or login with your details

Forgot password? Click here to reset