Computer Vision and Conflicting Values: Describing People with Automated Alt Text

05/26/2021
by   Margot Hanley, et al.
0

Scholars have recently drawn attention to a range of controversial issues posed by the use of computer vision for automatically generating descriptions of people in images. Despite these concerns, automated image description has become an important tool to ensure equitable access to information for blind and low vision people. In this paper, we investigate the ethical dilemmas faced by companies that have adopted the use of computer vision for producing alt text: textual descriptions of images for blind and low vision people, We use Facebook's automatic alt text tool as our primary case study. First, we analyze the policies that Facebook has adopted with respect to identity categories, such as race, gender, age, etc., and the company's decisions about whether to present these terms in alt text. We then describe an alternative – and manual – approach practiced in the museum community, focusing on how museums determine what to include in alt text descriptions of cultural artifacts. We compare these policies, using notable points of contrast to develop an analytic framework that characterizes the particular apprehensions behind these policy choices. We conclude by considering two strategies that seem to sidestep some of these concerns, finding that there are no easy ways to avoid the normative dilemmas posed by the use of computer vision to automate alt text.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/06/2021

Ethics and Creativity in Computer Vision

This paper offers a retrospective of what we learnt from organizing the ...
research
02/25/2022

se-Shweshwe Inspired Fashion Generation

Fashion is one of the ways in which we show ourselves to the world. It i...
research
02/16/2015

Image Specificity

For some images, descriptions written by multiple people are consistent ...
research
12/16/2019

Towards Fairer Datasets: Filtering and Balancing the Distribution of the People Subtree in the ImageNet Hierarchy

Computer vision technology is being used by many but remains representat...
research
03/27/2021

Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review

Research in the area of Vision and Language encompasses challenging topi...
research
06/24/2020

Large image datasets: A pyrrhic win for computer vision?

In this paper we investigate problematic practices and consequences of l...
research
11/29/2016

Photographic home styles in Congress: a computer vision approach

While members of Congress now routinely communicate with constituents us...

Please sign up or login with your details

Forgot password? Click here to reset