Ethical Considerations for Collecting Human-Centric Image Datasets

02/07/2023
by   Jerone T. A. Andrews, et al.
5

Human-centric image datasets are critical to the development of computer vision technologies. However, recent investigations have foregrounded significant ethical issues related to privacy and bias, which have resulted in the complete retraction, or modification, of several prominent datasets. Recent works have tried to reverse this trend, for example, by proposing analytical frameworks for ethically evaluating datasets, the standardization of dataset documentation and curation practices, privacy preservation methodologies, as well as tools for surfacing and mitigating representational biases. Little attention, however, has been paid to the realities of operationalizing ethical data collection. To fill this gap, we present a set of key ethical considerations and practical recommendations for collecting more ethically-minded human-centric image data. Our research directly addresses issues of privacy and bias by contributing to the research community best practices for ethical data collection, covering purpose, privacy and consent, as well as diversity. We motivate each consideration by drawing on lessons from current practices, dataset withdrawals and audits, and analytical ethical frameworks. Our research is intended to augment recent scholarship, representing an important step toward more responsible data curation practices.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/27/2020

An Ethical Highlighter for People-Centric Dataset Creation

Important ethical concerns arising from computer vision datasets of peop...
research
03/17/2021

Best Practices for Collecting Gender and Sex Data

The measurement and analysis of human sex and gender is a nuanced proble...
research
05/03/2023

Considerations for Ethical Speech Recognition Datasets

Speech AI Technologies are largely trained on publicly available dataset...
research
08/06/2021

Mitigating dataset harms requires stewardship: Lessons from 1000 papers

Concerns about privacy, bias, and harmful applications have shone a ligh...
research
06/12/2022

Don't "research fast and break things": On the ethics of Computational Social Science

This article is concerned with setting up practical guardrails within th...
research
08/24/2021

Sharing Practices for Datasets Related to Accessibility and Aging

Datasets sourced from people with disabilities and older adults play an ...
research
04/03/2022

Data Cards: Purposeful and Transparent Dataset Documentation for Responsible AI

As research and industry moves towards large-scale models capable of num...

Please sign up or login with your details

Forgot password? Click here to reset