Flickr Africa: Examining Geo-Diversity in Large-Scale, Human-Centric Visual Data

08/16/2023
by   Keziah Naggita, et al.
0

Biases in large-scale image datasets are known to influence the performance of computer vision models as a function of geographic context. To investigate the limitations of standard Internet data collection methods in low- and middle-income countries, we analyze human-centric image geo-diversity on a massive scale using geotagged Flickr images associated with each nation in Africa. We report the quantity and content of available data with comparisons to population-matched nations in Europe as well as the distribution of data according to fine-grained intra-national wealth estimates. Temporal analyses are performed at two-year intervals to expose emerging data trends. Furthermore, we present findings for an “othering” phenomenon as evidenced by a substantial number of images from Africa being taken by non-local photographers. The results of our study suggest that further work is required to capture image data representative of African people and their environments and, ultimately, to improve the applicability of computer vision models in a global context.

READ FULL TEXT

page 2

page 5

page 8

page 13

page 32

page 33

page 34

page 35

research
09/05/2018

BOLD5000: A public fMRI dataset of 5000 images

Vision science, particularly machine vision, has been revolutionized by ...
research
05/27/2020

NDD20: A large-scale few-shot dolphin dataset for coarse and fine-grained categorisation

We introduce the Northumberland Dolphin Dataset 2020 (NDD20), a challeng...
research
12/16/2019

Towards Fairer Datasets: Filtering and Balancing the Distribution of the People Subtree in the ImageNet Hierarchy

Computer vision technology is being used by many but remains representat...
research
02/16/2022

Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision

Discriminative self-supervised learning allows training models on any ra...
research
10/03/2018

Image as Data: Automated Visual Content Analysis for Political Science

Image data provide unique information about political events, actors, an...
research
11/12/2021

Visual Intelligence through Human Interaction

Over the last decade, Computer Vision, the branch of Artificial Intellig...
research
11/29/2016

Photographic home styles in Congress: a computer vision approach

While members of Congress now routinely communicate with constituents us...

Please sign up or login with your details

Forgot password? Click here to reset