Increasing trust in new data sources: crowdsourcing image classification for ecology

05/02/2023
by   Edgar Santos-Fernández, et al.
0

Crowdsourcing methods facilitate the production of scientific information by non-experts. This form of citizen science (CS) is becoming a key source of complementary data in many fields to inform data-driven decisions and study challenging problems. However, concerns about the validity of these data often constrain their utility. In this paper, we focus on the use of citizen science data in addressing complex challenges in environmental conservation. We consider this issue from three perspectives. First, we present a literature scan of papers that have employed Bayesian models with citizen science in ecology. Second, we compare several popular majority vote algorithms and introduce a Bayesian item response model that estimates and accounts for participants' abilities after adjusting for the difficulty of the images they have classified. The model also enables participants to be clustered into groups based on ability. Third, we apply the model in a case study involving the classification of corals from underwater images from the Great Barrier Reef, Australia. We show that the model achieved superior results in general and, for difficult tasks, a weighted consensus method that uses only groups of experts and experienced participants produced better performance measures. Moreover, we found that participants learn as they have more classification opportunities, which substantially increases their abilities over time. Overall, the paper demonstrates the feasibility of CS for answering complex and challenging ecological questions when these data are appropriately analysed. This serves as motivation for future work to increase the efficacy and trustworthiness of this emerging source of data.

READ FULL TEXT

page 1

page 10

research
06/01/2020

Correcting misclassification errors in crowdsourced ecological data: A Bayesian perspective

Many research domains use data elicited from "citizen scientists" when a...
research
03/16/2020

Bayesian item response models for citizen science ecological data

So-called citizen science data elicited from crowds has become increasin...
research
08/15/2018

Monitoring through many eyes: Integrating scientific and crowd-sourced datasets to improve monitoring of the Great Barrier Reef

Data in the Great Barrier Reef (GBR) are collected by numerous organisat...
research
06/17/2022

Crowdsourcing Relative Rankings of Multi-Word Expressions: Experts versus Non-Experts

In this study we investigate to which degree experts and non-experts agr...
research
09/07/2023

Data Type Agnostic Visual Sensitivity Analysis

Modern science and industry rely on computational models for simulation,...
research
11/30/2019

Fooling the Crowd with Deep Learning-based Methods

Modern, state-of-the-art deep learning approaches yield human like perfo...

Please sign up or login with your details

Forgot password? Click here to reset