Challenge AI Mind: A Crowd System for Proactive AI Testing

10/21/2018
by   Siwei Fu, et al.
6

Artificial Intelligence (AI) has burrowed into our lives in various aspects; however, without appropriate testing, deployed AI systems are often being criticized to fail in critical and embarrassing cases. Existing testing approaches mainly depend on fixed and pre-defined datasets, providing a limited testing coverage. In this paper, we propose the concept of proactive testing to dynamically generate testing data and evaluate the performance of AI systems. We further introduce Challenge.AI, a new crowd system that features the integration of crowdsourcing and machine learning techniques in the process of error generation, error validation, error categorization, and error analysis. We present experiences and insights into a participatory design with AI developers. The evaluation shows that the crowd workflow is more effective with the help of machine learning techniques. AI developers found that our system can help them discover unknown errors made by the AI models, and engage in the process of proactive testing.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 8

page 9

page 10

page 11

research
12/10/2019

Datamorphic Testing: A Methodology for Testing AI Applications

With the rapid growth of the applications of machine learning (ML) and o...
research
01/14/2022

Tools and Practices for Responsible AI Engineering

Responsible Artificial Intelligence (AI) - the practice of developing, e...
research
08/22/2018

Increasing Trust in AI Services through Supplier's Declarations of Conformity

The accuracy and reliability of machine learning algorithms are an impor...
research
09/02/2020

Estimating the Brittleness of AI: Safety Integrity Levels and the Need for Testing Out-Of-Distribution Performance

Test, Evaluation, Verification, and Validation (TEVV) for Artificial Int...
research
08/07/2023

Advancements In Crowd-Monitoring System: A Comprehensive Analysis of Systematic Approaches and Automation Algorithms: State-of-The-Art

Growing apprehensions surrounding public safety have captured the attent...
research
07/31/2023

Crowd Safety Manager: Towards Data-Driven Active Decision Support for Planning and Control of Crowd Events

This paper presents novel technology and methodology aimed at enhancing ...
research
02/11/2021

Testing Framework for Black-box AI Models

With widespread adoption of AI models for important decision making, ens...

Please sign up or login with your details

Forgot password? Click here to reset