On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research

04/24/2023
by Luiza Pozzobon, et al.

Perception of toxicity evolves over time and often differs between geographies and cultural backgrounds. Similarly, black-box, commercially available APIs for detecting toxicity, such as the Perspective API, are not static but are frequently retrained to address unattended weaknesses and biases. We evaluate the implications of these changes for the reproducibility of findings that compare the relative merits of models and methods that aim to curb toxicity. Our findings suggest that research that relied on inherited automatic toxicity scores to compare models and techniques may have resulted in inaccurate findings. Rescoring all models from HELM, a widely respected living benchmark, for toxicity with the recent version of the API led to a different ranking of widely used foundation models. We suggest caution in applying apples-to-apples comparisons between studies and lay out recommendations for a more structured approach to evaluating toxicity over time. Code and data are available at https://github.com/for-ai/black-box-api-challenges.
