Evaluating CLIP: Towards Characterization of Broader Capabilities and Downstream Implications

08/05/2021
by   Sandhini Agarwal, et al.
0

Recently, there have been breakthroughs in computer vision ("CV") models that are more generalizable with the advent of models such as CLIP and ALIGN. In this paper, we analyze CLIP and highlight some of the challenges such models pose. CLIP reduces the need for task specific training data, potentially opening up many niche tasks to automation. CLIP also allows its users to flexibly specify image classification classes in natural language, which we find can shift how biases manifest. Additionally, through some preliminary probes we find that CLIP can inherit biases found in prior computer vision systems. Given the wide and unpredictable domain of uses for such models, this raises questions regarding what sufficiently safe behaviour for such systems may look like. These results add evidence to the growing body of work calling for a change in the notion of a 'better' model–to move beyond simply looking at higher accuracy at task-oriented capability evaluations, and towards a broader 'better' that takes into account deployment-critical features such as different use contexts, and people who interact with the model when thinking about model deployment.

READ FULL TEXT
research
02/16/2023

Tuning computer vision models with task rewards

Misalignment between model predictions and intended usage can be detrime...
research
03/30/2023

A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision

There has been a recent explosion of computer vision models which perfor...
research
02/15/2022

Fairness Indicators for Systematic Assessments of Visual Feature Extractors

Does everyone equally benefit from computer vision systems? Answers to t...
research
09/05/2023

Tidying Up the Conversational Recommender Systems' Biases

The growing popularity of language models has sparked interest in conver...
research
04/11/2023

Toxicity in ChatGPT: Analyzing Persona-assigned Language Models

Large language models (LLMs) have shown incredible capabilities and tran...
research
12/10/2018

Studying oppressive cityscapes of Bangladesh

In a densely populated city like Dhaka (Bangladesh), a growing number of...

Please sign up or login with your details

Forgot password? Click here to reset