Human or Machine? Turing Tests for Vision and Language

11/23/2022
by Mengmi Zhang, et al.
As AI algorithms increasingly participate in daily activities that used to be the sole province of humans, we are inevitably called upon to consider how much machines are really like us. To address this question, we turn to the Turing test and systematically benchmark current AIs in their abilities to imitate humans. We establish a methodology to evaluate humans versus machines in Turing-like tests and systematically evaluate a representative set of selected domains, parameters, and variables. The experiments involved testing 769 human agents, 24 state-of-the-art AI agents, 896 human judges, and 8 AI judges, in 21,570 Turing tests across 6 tasks encompassing vision and language modalities. Surprisingly, the results reveal that current AIs are not far from being able to impersonate human judges across different ages, genders, and educational levels in complex visual and language challenges. In contrast, simple AI judges outperform human judges in distinguishing human answers from machine answers. The curated large-scale Turing test datasets introduced here and their evaluation metrics provide valuable insights to assess whether an agent is human or not. The proposed formulation to benchmark human imitation ability in current AIs paves the way for the research community to expand Turing tests to other research areas and conditions. All source code and data are publicly available at https://tinyurl.com/8x8nha7p

research
01/07/2021

A design of human-like robust AI machines in object identification

This is a perspective paper inspired from the study of Turing Test propo...
research
06/09/2019

There is no general AI: Why Turing machines cannot pass the Turing test

Since 1950, when Alan Turing proposed what has since come to be called t...
research
10/29/2014

Towards a Visual Turing Challenge

As language and visual understanding by machines progresses rapidly, we ...
research
09/16/2023

A Statistical Turing Test for Generative Models

The emergence of human-like abilities of AI systems for content generati...
research
06/24/2022

A Test for Evaluating Performance in Human-Computer Systems

The Turing test for comparing computer performance to that of humans is ...
research
11/15/2019

A Turing Test for Crowds

The realism and believability of crowd simulations underpins computation...
research
02/04/2014

User Friendly Line CAPTCHAs

CAPTCHAs or reverse Turing tests are real-time assessments used by progr...
