A very preliminary analysis of DALL-E 2

04/25/2022
by   Gary Marcus, et al.
47

The DALL-E 2 system generates original synthetic images corresponding to an input text as caption. We report here on the outcome of fourteen tests of this system designed to assess its common sense, reasoning and ability to understand complex texts. All of our prompts were intentionally much more challenging than the typical ones that have been showcased in recent weeks. Nevertheless, for 5 out of the 14 prompts, at least one of the ten images fully satisfied our requests. On the other hand, on no prompt did all of the ten images satisfy our requests.

READ FULL TEXT

page 3

page 6

page 8

page 9

page 10

page 11

page 12

page 13

research
07/05/2018

An Insight into the Pull Requests of GitHub

Given the increasing number of unsuccessful pull requests in GitHub proj...
research
03/24/2023

Testability Refactoring in Pull Requests: Patterns and Trends

To create unit tests, it may be necessary to refactor the production cod...
research
03/20/2023

Mind meets machine: Unravelling GPT-4's cognitive psychology

Commonsense reasoning is a basic ingredient of intelligence in humans, e...
research
11/30/2015

Incidental Scene Text Understanding: Recent Progresses on ICDAR 2015 Robust Reading Competition Challenge 4

Different from focused texts present in natural images, which are captur...
research
08/06/2019

Logic could be learned from images

Logic reasoning is a significant ability of human intelligence and also ...
research
12/06/2018

Neural Image Decompression: Learning to Render Better Image Previews

A rapidly increasing portion of Internet traffic is dominated by request...

Please sign up or login with your details

Forgot password? Click here to reset