AIGC for Various Data Modalities: A Survey

08/27/2023
by Lin Geng Foo, et al.

AI-generated content (AIGC) methods aim to produce text, images, videos, 3D assets, and other media using AI algorithms. Owing to its wide range of applications and the demonstrated potential of recent works, AIGC has attracted considerable attention, and methods have been developed for various data modalities, such as image, video, text, 3D shape (as voxels, point clouds, meshes, and neural implicit fields), 3D scene, 3D human avatar (body and head), 3D motion, and audio, each presenting different characteristics and challenges. There have also been many significant developments in cross-modality AIGC methods, where a generative method receives conditioning input in one modality and produces output in another. Examples include going from various modalities to image, video, 3D shape, 3D scene, 3D avatar (body and head), 3D motion (skeleton and avatar), and audio. In this paper, we provide a comprehensive review of AIGC methods across different data modalities, including both single-modality and cross-modality methods, highlighting the challenges, representative works, and recent technical directions in each setting. We also present comparative results on several benchmark datasets in various modalities, and discuss the challenges and potential future research directions.
