Multimodal Learning with Transformers: A Survey

06/13/2022
by   Peng Xu, et al.
21

Transformer is a promising neural network learner, and has achieved great success in various machine learning tasks. Thanks to the recent prevalence of multimodal applications and big data, Transformer-based multimodal learning has become a hot topic in AI research. This paper presents a comprehensive survey of Transformer techniques oriented at multimodal data. The main contents of this survey include: (1) a background of multimodal learning, Transformer ecosystem, and the multimodal big data era, (2) a theoretical review of Vanilla Transformer, Vision Transformer, and multimodal Transformers, from a geometrically topological perspective, (3) a review of multimodal Transformer applications, via two important paradigms, i.e., for multimodal pretraining and for specific multimodal tasks, (4) a summary of the common challenges and designs shared by the multimodal Transformer models and applications, and (5) a discussion of open problems and potential research directions for the community.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2021

A Survey of Transformers

Transformers have achieved great success in many artificial intelligence...
research
02/17/2022

Transformer for Graphs: An Overview from Architecture Perspective

Recently, Transformer model, which has achieved great success in many ar...
research
10/12/2022

Foundation Transformers

A big convergence of model architectures across language, vision, speech...
research
09/25/2018

A Survey of Learning Causality with Data: Problems and Methods

The era of big data provides researchers with convenient access to copio...
research
09/18/2021

Multimodal Classification: Current Landscape, Taxonomy and Future Directions

Multimodal classification research has been gaining popularity in many d...
research
06/20/2022

M M Mix: A Multimodal Multiview Transformer Ensemble

This report describes the approach behind our winning solution to the 20...
research
09/06/2022

Fusion of Satellite Images and Weather Data with Transformer Networks for Downy Mildew Disease Detection

Crop diseases significantly affect the quantity and quality of agricultu...

Please sign up or login with your details

Forgot password? Click here to reset