MultiZoo MultiBench: A Standardized Toolkit for Multimodal Deep Learning

06/28/2023
by   Paul Pu Liang, et al.
0

Learning multimodal representations involves integrating information from multiple heterogeneous sources of data. In order to accelerate progress towards understudied modalities and tasks while ensuring real-world robustness, we release MultiZoo, a public toolkit consisting of standardized implementations of > 20 core multimodal algorithms and MultiBench, a large-scale benchmark spanning 15 datasets, 10 modalities, 20 prediction tasks, and 6 research areas. Together, these provide an automated end-to-end machine learning pipeline that simplifies and standardizes data loading, experimental setup, and model evaluation. To enable holistic evaluation, we offer a comprehensive methodology to assess (1) generalization, (2) time and space complexity, and (3) modality robustness. MultiBench paves the way towards a better understanding of the capabilities and limitations of multimodal models, while ensuring ease of use, accessibility, and reproducibility. Our toolkits are publicly available, will be regularly updated, and welcome inputs from the community.

READ FULL TEXT
research
07/15/2021

MultiBench: Multiscale Benchmarks for Multimodal Representation Learning

Learning multimodal representations involves integrating information fro...
research
02/04/2018

End2You -- The Imperial Toolkit for Multimodal Profiling by End-to-End Learning

We introduce End2You -- the Imperial College London toolkit for multimod...
research
06/30/2022

MultiViz: An Analysis Benchmark for Visualizing and Understanding Multimodal Models

The promise of multimodal models for real-world applications has inspire...
research
03/02/2022

HighMMT: Towards Modality and Task Generalization for High-Modality Representation Learning

Learning multimodal representations involves discovering correspondences...
research
04/10/2023

On Robustness in Multimodal Learning

Multimodal learning is defined as learning over multiple heterogeneous i...
research
09/20/2022

FACT: Learning Governing Abstractions Behind Integer Sequences

Integer sequences are of central importance to the modeling of concepts ...
research
10/26/2022

Using multimodal learning and deep generative models for corporate bankruptcy prediction

This research introduces for the first time the concept of multimodal le...

Please sign up or login with your details

Forgot password? Click here to reset