Multimodal Side-Tuning for Document Classification

01/16/2023
by   Stefano Pio Zingaro, et al.
11

In this paper, we propose to exploit the side-tuning framework for multimodal document classification. Side-tuning is a methodology for network adaptation recently introduced to solve some of the problems related to previous approaches. Thanks to this technique it is actually possible to overcome model rigidity and catastrophic forgetting of transfer learning by fine-tuning. The proposed solution uses off-the-shelf deep learning architectures leveraging the side-tuning framework to combine a base model with a tandem of two side networks. We show that side-tuning can be successfully employed also when different data sources are considered, e.g. text and images in document classification. The experimental results show that this approach pushes further the limit for document classification accuracy with respect to the state of the art.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2018

On transfer learning using a MAC model variant

We introduce a variant of the MAC model (Hudson and Manning, CVPR 2018) ...
research
05/30/2023

AdapterEM: Pre-trained Language Model Adaptation for Generalized Entity Matching using Adapter-tuning

Entity Matching (EM) involves identifying different data representations...
research
09/19/2023

Investigating the Catastrophic Forgetting in Multimodal Large Language Models

Following the success of GPT4, there has been a surge in interest in mul...
research
08/09/2021

Transfer Learning Gaussian Anomaly Detection by Fine-Tuning Representations

Current state-of-the-art Anomaly Detection (AD) methods exploit the powe...
research
02/24/2019

Medical Multimodal Classifiers Under Scarce Data Condition

Data is one of the essential ingredients to power deep learning research...
research
05/22/2022

muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems

Most uses of machine learning today involve training a model from scratc...
research
07/18/2023

Multimodal Machine Learning for Extraction of Theorems and Proofs in the Scientific Literature

Scholarly articles in mathematical fields feature mathematical statement...

Please sign up or login with your details

Forgot password? Click here to reset