Visually Grounded Continual Learning of Compositional Semantics

05/02/2020
by   Xisen Jin, et al.
0

Children's language acquisition from the visual world is a real-world example of continual learning from dynamic and evolving environments; yet we lack a realistic setup to study neural networks' capability in human-like language acquisition. In this paper, we propose a realistic setup by simulating children's language acquisition process. We formulate language acquisition as a masked language modeling task where the model visits a stream of data with continuously shifting distribution. Our training and evaluation encode two important challenges in human's language learning, namely the continual learning and the compositionality. We show the performance of existing continual learning algorithms is far from satisfactory. We also study the interactions between memory based continual learning algorithms and compositional generalization and conclude that overcoming overfitting and compositional overfitting may be crucial for a good performance in our problem setup. Our code and data can be found at https://github.com/INK-USC/VG-CCL.

READ FULL TEXT
research
04/24/2023

Renate: A Library for Real-World Continual Learning

Continual learning enables the incremental training of machine learning ...
research
07/05/2023

Human Inspired Progressive Alignment and Comparative Learning for Grounded Word Acquisition

Human language acquisition is an efficient, supervised, and continual pr...
research
09/16/2021

Evaluating Continual Learning Algorithms by Generating 3D Virtual Environments

Continual learning refers to the ability of humans and animals to increm...
research
05/05/2021

ADAM: A Sandbox for Implementing Language Learning

We present ADAM, a software system for designing and running child langu...
research
06/01/2022

Label-Efficient Online Continual Object Detection in Streaming Video

To thrive in evolving environments, humans are capable of continual acqu...
research
08/14/2023

CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation

Vision-Language Pretraining (VLP) has shown impressive results on divers...
research
07/24/2020

Mind Your Manners! A Dataset and A Continual Learning Approach for Assessing Social Appropriateness of Robot Actions

To date, endowing robots with an ability to assess social appropriatenes...

Please sign up or login with your details

Forgot password? Click here to reset