Despite the excellent performance of large-scale vision-language pre-tra...
Despite the remarkable success of pre-trained language models (PLMs), th...
Visual Question Answering (VQA) models are prone to learn the shortcut s...
Models for Visual Question Answering (VQA) often rely on the spurious co...
Recent studies on the lottery ticket hypothesis (LTH) show that pre-trai...
Recently, knowledge distillation (KD) has shown great success in BERT co...
Pre-trained language models of the BERT family have defined the state-of...
Zero-shot intent detection (ZSID) aims to deal with the continuously eme...
Recently, attention-based encoder-decoder models have been used extensiv...
Recently, unsupervised pre-training has been gaining popularity in ...
In image-grounded text generation, fine-grained representations of the i...
The encoder-decoder framework has shown recent success in image captionin...