This paper proposes a new method, OFA-OCR, to transfer multimodal pretra...
Generalist models, which are capable of performing diverse multi-modal t...
The tremendous success of CLIP (Radford et al., 2021) has promoted the
r...
Table-to-text generation refers to generating a descriptive text from a
...
In this work, we construct the largest dataset for multimodal pretrainin...
Pre-trained language models have been applied to various NLP tasks with
...
Long text generation is an important but challenging task. The main probl...
Multi-modal pretraining for learning high-level multi-modal representati...
In this paper, we propose a novel end-to-end framework called KBRD, whic...
Quality product descriptions are critical for providing competitive cust...