Yichang Zhang

research

∙ 12/19/2022

Transferring General Multimodal Pretrained Models to Text Recognition

This paper proposes a new method, OFA-OCR, to transfer multimodal pretra...

0 Junyang Lin, et al. ∙

research

∙ 12/08/2022

OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models

Generalist models, which are capable of performing diverse multi-modal t...

0 Jinze Bai, et al. ∙

research

∙ 11/02/2022

Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese

The tremendous success of CLIP (Radford et al., 2021) has promoted the r...

0 An Yang, et al. ∙

research

∙ 05/31/2021

Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation

Table-to-text generation refers to generating a descriptive text from a ...

0 Peng Wang, et al. ∙

research

∙ 03/01/2021

M6: A Chinese Multimodal Pretrainer

In this work, we construct the largest dataset for multimodal pretrainin...

14 Junyang Lin, et al. ∙

research

∙ 12/02/2020

Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains

Pre-trained language models have been applied to various NLP tasks with ...

0 Haojie Pan, et al. ∙

research

∙ 09/28/2020

Graph-based Multi-hop Reasoning for Long Text Generation

Long text generation is an important but challenging task.The main probl...

0 Liang Zhao, et al. ∙

research

∙ 03/30/2020

InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining

Multi-modal pretraining for learning high-level multi-modal representati...

0 Junyang Lin, et al. ∙

research

∙ 08/15/2019

Towards Knowledge-Based Recommender Dialog System

In this paper, we propose a novel end-to-end framework called KBRD, whic...

0 Qibin Chen, et al. ∙

research

∙ 03/29/2019

Towards Knowledge-Based Personalized Product Description Generation in E-commerce

Quality product descriptions are critical for providing competitive cust...

0 Qibin Chen, et al. ∙

Yichang Zhang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro