b'Feilong Chen'

research

∙ 05/31/2023

ViLaS: Integrating Vision and Language into Automatic Speech Recognition

Employing additional multimodal information to improve automatic speech ...

0 Minglun Han, et al. ∙

research

∙ 05/07/2023

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages

Large language models (LLMs) have demonstrated remarkable language abili...

0 Feilong Chen, et al. ∙

research

∙ 01/30/2023

Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation

Large-scale pre-trained language models (PLMs) with powerful language mo...

0 Minglun Han, et al. ∙

research

∙ 05/24/2022

HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval

In the past few years, the emergence of vision-language pre-training (VL...

0 Feilong Chen, et al. ∙

research

∙ 04/15/2022

Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning

Visual Dialog is a challenging vision-language task since the visual dia...

0 Feilong Chen, et al. ∙

research

∙ 02/18/2022

VLP: A Survey on Vision-Language Pre-training

In the past few years, the emergence of pre-training models has brought ...

0 Feilong Chen, et al. ∙

research

∙ 09/17/2021

Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation

Visual dialogue is a challenging task since it needs to answer a series ...

0 Feilong Chen, et al. ∙

research

∙ 09/17/2021

GoG: Relation-aware Graph-over-Graph Network for Visual Dialog

Visual dialog, which aims to hold a meaningful conversation with humans ...

0 Feilong Chen, et al. ∙

research

∙ 09/13/2021

Learning to Ground Visual Objects for Visual Dialog

Visual dialog is challenging since it needs to answer a series of cohere...

0 Feilong Chen, et al. ∙

research

∙ 12/18/2019

DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog

Visual Dialog is a vision-language task that requires an AI agent to eng...

0 Feilong Chen, et al. ∙

Feilong Chen

Featured Co-authors

Sign in with Google

Consider DeepAI Pro