Zhihao Fan

research

∙ 05/16/2023

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation

Diffusion models have gained significant attention in the realm of image...

1 Tong Wu, et al. ∙

research

∙ 01/21/2023

Unifying Structure Reasoning and Language Model Pre-training for Complex Reasoning

Recent knowledge enhanced pre-trained language models have shown remarka...

0 Siyuan Wang, et al. ∙

research

∙ 12/22/2022

GENIE: Large Scale Pre-training for Text Generation with Diffusion Model

In this paper, we propose a large-scale language pre-training for text G...

1 Zhenghao Lin, et al. ∙

research

∙ 08/22/2022

Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering

Multi-hop reasoning requires aggregating multiple documents to answer a ...

0 Siyuan Wang, et al. ∙

research

∙ 06/11/2022

A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training

Multi-modal pre-training and knowledge discovery are two important resea...

0 Zhihao Fan, et al. ∙

research

∙ 01/29/2022

MVPTR: Multi-Stage Vision-Language Pre-Training via Multi-Level Semantic Alignment

In this paper, we propose a Multi-stage Vision-language Pre-TRaining (MV...

0 Zejun Li, et al. ∙

research

∙ 11/05/2021

Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval

Matching model is essential for Image-Text Retrieval framework. Existing...

16 Zhihao Fan, et al. ∙

research

∙ 09/24/2021

Contextual Fine-to-Coarse Distillation for Coarse-grained Response Selection in Open-Domain Conversations

We study the problem of coarse-grained response selection in retrieval-b...

0 Wei Chen, et al. ∙

research

∙ 09/12/2021

Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval

Existing research for image text retrieval mainly relies on sentence-lev...

8 Zhihao Fan, et al. ∙

research

∙ 06/21/2021

TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning

Existing research for image captioning usually represents an image using...

0 Zhihao Fan, et al. ∙

research

∙ 03/25/2021

Mask Attention Networks: Rethinking and Strengthen Transformer

Transformer is an attention-based neural network, which consists of two ...

0 Zhihao Fan, et al. ∙

research

∙ 03/21/2021

An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information

In this paper, we focus on the problem of unsupervised image-sentence ma...

0 Zejun Li, et al. ∙

research

∙ 12/01/2020

An Enhanced Knowledge Injection Model for Commonsense Generation

Commonsense generation aims at generating plausible everyday scenario de...

0 Zhihao Fan, et al. ∙


