Jianfeng Wang

research

∙ 08/05/2023

NP-SemiSeg: When Neural Processes meet Semi-Supervised Semantic Segmentation

Semi-supervised semantic segmentation involves assigning pixel-wise labe...

0 Jianfeng Wang, et al. ∙

research

∙ 07/27/2023

Spatial-Frequency U-Net for Denoising Diffusion Probabilistic Models

In this paper, we study the denoising diffusion probabilistic model (DDP...

0 Xin Yuan, et al. ∙

research

∙ 06/26/2023

Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

Despite the promising progress in multi-modal tasks, current large multi...

0 Fuxiao Liu, et al. ∙

research

∙ 06/07/2023

MultiSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos

Multimodal summarization with multimodal output (MSMO) has emerged as a ...

0 Jielin Qiu, et al. ∙

research

∙ 05/03/2023

Scheduling Network Function Chains Under Sub-Millisecond Latency SLOs

Network Function Virtualization (NFV) seeks to replace hardware middlebo...

0 Jianfeng Wang, et al. ∙

research

∙ 03/20/2023

MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action

We propose MM-REACT, a system paradigm that integrates ChatGPT with a po...

0 Zhengyuan Yang, et al. ∙

research

∙ 02/21/2023

Learning 3D Photography Videos via Self-supervised Diffusion on Single Images

3D photography renders a static image into a video with appealing 3D vis...

0 Xiaodong Wang, et al. ∙

research

∙ 01/31/2023

NP-Match: Towards a New Probabilistic Model for Semi-Supervised Learning

Semi-supervised learning (SSL) has been widely explored in recent years,...

0 Jianfeng Wang, et al. ∙

research

∙ 12/21/2022

Generalized Decoding for Pixel, Image, and Language

We present X-Decoder, a generalized decoding model that can predict pixe...

10 Xueyan Zou, et al. ∙

research

∙ 12/01/2022

GRiT: A Generative Region-to-text Transformer for Object Understanding

This paper presents a Generative RegIon-to-Text transformer, GRiT, for o...

0 Jialian Wu, et al. ∙

research

∙ 11/21/2022

Exploring Discrete Diffusion Models for Image Captioning

The image captioning task is typically realized by an auto-regressive me...

0 Zixin Zhu, et al. ∙

research

∙ 10/17/2022

Prompting GPT-3 To Be Reliable

Large language models (LLMs) show impressive abilities via few-shot prom...

0 Chenglei Si, et al. ∙

research

∙ 07/20/2022

NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis

In this paper, we present NUWA-Infinity, a generative model for infinite...

4 Chenfei Wu, et al. ∙

research

∙ 07/03/2022

NP-Match: When Neural Processes meet Semi-Supervised Learning

Semi-supervised learning (SSL) has been widely explored in recent years,...

0 Jianfeng Wang, et al. ∙

research

∙ 06/18/2022

Rethinking Bayesian Deep Learning Methods for Semi-Supervised Volumetric Medical Image Segmentation

Recently, several Bayesian deep learning methods have been proposed for ...

0 Jianfeng Wang, et al. ∙

research

∙ 06/16/2022

ALL-MASK: A Reconfigurable Logic Locking Method for Multicore Architecture with Sequential-Instruction-Oriented Key

Intellectual property (IP) piracy has become a non-negligible problem as...

0 Jianfeng Wang, et al. ∙

research

∙ 06/15/2022

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Vision-language (VL) pre-training has recently received considerable att...

13 Zi-Yi Dou, et al. ∙

research

∙ 06/07/2022

Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding

Leveraging large-scale data can introduce performance gains on many comp...

9 Lingchen Meng, et al. ∙

research

∙ 05/27/2022

GIT: A Generative Image-to-text Transformer for Vision and Language

In this paper, we design and train a Generative Image-to-text Transforme...

14 Jianfeng Wang, et al. ∙

research

∙ 03/14/2022

Statistical learning for train delays and influence of winter climate and atmospheric icing

This study investigated the climate effect under consecutive winters on ...

0 Jianfeng Wang, et al. ∙

research

∙ 03/10/2022

The Overlooked Classifier in Human-Object Interaction Recognition

Human-Object Interaction (HOI) recognition is challenging due to two fac...

0 Ying Jin, et al. ∙

research

∙ 12/13/2021

Decoupling Object Detection from Human-Object Interaction Recognition

We propose DEFR, a DEtection-FRee method to recognize Human-Object Inter...

0 Ying Jin, et al. ∙

research

∙ 12/09/2021

Injecting Semantic Concepts into End-to-End Image Captioning

Tremendous progress has been made in recent years in developing better i...

0 Zhiyuan Fang, et al. ∙

research

∙ 11/24/2021

Scaling Up Vision-Language Pre-training for Image Captioning

In recent years, we have witnessed significant performance boost in the ...

0 Xiaowei Hu, et al. ∙

research

∙ 11/23/2021

Crossing the Format Boundary of Text and Boxes: Towards Unified Vision-Language Modeling

In this paper, we propose UNICORN, a vision-language (VL) model that uni...

7 Zhengyuan Yang, et al. ∙

research

∙ 11/22/2021

Florence: A New Foundation Model for Computer Vision

Automated visual understanding of our diverse and open world demands com...

4 Lu Yuan, et al. ∙

research

∙ 11/19/2021

UFO: A UniFied TransfOrmer for Vision-Language Representation Learning

In this paper, we propose a single UniFied transfOrmer (UFO), which is c...

0 Jianfeng Wang, et al. ∙

research

∙ 11/03/2021

An Empirical Study of Training End-to-End Vision-and-Language Transformers

Vision-and-language (VL) pre-training has proven to be highly effective ...

0 Zi-Yi Dou, et al. ∙

research

∙ 10/20/2021

Local Statistics for Spatial Panel Models with Application to the US Electorate

The spatial panel regression model has shown great success in modelling ...

0 Jianfeng Wang, et al. ∙

research

∙ 09/18/2021

Edge Prior Augmented Networks for Motion Deblurring on Naturally Blurry Images

Motion deblurring has witnessed rapid development in recent years, and m...

4 Yuedong Chen, et al. ∙

research

∙ 09/10/2021

An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA

Knowledge-based visual question answering (VQA) involves answering quest...

0 Zhengyuan Yang, et al. ∙

research

∙ 08/04/2021

Train performance analysis using heterogeneous statistical models

This study investigated the effect of harsh winter climate on the perfor...

0 Jianfeng Wang, et al. ∙

research

∙ 07/27/2021

Is Object Detection Necessary for Human-Object Interaction Recognition?

This paper revisits human-object interaction (HOI) recognition at image ...

0 Ying Jin, et al. ∙

research

∙ 06/18/2021

RSG: A Simple but Effective Module for Learning Imbalanced Datasets

Imbalanced datasets widely exist in practice and are a great challenge fo...

0 Jianfeng Wang, et al. ∙

research

∙ 06/16/2021

End-to-End Semi-Supervised Object Detection with Soft Teacher

This paper presents an end-to-end semi-supervised object detection appro...

0 Mengde Xu, et al. ∙

research

∙ 06/05/2021

Convolutional Neural Networks with Gated Recurrent Connections

The convolutional neural network (CNN) has become a basic model for solv...

0 Jianfeng Wang, et al. ∙

research

∙ 04/05/2021

Compressing Visual-linguistic Model via Knowledge Distillation

Despite exciting progress in pre-training for visual-linguistic (VL) rep...

0 Zhiyuan Fang, et al. ∙

research

∙ 03/30/2021

DAP: Detection-Aware Pre-training with Weak Supervision

This paper presents a detection-aware pre-training (DAP) approach, which...

0 Yuanyi Zhong, et al. ∙

research

∙ 03/22/2021

Adversarial Feature Augmentation and Normalization for Visual Recognition

Recent advances in computer vision take advantage of adversarial data au...

14 Tianlong Chen, et al. ∙

research

∙ 01/16/2021

Galleon: Reshaping the Square Peg of NFV

Software is often used for Network Functions (NFs) – such as firewalls, ...

0 Jianfeng Wang, et al. ∙

research

∙ 01/12/2021

SEED: Self-supervised Distillation For Visual Representation

This paper is concerned with self-supervised learning for small models. ...

2 Zhiyuan Fang, et al. ∙

research

∙ 01/12/2021

LLA: Loss-aware Label Assignment for Dense Pedestrian Detection

Label assignment has been widely studied in general object detection bec...

1 Zheng Ge, et al. ∙

research

∙ 12/13/2020

MiniVLM: A Smaller and Faster Vision-Language Model

Recent vision-language (VL) studies have shown remarkable progress by le...

1 Jianfeng Wang, et al. ∙

research

∙ 12/08/2020

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption

In this paper, we propose Text-Aware Pre-training (TAP) for Text-VQA and...

0 Zhengyuan Yang, et al. ∙

research

∙ 12/07/2020

End-to-End Object Detection with Fully Convolutional Network

Mainstream object detectors based on the fully convolutional network have...

0 Jianfeng Wang, et al. ∙

research

∙ 09/22/2020

Effects of winter climate on high speed passenger trains in Botnia-Atlantica region

Harsh winter climate can cause various problems for both public and priv...

0 Jianfeng Wang, et al. ∙

research

∙ 07/15/2020

Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer

In this paper, we propose an effective knowledge transfer framework to b...

2 Yuanyi Zhong, et al. ∙

research

∙ 07/07/2020

AutoAssign: Differentiable Label Assignment for Dense Object Detection

In this paper, we propose an anchor-free object detector with a fully di...

2 Benjin Zhu, et al. ∙

research

∙ 05/22/2020

Hashing-based Non-Maximum Suppression for Crowded Object Detection

In this paper, we propose an algorithm, named hashing-based non-maximum ...

1 Jianfeng Wang, et al. ∙

research

∙ 08/29/2019

Enhanced block sparse signal recovery based on q-ratio block constrained minimal singular values

In this paper we introduce the q-ratio block constrained minimal singula...

0 Jianfeng Wang, et al. ∙

