Peng Wang

research

∙ 09/14/2023

Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

In spite of the excellent strides made by end-to-end (E2E) models in spe...

0 Peng Wang, et al. ∙

research

∙ 08/31/2023

TouchStone: Evaluating Vision-Language Models by Language Models

Large vision-language models (LVLMs) have recently witnessed rapid advan...

0 Shuai Bai, et al. ∙

research

∙ 08/31/2023

MVDream: Multi-view Diffusion for 3D Generation

We propose MVDream, a multi-view diffusion model that is able to generat...

0 Yichun Shi, et al. ∙

research

∙ 08/24/2023

Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities

We introduce the Qwen-VL series, a set of large-scale vision-language mo...

0 Jinze Bai, et al. ∙

research

∙ 08/24/2023

LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition

The diversity in length constitutes a significant characteristic of text...

0 Changxu Cheng, et al. ∙

research

∙ 08/24/2023

Ground-to-Aerial Person Search: Benchmark Dataset and Approach

In this work, we construct a large-scale dataset for Ground-to-Aerial Pe...

0 Shizhou Zhang, et al. ∙

research

∙ 08/20/2023

Contrastive Diffusion Model with Auxiliary Guidance for Coarse-to-Fine PET Reconstruction

To obtain high-quality positron emission tomography (PET) scans while re...

0 Zeyu Han, et al. ∙

research

∙ 08/20/2023

Polymerized Feature-based Domain Adaptation for Cervical Cancer Dose Map Prediction

Recently, deep learning (DL) has automated and accelerated the clinical ...

0 Jie Zeng, et al. ∙

research

∙ 08/16/2023

Pre-training with Large Language Model-based Document Expansion for Dense Passage Retrieval

In this paper, we systematically study the potential of pre-training wit...

0 Guangyuan Ma, et al. ∙

research

∙ 08/14/2023

EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models

Large Language Models (LLMs) usually suffer from knowledge cutoff or fal...

0 Peng Wang, et al. ∙

research

∙ 08/03/2023

A Survey on Deep Learning-based Spatio-temporal Action Detection

Spatio-temporal action detection (STAD) aims to classify the actions pre...

0 Peng Wang, et al. ∙

research

∙ 07/25/2023

Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition

Due to the enormous technical challenges and wide range of applications,...

0 Cheng Da, et al. ∙

research

∙ 07/20/2023

Pre-train, Adapt and Detect: Multi-Task Adapter Tuning for Camouflaged Object Detection

Camouflaged object detection (COD), aiming to segment camouflaged object...

0 Yinghui Xing, et al. ∙

research

∙ 07/19/2023

Watch out Venomous Snake Species: A Solution to SnakeCLEF2023

The SnakeCLEF2023 competition aims to the development of advanced algori...

0 Feiran Hu, et al. ∙

research

∙ 07/14/2023

Sparsified Simultaneous Confidence Intervals for High-Dimensional Linear Models

Statistical inference of the high-dimensional regression coefficients is...

0 Xiaorui Zhu, et al. ∙

research

∙ 07/03/2023

MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion

This paper introduces MVDiffusion, a simple yet effective multi-view ima...

0 Shitao Tang, et al. ∙

research

∙ 06/28/2023

Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information

Previously, Target Speaker Extraction (TSE) has yielded outstanding perf...

0 Jiuxin Lin, et al. ∙

research

∙ 06/01/2023

Teacher Agent: A Non-Knowledge Distillation Method for Rehearsal-based Video Incremental Learning

With the rise in popularity of video-based social media, new categories ...

0 Shengqin Jiang, et al. ∙

research

∙ 05/29/2023

Learning Conditional Attributes for Compositional Zero-Shot Learning

Compositional Zero-Shot Learning (CZSL) aims to train models to recogniz...

0 Qingsheng Wang, et al. ∙

research

∙ 05/27/2023

NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images

We present a neural rendering-based method called NeRO for reconstructin...

0 Yuan Liu, et al. ∙

research

∙ 05/23/2023

A New Comprehensive Benchmark for Semi-supervised Video Anomaly Detection and Anticipation

Semi-supervised video anomaly detection (VAD) is a critical task in the ...

0 Congqi Cao, et al. ∙

research

∙ 05/22/2023

Editing Large Language Models: Problems, Methods, and Opportunities

Recent advancements in deep learning have precipitated the emergence of ...

0 Yunzhi Yao, et al. ∙

research

∙ 05/18/2023

ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

In this work, we explore a scalable way for building a general represent...

0 Peng Wang, et al. ∙

research

∙ 05/15/2023

Knowledge Rumination for Pre-trained Language Models

Previous studies have revealed that vanilla pre-trained language models ...

0 Yunzhi Yao, et al. ∙

research

∙ 05/07/2023

Fast Blind Recovery of Linear Block Codes over Noisy Channels

This paper addresses the blind recovery of the parity check matrix of an...

0 Peng Wang, et al. ∙

research

∙ 04/29/2023

ViewFormer: View Set Attention for Multi-view 3D Shape Understanding

This paper presents ViewFormer, a simple yet effective model for multi-v...

0 Hongyu Sun, et al. ∙

research

∙ 04/27/2023

Maximizing Model Generalization for Manufacturing with Self-Supervised Learning and Federated Learning

Deep Learning (DL) can diagnose faults and assess machine health from ra...

0 Matthew Russell, et al. ∙

research

∙ 04/23/2023

AirBirds: A Large-scale Challenging Dataset for Bird Strike Prevention in Real-world Airports

One fundamental limitation to the research of bird strike prevention is ...

0 Hongyu Sun, et al. ∙

research

∙ 04/20/2023

A geometry-aware deep network for depth estimation in monocular endoscopy

Monocular depth estimation is critical for endoscopists to perform spati...

0 Yongming Yang, et al. ∙

research

∙ 04/20/2023

CoT-MoTE: Exploring ConTextual Masked Auto-Encoder Pre-training with Mixture-of-Textual-Experts for Passage Retrieval

Passage retrieval aims to retrieve relevant passages from large collecti...

0 Guangyuan Ma, et al. ∙

research

∙ 04/19/2023

Progressive Transfer Learning for Dexterous In-Hand Manipulation with Multi-Fingered Anthropomorphic Hand

Dexterous in-hand manipulation for a multi-fingered anthropomorphic hand...

0 Yongkang Luo, et al. ∙

research

∙ 04/17/2023

Dumpy: A Compact and Adaptive Index for Large Data Series Collections

Data series indexes are necessary for managing and analyzing the increas...

0 Zeyu Wang, et al. ∙

research

∙ 04/05/2023

CoT-MAE v2: Contextual Masked Auto-Encoder with Multi-view Modeling for Passage Retrieval

Growing techniques have been emerging to improve the performance of pass...

0 Xing Wu, et al. ∙

research

∙ 03/31/2023

Generalized Anthropomorphic Functional Grasping with Minimal Demonstrations

This article investigates the challenge of achieving functional tool-use...

0 Wei Wei, et al. ∙

research

∙ 03/28/2023

F^2-NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories

This paper presents a novel grid-based NeRF called F2-NeRF (Fast-Free-Ne...

0 Peng Wang, et al. ∙

research

∙ 02/21/2023

Joint Spectrum and Power Allocation for V2X Communications with Imperfect CSI

In Vehicle-to-Everything (V2X) communication, the high mobility of vehic...

0 Peng Wang, et al. ∙

research

∙ 02/13/2023

Learning Tri-mode Grasping for Ambidextrous Robot Picking

Object picking in cluttered scenes is a widely investigated field of rob...

0 Chenlin Zhou, et al. ∙

research

∙ 02/09/2023

Self-Supervised Node Representation Learning via Node-to-Neighbourhood Alignment

Self-supervised node representation learning aims to learn node represen...

0 Wei Dong, et al. ∙

research

∙ 02/07/2023

Delving Deep into Simplicity Bias for Long-Tailed Image Recognition

Simplicity Bias (SB) is a phenomenon that deep neural networks tend to r...

0 Xiu-Shen Wei, et al. ∙

research

∙ 02/06/2023

Industrial computed tomography based intelligent non-destructive testing method for power capacitor

Power capacitor device is a widely used reactive power compensation equi...

0 Zhenxing Cheng, et al. ∙

research

∙ 01/27/2023

Data Volume-aware Computation Task Scheduling for Smart Grid Data Analytic Applications

Emerging smart grid applications analyze large amounts of data collected...

0 Binquan Guo, et al. ∙

research

∙ 01/19/2023

Multimodal Video Adapter for Parameter Efficient Video Text Retrieval

State-of-the-art video-text retrieval (VTR) methods usually fully fine-t...

0 Bowen Zhang, et al. ∙

research

∙ 12/28/2022

Optimizing Replacement Policies for Content Delivery Network Caching: Beyond Belady to Attain A Seemingly Unattainable Byte Miss Ratio

When facing objects/files of differing sizes in content delivery network...

0 Peng Wang, et al. ∙

research

∙ 12/19/2022

Transferring General Multimodal Pretrained Models to Text Recognition

This paper proposes a new method, OFA-OCR, to transfer multimodal pretra...

0 Junyang Lin, et al. ∙

research

∙ 12/16/2022

Weakly Supervised Video Anomaly Detection Based on Cross-Batch Clustering Guidance

Weakly supervised video anomaly detection (WSVAD) is a challenging task ...

0 Congqi Cao, et al. ∙

research

∙ 12/08/2022

OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models

Generalist models, which are capable of performing diverse multi-modal t...

0 Jinze Bai, et al. ∙

research

∙ 12/05/2022

Generalizable Person Re-Identification via Viewpoint Alignment and Fusion

In the current person Re-identification (ReID) methods, most domain gene...

0 Bingliang Jiao, et al. ∙

research

∙ 11/25/2022

NeuralUDF: Learning Unsigned Distance Fields for Multi-view Reconstruction of Surfaces with Arbitrary Topologies

We present a novel method, called NeuralUDF, for reconstructing surfaces...

0 Xiaoxiao Long, et al. ∙

research

∙ 11/23/2022

BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields

Neural Radiance Fields (NeRF) have received considerable attention recen...

0 Peng Wang, et al. ∙

research

∙ 11/22/2022

Semantic Guided Level-Category Hybrid Prediction Network for Hierarchical Image Classification

Hierarchical classification (HC) assigns each object with multiple label...

0 Peng Wang, et al. ∙
