b'Yu Zhang'

research

∙ 09/21/2023

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Large language models (LLMs) have pushed the limits of natural language ...

0 Longhui Yu, et al. ∙

research

∙ 09/21/2023

Fully Transformer-Equipped Architecture for End-to-End Referring Video Object Segmentation

Referring Video Object Segmentation (RVOS) requires segmenting the objec...

0 Ping Li, et al. ∙

research

∙ 09/21/2023

Efficient Long-Short Temporal Attention Network for Unsupervised Video Object Segmentation

Unsupervised Video Object Segmentation (VOS) aims at identifying the con...

0 Ping Li, et al. ∙

research

∙ 09/19/2023

Multimodal Modeling For Spoken Language Identification

Spoken language identification refers to the task of automatically predi...

0 Shikhar Bharadwaj, et al. ∙

research

∙ 09/16/2023

Learning a Stable Dynamic System with a Lyapunov Energy Function for Demonstratives Using Neural Networks

Autonomous Dynamic System (DS)-based algorithms hold a pivotal and found...

0 Yu Zhang, et al. ∙

research

∙ 09/14/2023

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models

We introduce a multilingual speaker change detection model (USM-SCD) tha...

0 Guanlong Zhao, et al. ∙

research

∙ 09/04/2023

StereoFlowGAN: Co-training for Stereo and Flow with Unsupervised Domain Adaptation

We introduce a novel training strategy for stereo matching and optical f...

0 Zhexiao Xiong, et al. ∙

research

∙ 09/03/2023

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

While large language models (LLMs) have demonstrated remarkable capabili...

0 Yue Zhang, et al. ∙

research

∙ 08/30/2023

OldVisOnline: Curating a Dataset of Historical Visualizations

With the increasing adoption of digitization, more and more historical v...

0 Yu Zhang, et al. ∙

research

∙ 08/30/2023

Occlusion-Aware Detection and Re-ID Calibrated Network for Multi-Object Tracking

Multi-Object Tracking (MOT) is a crucial computer vision task that aims ...

0 Yukun Su, et al. ∙

research

∙ 08/23/2023

A Scale-Invariant Task Balancing Approach for Multi-Task Learning

Multi-task learning (MTL), a learning paradigm to learn multiple related...

0 Baijiong Lin, et al. ∙

research

∙ 08/16/2023

High-Fidelity Lake Extraction via Two-Stage Prompt Enhancement: Establishing a Novel Baseline and Benchmark

The extraction of lakes from remote sensing images is a complex challeng...

0 Ben Chen, et al. ∙

research

∙ 08/15/2023

Forward-Backward Reasoning in Large Language Models for Verification

Chain-of-Though (CoT) prompting has shown promising performance in vario...

0 Weisen Jiang, et al. ∙

research

∙ 08/08/2023

DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images

Optical satellite images are a critical data source; however, cloud cove...

0 Xuechao Zou, et al. ∙

research

∙ 08/08/2023

LEFormer: A Hybrid CNN-Transformer Architecture for Accurate Lake Extraction from Remote Sensing Imagery

Lake extraction from remote sensing imagery is challenging due to the co...

0 Ben Chen, et al. ∙

research

∙ 08/02/2023

Decomposing and Coupling Saliency Map for Lesion Segmentation in Ultrasound Images

Complex scenario of ultrasound image, in which adjacent tissues (i.e., b...

0 Zhenyuan Ning, et al. ∙

research

∙ 07/27/2023

MATNilm: Multi-appliance-task Non-intrusive Load Monitoring with Limited Labeled Data

Non-intrusive load monitoring (NILM) identifies the status and power con...

0 Jing Xiong, et al. ∙

research

∙ 07/25/2023

A Dual-mode Local Search Algorithm for Solving the Minimum Dominating Set Problem

Given a graph, the minimum dominating set (MinDS) problem is to identify...

0 Enqiang Zhu, et al. ∙

research

∙ 07/20/2023

"It Felt Like Having a Second Mind": Investigating Human-AI Co-creativity in Prewriting with Large Language Models

Prewriting is the process of discovering and developing ideas before a f...

0 Qian Wan, et al. ∙

research

∙ 06/24/2023

Weakly Supervised Multi-Label Classification of Full-Text Scientific Papers

Instead of relying on human-annotated training samples to build a classi...

0 Yu Zhang, et al. ∙

research

∙ 06/22/2023

AudioPaLM: A Large Language Model That Can Speak and Listen

We introduce AudioPaLM, a large language model for speech understanding ...

0 Paul K. Rubenstein, et al. ∙

research

∙ 06/22/2023

FlowFace++: Explicit Semantic Flow-supervised End-to-End Face Swapping

This work proposes a novel face-swapping framework FlowFace++, utilizing...

0 Yu Zhang, et al. ∙

research

∙ 06/20/2023

Learning Variable Impedance Skills from Demonstrations with Passivity Guarantee

Robots are increasingly being deployed not only in workplaces but also i...

0 Yu Zhang, et al. ∙

research

∙ 06/13/2023

Efficient Adapters for Giant Speech Models

Large pre-trained speech models are widely used as the de-facto paradigm...

0 Nanxin Chen, et al. ∙

research

∙ 06/13/2023

PersonaPKT: Building Personalized Dialogue Agents via Parameter-efficient Knowledge Transfer

Personalized dialogue agents (DAs) powered by large pre-trained language...

0 Xu Han, et al. ∙

research

∙ 06/07/2023

Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation

Few-shot question answering (QA) aims at precisely discovering answers t...

0 Xiusi Chen, et al. ∙

research

∙ 06/06/2023

COPR: Consistency-Oriented Pre-Ranking for Online Advertising

Cascading architecture has been widely adopted in large-scale advertisin...

0 Zhishan Zhao, et al. ∙

research

∙ 06/02/2023

Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection

LiDAR and Radar are two complementary sensing approaches in that LiDAR s...

0 Yingjie Wang, et al. ∙

research

∙ 06/01/2023

Explanation Graph Generation via Generative Pre-training over Synthetic Graphs

The generation of explanation graphs is a significant task that aims to ...

0 Han Cui, et al. ∙

research

∙ 06/01/2023

Effective Structured Prompting by Meta-Learning and Representative Verbalizer

Prompt tuning for pre-trained masked language models (MLM) has shown pro...

0 Weisen Jiang, et al. ∙

research

∙ 06/01/2023

How to Estimate Model Transferability of Pre-Trained Speech Models?

In this work, we introduce a “score-based assessment” framework for esti...

0 Zih-Ching Chen, et al. ∙

research

∙ 05/30/2023

LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus

This paper introduces a new speech dataset called “LibriTTS-R” designed ...

0 Yuma Koizumi, et al. ∙

research

∙ 05/25/2023

Mixture-of-Expert Conformer for Streaming Multilingual ASR

End-to-end models with large capacity have significantly improved multil...

0 Ke Hu, et al. ∙

research

∙ 05/23/2023

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding

Scientific literature understanding tasks have gained significant attent...

0 Yu Zhang, et al. ∙

research

∙ 05/22/2023

Capturing Conversion Rate Fluctuation during Sales Promotions: A Novel Historical Data Reuse Approach

Conversion rate (CVR) prediction is one of the core components in online...

0 Zhangming Chan, et al. ∙

research

∙ 05/20/2023

Patton: Language Model Pretraining on Text-Rich Networks

A real-world text corpus sometimes comprises not only text documents but...

0 Bowen Jin, et al. ∙

research

∙ 05/13/2023

Temporal Consistent Automatic Video Colorization via Semantic Correspondence

Video colorization task has recently attracted wide attention. Recent me...

9 Yu Zhang, et al. ∙

research

∙ 05/10/2023

A Self-Training Framework Based on Multi-Scale Attention Fusion for Weakly Supervised Semantic Segmentation

Weakly supervised semantic segmentation (WSSS) based on image-level labe...

5 Guoqing Yang, et al. ∙

research

∙ 05/08/2023

A Unifying Framework of Attention-based Neural Load Forecasting

Accurate load forecasting is critical for reliable and efficient plannin...

0 Jing Xiong, et al. ∙

research

∙ 05/04/2023

Chain-of-Skills: A Configurable Model for Open-domain Question Answering

The retrieval model is an indispensable component for real-world knowled...

0 Kaixin Ma, et al. ∙

research

∙ 05/03/2023

Transforming Visual Scene Graphs to Image Captions

We propose to Transform Scene Graphs (TSG) into more descriptive caption...

0 Xu Yang, et al. ∙

research

∙ 04/28/2023

An Adaptive Policy to Employ Sharpness-Aware Minimization

Sharpness-aware minimization (SAM), which searches for flat minima by mi...

0 Weisen Jiang, et al. ∙

research

∙ 04/27/2023

Understanding Shared Speech-Text Representations

Recently, a number of approaches to train speech models by incorpo-ratin...

0 Gary Wang, et al. ∙

research

∙ 04/25/2023

Detection of Pavement Cracks by Deep Learning Models of Transformer and UNet

Fracture is one of the main failure modes of engineering structures such...

0 Yu Zhang, et al. ∙

research

∙ 04/20/2023

Mastering Asymmetrical Multiplayer Game with Multi-Agent Asymmetric-Evolution Reinforcement Learning

Asymmetrical multiplayer (AMP) game is a popular game genre which involv...

0 Chenglu Sun, et al. ∙

research

∙ 04/16/2023

Handling Heavy Occlusion in Dense Crowd Tracking by Focusing on the Heads

With the rapid development of deep learning, object detection and tracki...

0 Yu Zhang, et al. ∙

research

∙ 04/13/2023

SPColor: Semantic Prior Guided Exemplar-based Image Colorization

Exemplar-based image colorization aims to colorize a target grayscale im...

0 Siqi Chen, et al. ∙

research

∙ 04/11/2023

Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference

We propose Conditional Adapter (CoDA), a parameter-efficient transfer le...

0 Tao Lei, et al. ∙

research

∙ 04/06/2023

A Fast and Lightweight Network for Low-Light Image Enhancement

Low-light images often suffer from severe noise, low brightness, low con...

0 Yu Zhang, et al. ∙

research

∙ 04/04/2023

Safe Explicable Robot Planning

Human expectations stem from their knowledge of the others and the world...

0 Akkamahadevi Hanni, et al. ∙

Yu Zhang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro