Jinfeng Bai

research

∙ 09/06/2023

GPT Can Solve Mathematical Problems Without a Calculator

Previous studies have typically assumed that large language models are u...

Zhen Yang, et al.

research

∙ 08/21/2023

Patch Is Not All You Need

Vision Transformers have achieved great success in computer visions, del...

Changzhen Li, et al.

research

∙ 06/02/2023

DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model

Multilingual self-supervised speech representation models have greatly e...

Haoyu Wang, et al.

research

∙ 05/09/2023

TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition

Text irregularities pose significant challenges to scene text recognizer...

Tianlun Zheng, et al.

research

∙ 04/09/2023

CCLAP: Controllable Chinese Landscape Painting Generation via Latent Diffusion Model

With the development of deep generative models, recent years have seen g...

Zhongqi Wang, et al.

research

∙ 12/27/2022

1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation

The task of referring video object segmentation aims to segment the obje...

Zhiwei Hu, et al.

research

∙ 12/27/2022

Position-Aware Contrastive Alignment for Referring Image Segmentation

Referring image segmentation aims to segment the target object described...

Bo Chen, et al.

research

∙ 11/23/2022

Texts as Images in Prompt Tuning for Multi-Label Image Recognition

Prompt tuning has been employed as an efficient way to adapt large visio...

Zixian Guo, et al.

research

∙ 11/02/2022

DSPGAN: a GAN-based universal vocoder for high-fidelity TTS by time-frequency domain supervision from DSP

Recent development of neural vocoders based on the generative adversaria...

Kun Song, et al.

research

∙ 10/18/2022

1st Place Solutions for the UVO Challenge 2022

This paper describes the approach we have taken in the challenge. We sti...

Jiajun Zhang, et al.

research

∙ 10/12/2022

Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge

Code-switching automatic speech recognition becomes one of the most chal...

Shuhao Deng, et al.

research

∙ 07/18/2022

Towards Diverse and Faithful One-shot Adaption of Generative Adversarial Networks

One-shot generative domain adaption aims to transfer a pre-trained gener...

Yabo Zhang, et al.

research

∙ 06/27/2022

TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech Recognition Baseline

This paper introduces a new corpus of Mandarin-English code-switching sp...

Chengfei Li, et al.

research

∙ 03/01/2022

BERT-LID: Leveraging BERT to Improve Spoken Language Identification

Language identification is a task of automatically determining the ident...

Yuting Nie, et al.
