Wei Zou

research

∙ 07/28/2023

ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation

This paper presents the development and evaluation of ChatHome, a domain...

Cheng Wen, et al.

research

∙ 08/17/2022

Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition

We investigate robustness properties of pre-trained neural models for au...

Goutham Rajendran, et al.

research

∙ 08/01/2022

DSLA: Dynamic smooth label assignment for efficient anchor-free object detection

Anchor-free detectors basically formulate object detection as dense clas...

Hu Su, et al.

research

∙ 04/19/2022

Audio-Visual Wake Word Spotting System For MISP Challenge 2021

This paper presents the details of our system designed for the Task 1 of...

Yanguang Xu, et al.

research

∙ 09/08/2021

Metrics to find a surrogate endpoint of OS in metastatic oncology trials: a simulation study

Surrogate endpoint (SE) for overall survival (OS) in cancer patients is ...

Wei Zou, et al.

research

∙ 06/13/2021

GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

This paper introduces GigaSpeech, an evolving, multi-domain English spee...

Guoguo Chen, et al.

research

∙ 02/19/2021

SEPAL: Towards a Large-scale Analysis of SEAndroid Policy Customization

To investigate the status quo of SEAndroid policy customization, we prop...

Dongsong Yu, et al.

research

∙ 10/27/2020

Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning

Self-supervised visual pretraining has shown significant progress recent...

Dongwei Jiang, et al.

research

∙ 10/21/2020

TMT: A Transformer-based Modal Translator for Improving Multimodal Sequence Representations in Audio Visual Scene-aware Dialog

Audio Visual Scene-aware Dialog (AVSD) is a task to generate responses w...

Wubo Li, et al.

research

∙ 07/29/2020

Transformer based unsupervised pre-training for acoustic representation learning

Computational audio analysis has become a central issue in associated ar...

Ruixiong Zhang, et al.

research

∙ 05/20/2020

A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition

Building a good speech recognition system usually requires large amounts...

Dongwei Jiang, et al.

research

∙ 11/09/2019

A Reinforced Generation of Adversarial Samples for Neural Machine Translation

Neural machine translation systems tend to fail on less decent inputs d...

Wei Zou, et al.

research

∙ 10/23/2019

TCT: A Cross-supervised Learning Method for Multimodal Sequence Representation

Multimodality provides more promising performance than unimodality in most t...

Wubo Li, et al.

research

∙ 10/22/2019

Cross-task pre-training for acoustic scene classification

Acoustic scene classification (ASC) and acoustic event detection (AED) are...

Ruixiong Zhang, et al.

research

∙ 10/22/2019

Improving Transformer-based Speech Recognition Using Unsupervised Pre-training

Speech recognition technologies are gaining enormous popularity in vario...

Dongwei Jiang, et al.

research

∙ 09/24/2019

The Field-of-View Constraint of Markers for Mobile Robot with Pan-Tilt Camera

In the field of navigation and visual servo, it is common to calculate r...

Hongxuan Ma, et al.

research

∙ 09/19/2019

EPOSIT: An Absolute Pose Estimation Method for Pinhole and Fish-Eye Cameras

This paper presents a generic 6DOF camera pose estimation method, which ...

Zhaobing Kang, et al.

research

∙ 09/13/2019

Human Following for Wheeled Robot with Monocular Pan-tilt Camera

Human following on mobile robots has witnessed significant advances due ...

Zheng Zhu, et al.

research

∙ 08/26/2019

High Performance Visual Object Tracking with Unified Convolutional Networks

Convolutional neural networks (CNN) based tracking approaches have shown...

Zheng Zhu, et al.

research

∙ 08/24/2019

Camera Pose Correction in SLAM Based on Bias Values of Map Points

Accurate camera pose estimation result is essential for visual SLAM (VSL...

Zhaobing Kang, et al.

research

∙ 08/15/2019

FastPose: Towards Real-time Pose Estimation and Tracking via Scale-normalized Multi-task Networks

Both accuracy and efficiency are significant for pose estimation and tra...

Jiabin Zhang, et al.

research

∙ 08/02/2019

DELTA: A DEep learning based Language Technology plAtform

In this paper we present DELTA, a deep learning based language technolog...

Kun Han, et al.

research

∙ 01/06/2019

Motion Control on Bionic Eyes: A Comprehensive Review

Biology can provide biomimetic components and new control principles for...

Zheng Zhu, et al.

research

∙ 12/14/2018

Action Machine: Rethinking Action Recognition in Trimmed Videos

Existing methods in video action recognition mostly do not distinguish h...

Jiagang Zhu, et al.

research

∙ 11/18/2018

Optical Flow Based Online Moving Foreground Analysis

Obtained by moving object detection, the foreground mask result is unsha...

Junjie Huang, et al.

research

∙ 11/18/2018

An Efficient Optical Flow Based Motion Detection Method for Non-stationary Scenes

Real-time motion detection in non-stationary scenes is a difficult task ...

Junjie Huang, et al.

research

∙ 10/31/2018

Towards End-to-End Code-Switching Speech Recognition

Code-switching speech recognition has attracted an increasing interest r...

Ne Luo, et al.

research

∙ 07/13/2018

Optical Flow Based Real-time Moving Object Detection in Unconstrained Scenes

Real-time moving object detection in unconstrained scenes is a difficult...

Junjie Huang, et al.

research

∙ 05/10/2018

A comparable study of modeling units for end-to-end Mandarin speech recognition

End-to-end speech recognition has become increasingly popular in mandar...

Wei Zou, et al.

research

∙ 11/11/2017

End-to-end Video-level Representation Learning for Action Recognition

From the frame/clip-level feature learning to the video-level representa...

Jiagang Zhu, et al.

research

∙ 11/10/2017

UCT: Learning Unified Convolutional Networks for Real-time Visual Tracking

Convolutional neural networks (CNN) based tracking approaches have shown...

Zheng Zhu, et al.

research

∙ 11/03/2017

End-to-end Flow Correlation Tracking with Spatial-temporal Attention

Discriminative correlation filters (DCF) with deep convolutional feature...

Zheng Zhu, et al.

research

∙ 09/12/2017

Learning Gating ConvNet for Two-Stream based Methods in Action Recognition

For the two-stream style methods in action recognition, fusing the two s...

Jiagang Zhu, et al.
