This paper presents the development and evaluation of ChatHome, a domain...
We investigate robustness properties of pre-trained neural models for au...
Anchor-free detectors basically formulate object detection as dense clas...
This paper presents the details of our system designed for Task 1 of...
Surrogate endpoint (SE) for overall survival (OS) in cancer patients is ...
This paper introduces GigaSpeech, an evolving, multi-domain English spee...
To investigate the status quo of SEAndroid policy customization, we prop...
Self-supervised visual pretraining has shown significant progress recent...
Audio Visual Scene-aware Dialog (AVSD) is a task to generate responses w...
Computational audio analysis has become a central issue in associated ar...
Building a good speech recognition system usually requires large amounts...
Neural machine translation systems tend to fail on less decent inputs d...
Multimodalities provide more promising performance than unimodality in most t...
Acoustic scene classification (ASC) and acoustic event detection (AED) are...
Speech recognition technologies are gaining enormous popularity in vario...
In the field of navigation and visual servo, it is common to calculate r...
This paper presents a generic 6DOF camera pose estimation method, which ...
Human following on mobile robots has witnessed significant advances due ...
Convolutional neural networks (CNN) based tracking approaches have shown...
An accurate camera pose estimation result is essential for visual SLAM (VSL...
Both accuracy and efficiency are significant for pose estimation and tra...
In this paper we present DELTA, a deep learning based language technolog...
Biology can provide biomimetic components and new control principles for...
Existing methods in video action recognition mostly do not distinguish h...
Obtained by moving object detection, the foreground mask result is unsha...
Real-time motion detection in non-stationary scenes is a difficult task ...
Code-switching speech recognition has attracted increasing interest r...
Real-time moving object detection in unconstrained scenes is a difficult...
End-to-end speech recognition has become increasingly popular in mandar...
From the frame/clip-level feature learning to the video-level representa...
Convolutional neural networks (CNN) based tracking approaches have shown...
Discriminative correlation filters (DCF) with deep convolutional feature...
For the two-stream style methods in action recognition, fusing the two s...