This paper presents a paradigm that adapts general large-scale pretraine...
Despite its better bio-plausibility, goal-driven spiking neural network ...
Speech emotion recognition is crucial to human-computer interaction. The...
Enabled by multi-head self-attention, Transformer has exhibited remarkab...
Paralinguistic speech processing is important in addressing many issues,...
Referring video object segmentation aims to segment the object referred ...
Long-term scene changes present challenges to localization systems using...
Transformer has obtained promising results on cognitive speech signal
pr...
Speech emotion recognition is a challenging and important research topic...
A new unsupervised learning method of depth and ego-motion using multipl...
Speech emotion recognition is a vital contributor to the next generation...
Visual localization is a crucial component in the application of mobile ...
Visual localization is a crucial problem in mobile robotics and autonomo...
In existing visual representation learning tasks, deep convolutional neu...
In this work we propose a new automatic image annotation model, dubbed
...
Saliency computation has become a popular research field for many
applic...