Learning image representations using synthetic data allows training neur...
The recent rapid advances in machine learning technologies largely depen...
Designing better machine translation systems by considering auxiliary in...
Pre-training models on Imagenet or other massive datasets of real images...
Deep convolutional networks have recently achieved great success in vide...
Most existing works in few-shot learning rely on meta-learning the netwo...
Vision transformer (ViT) has recently showed its strong capability in
ac...
Multi-modal learning, which focuses on utilizing various modalities to
i...
Nowadays, there is an abundance of data involving images and surrounding...
The recently developed vision transformer (ViT) has achieved promising
r...
Tremendous progress has been made in visual representation learning, not...
Network quantization has rapidly become one of the most widely used meth...
Neural Architecture Search (NAS) is a powerful tool to automatically des...
In recent years, a number of approaches based on 2D CNNs and 3D CNNs hav...
Neural Architecture Search (NAS) is an open and challenging problem in
m...
Current state-of-the-art models for video action recognition are mostly ...
In this paper, we propose a novel Convolutional Neural Network (CNN)
arc...
Motivated by the need to extract knowledge and value from interconnected...
To reduce the significant redundancy in deep Convolutional Neural Networ...