We present TransNormerLLM, the first linear attention-based Large Langua...
We explore a new task for audio-visual-language modeling called fine-gra...
Vision Transformers have achieved impressive performance in video
classi...
Fine-tuning is widely applied in image classification tasks as a transfe...
Physical symptoms caused by high stress commonly happen in our daily liv...
Deep learning (DL) models are widely used to provide a more convenient a...
The current standard for a variety of computer vision tasks using smalle...
Three-dimensional face reconstruction is one of the popular applications...