Large language models (LLMs) demonstrate remarkable ability to comprehen...
Large pre-trained language models (LLMs) have been shown to have signifi...
BERT-style models pre-trained on the general corpus (e.g., Wikipedia) an...
Computer-aided diagnosis plays a salient role in more accessible and acc...
Temporal grounding aims to localize a video moment which is semantically...
Deep learning has achieved remarkable progress for visual recognition on...
Background: Maintaining a healthy diet is vital to avoid health-related
...
Making profits in stock market is a challenging task for both profession...
Current video representations heavily rely on learning from manually
ann...