Large Vision-Language Models (LVLMs) have recently achieved remarkable
s...
With the rapid evolution of large language models (LLMs), there is a gro...
Document understanding refers to automatically extract, analyze and
comp...
To promote the development of Vision-Language Pre-training (VLP) and
mul...
Existing knowledge-enhanced methods have achieved remarkable results in
...
Knowledge distillation is of key importance to launching multilingual
pr...
Large language models (LLMs) have demonstrated impressive zero-shot abil...
In this paper, we present ChatPLUG, a Chinese open-domain dialogue syste...
Recent years have witnessed a big convergence of language, vision, and
m...
Video-language pre-training has advanced the performance of various
down...
Although pre-trained language models (PLMs) have achieved state-of-the-a...
Video-text retrieval has been a crucial and fundamental task in multi-mo...
Large-scale pretrained foundation models have been an emerging paradigm ...
Live streaming is becoming an increasingly popular trend of sales in
E-c...
Pre-sales customer service is of importance to E-commerce platforms as i...