Studies on semi-supervised medical image segmentation (SSMIS) have seen ...
We propose MemoChat, a pipeline for refining instructions that enables l...
Multimodal headline utilizes both video frames and transcripts to genera...
In the scenario of unsupervised extractive summarization, learning
high-...
Recent language generative models are mostly trained on large-scale data...
Scene segmentation and classification (SSC) serve as a critical step tow...
In document-level event extraction (DEE) task, event arguments always sc...
The extraction of text information in videos serves as a critical step
t...