Event schemas are a form of world knowledge about the typical progressio...
Online resources such as WikiHow compile a wide range of scripts for
per...
In recent years, large language models (LMs) have achieved remarkable
pr...
Emerging events, such as the COVID pandemic and the Ukraine Crisis, requ...
Several works have proven that finetuning is an applicable approach for
...
Video event extraction aims to detect salient events from a video and
id...
Recent advances in pre-training vision-language models like CLIP have sh...
Goal-oriented generative script learning aims to generate subsequent ste...
Multi-channel video-language retrieval require models to understand
info...
The goal of this work is to build flexible video-language models that ca...
Despite achieving state-of-the-art zero-shot performance, existing
visio...
Vision-language (V+L) pretraining models have achieved great success in
...
Recently, there has been an increasing interest in building question
ans...
Visual and textual modalities contribute complementary information about...
Event schemas encode knowledge of stereotypical structures of events and...
To combat COVID-19, both clinicians and scientists need to digest the va...
We introduce a new task, MultiMedia Event Extraction (M2E2), which aims ...
Fine-grained entity typing aims to assign entity mentions in the free te...
Knowledge graph embedding aims to embed entities and relations of knowle...