Visually-grounded dialog systems, which integrate multiple modes of
comm...
Multi-object tracking (MOT) is a challenging vision task that aims to de...
Transformer framework has been showing superior performances in visual o...
Multi-object tracking (MOT) aims at estimating bounding boxes and identi...