Effective scaling and a flexible task interface enable large language mo...
Dense video captioning aims to identify the events of interest in an inp...
Image captioning involves identifying semantic concepts in the scene and...
Automatic image captioning has improved significantly in the last few ye...