Despite significant progress in neural abstractive summarization, recent...
Vision-and-language navigation (VLN) is a multimodal task where an agent...
A major challenge in visually grounded language generation is to build r...
In the vision-and-language navigation (VLN) task, an agent follows natur...
Multi-sentence summarization is a well studied problem in NLP, while
gen...
There is a recent surge of interest in cross-modal representation learni...