Video generation is one of the most challenging tasks in Machine Learnin...
We present MMFT-BERT(MultiModal Fusion Transformer with BERT encodings),...
This paper introduces a new type of image enhancement problem. Compared ...
Several recent projects demonstrated the promise of end-to-end learned d...
This paper tackles the Text Correction (TC) problem, i.e., finding and
r...
Given a video and its incomplete textural description with missing words...