Audio-visual automatic speech recognition (AV-ASR) is an extension of AS...
Pre-training on large scale unlabelled datasets has shown impressive
per...
The task of retrieving video content relevant to natural language querie...
In this paper, we tackle the problem of 3D human shape estimation from s...