Max Bain

research

∙ 07/18/2023

OxfordVGG Submission to the EGO4D AV Transcription Challenge

This report presents the technical details of our submission on the EGO4...

Jaesung Huh, et al.

research

∙ 05/24/2023

Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets

Vision-language models are growing in popularity and public visibility t...

Brandon Smith, et al.

research

∙ 03/29/2023

AutoAD: Movie Description in Context

The objective of this paper is an automatic Audio Description (AD) model...

Tengda Han, et al.

research

∙ 03/01/2023

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

Large-scale, weakly-supervised speech recognition models, such as Whispe...

Max Bain, et al.

research

∙ 05/17/2022

A CLIP-Hitchhiker's Guide to Long Video Retrieval

Our goal in this paper is the adaptation of image-text models for long v...

Max Bain, et al.

research

∙ 03/22/2022

A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning

Vision-language models can encode societal biases and stereotypes, but t...

Hugo Elias Berg, et al.

research

∙ 04/01/2021

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval

Our objective in this work is video-text retrieval - in particular a joi...

Max Bain, et al.

research

∙ 05/08/2020

Condensed Movies: Story Based Retrieval with Contextual Embeddings

Our objective in this work is the long range understanding of the narrat...

Max Bain, et al.

research

∙ 09/19/2019

Count, Crop and Recognise: Fine-Grained Recognition in the Wild

The goal of this paper is to label all the animal individuals present in...

Max Bain, et al.

