Brais Martinez

research

∙ 07/28/2023

SimDETR: Simplifying self-supervised pretraining for DETR

DETR-based object detectors have achieved remarkable performance but are...

0 Ioannis Maniadis Metaxas, et al. ∙

research

∙ 04/04/2023

Black Box Few-Shot Adaptation for Vision-Language models

Vision-Language (V-L) models trained with contrastive learning to align ...

0 Yassine Ouali, et al. ∙

research

∙ 10/10/2022

FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training

This paper is on Few-Shot Object Detection (FSOD), where given a few tem...

14 Adrian Bulat, et al. ∙

research

∙ 10/06/2022

Effective Self-supervised Pre-training on Low-compute networks without Distillation

Despite the impressive progress of self-supervised learning (SSL), its a...

0 Fuwen Tan, et al. ∙

research

∙ 10/05/2022

Variational prompt tuning improves generalization of vision-language models

Prompt tuning provides an efficient mechanism to adapt large vision-lang...

3 Mohammad Mahdi Derakhshani, et al. ∙

research

∙ 09/29/2022

REST: REtrieve & Self-Train for generative action recognition

This work is on training a generative action/video recognition model who...

0 Adrian Bulat, et al. ∙

research

∙ 08/23/2022

Efficient Attention-free Video Shift Transformers

This paper tackles the problem of efficient video recognition. In this a...

0 Adrian Bulat, et al. ∙

research

∙ 06/16/2022

iBoot: Image-bootstrapped Self-Supervised Video Representation Learning

Learning visual representations through self-supervision is an extremely...

0 Fatemeh Saleh, et al. ∙

research

∙ 05/13/2022

Knowledge Distillation Meets Open-Set Semi-Supervised Learning

Existing knowledge distillation methods mostly focus on distillation of ...

13 Jing Yang, et al. ∙

research

∙ 05/06/2022

EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers

Self-attention based models such as vision transformers (ViTs) have emer...

18 Junting Pan, et al. ∙

research

∙ 04/10/2022

SOS! Self-supervised Learning Over Sets Of Handled Objects In Egocentric Action Recognition

Learning an egocentric action recognition model from video data is chall...

14 Victor Escorcia, et al. ∙

research

∙ 10/06/2021

SAIC_Cambridge-HuPBA-FBK Submission to the EPIC-Kitchens-100 Action Recognition Challenge 2021

This report presents the technical details of our submission to the EPIC...

0 Swathikiran Sudhakaran, et al. ∙

research

∙ 06/10/2021

Space-time Mixing Attention for Video Transformer

This paper is on video recognition using Transformers. Very recent attem...

0 Adrian Bulat, et al. ∙

research

∙ 03/28/2021

Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization

Temporal action localization (TAL) is a fundamental yet challenging task...

7 Mengmeng Xu, et al. ∙

research

∙ 01/20/2021

Few-shot Action Recognition with Prototype-centered Attentive Learning

Few-shot action recognition aims to recognize action classes with few tr...

5 Xiatian Zhu, et al. ∙

research

∙ 11/21/2020

Boundary-sensitive Pre-training for Temporal Localization in Videos

Many video analysis tasks require temporal localization thus detection o...

1 Mengmeng Xu, et al. ∙

research

∙ 10/07/2020

High-Capacity Expert Binary Networks

Network binarization is a promising hardware-aware direction for creatin...

0 Adrian Bulat, et al. ∙

research

∙ 07/13/2020

Towards practical lipreading with distilled and efficient models

Lipreading has witnessed a lot of progress due to the resurgence of neur...

0 Pingchuan Ma, et al. ∙

research

∙ 07/03/2020

Egocentric Action Recognition by Video Attention and Temporal Context

We present the submission of Samsung AI Centre Cambridge to the CVPR2020...

2 Juan-Manuel Pérez-Rúa, et al. ∙

research

∙ 04/02/2020

Knowing What, Where and When to Look: Efficient Video Action Modeling with Attention

Attentive video modeling is essential for action recognition in unconstr...

14 Juan-Manuel Pérez-Rúa, et al. ∙

research

∙ 03/25/2020

Training Binary Neural Networks with Real-to-Binary Convolutions

This paper shows how to train binary networks to within a few percent po...

9 Brais Martinez, et al. ∙

research

∙ 03/09/2020

Knowledge distillation via adaptive instance normalization

This paper addresses the problem of model compression via knowledge dist...

17 Jing Yang, et al. ∙

research

∙ 03/03/2020

BATS: Binary ArchitecTure Search

This paper proposes Binary ArchitecTure Search (BATS), a framework that ...

31 Adrian Bulat, et al. ∙

research

∙ 01/23/2020

Lipreading using Temporal Convolutional Networks

Lip-reading has attracted a lot of research attention lately thanks to a...

13 Brais Martinez, et al. ∙

research

∙ 08/20/2019

Action recognition with spatial-temporal discriminative filter banks

Action recognition has seen a dramatic performance improvement in the la...

26 Brais Martinez, et al. ∙

research

∙ 01/17/2017

Fusing Deep Learned and Hand-Crafted Features of Appearance, Shape, and Dynamics for Automatic Pain Estimation

Automatic continuous time, continuous value assessment of a patient's pa...

0 Joy Egede, et al. ∙

research

∙ 12/07/2016

A Functional Regression approach to Facial Landmark Tracking

Linear regression is a fundamental building block in many face detection...

0 Enrique Sánchez-Lozano, et al. ∙

research

∙ 08/03/2016

Cascaded Continuous Regression for Real-time Incremental Face Tracking

This paper introduces a novel real-time algorithm for facial landmark tr...

0 Enrique Sánchez-Lozano, et al. ∙


