Yusuf Aytar

research

∙ 08/30/2023

RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation

For robots to be useful outside labs and specialized factories we need a...

0 Mel Večerík, et al. ∙

research

∙ 06/20/2023

RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation

The ability to leverage heterogeneous robotic experience from different ...

0 Konstantinos Bousmalis, et al. ∙

research

∙ 06/14/2023

TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement

We present a novel model for Tracking Any Point (TAP) that effectively t...

0 Carl Doersch, et al. ∙

research

∙ 05/23/2023

Perception Test: A Diagnostic Benchmark for Multimodal Video Models

We propose a novel multimodal video benchmark - the Perception Test - to...

0 Viorica Patraucean, et al. ∙

research

∙ 04/13/2023

Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation

Recent works have shown that large models pretrained on common visual le...

1 Mohit Sharma, et al. ∙

research

∙ 11/07/2022

TAP-Vid: A Benchmark for Tracking Any Point in a Video

Generic motion understanding from video involves not only tracking objec...

0 Carl Doersch, et al. ∙

research

∙ 12/09/2021

Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies

For robots operating in the real world, it is desirable to learn reusabl...

0 Dushyant Rao, et al. ∙

research

∙ 12/01/2021

Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation

Complex sequential tasks in continuous-control settings often require ag...

7 Todor Davchev, et al. ∙

research

∙ 04/29/2021

With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations

Self-supervised learning algorithms based on instance discrimination tra...

4 Debidatta Dwibedi, et al. ∙

research

∙ 03/16/2021

Manipulator-Independent Representations for Visual Imitation

Imitation learning is an effective tool for robotic learning tasks where...

0 Yuxiang Zhou, et al. ∙

research

∙ 01/21/2021

Learning rich touch representations through cross-modal self-supervision

The sense of touch is fundamental in several manipulation tasks, but rar...

0 Martina Zambelli, et al. ∙

research

∙ 12/12/2020

Semi-supervised reward learning for offline reinforcement learning

In offline reinforcement learning (RL) agents are trained using a logged...

12 Ksenia Konyushkova, et al. ∙

research

∙ 11/27/2020

Offline Learning from Demonstrations and Unlabeled Experience

Behavior cloning (BC) is often practical for robot learning because it a...

6 Konrad Zolna, et al. ∙

research

∙ 11/06/2020

Large-scale multilingual audio visual dubbing

We describe a system for large-scale audiovisual translation and dubbing...

3 Yi Yang, et al. ∙

research

∙ 06/27/2020

Counting Out Time: Class Agnostic Video Repetition Counting in the Wild

We present an approach for estimating the period with which an action is...

4 Debidatta Dwibedi, et al. ∙

research

∙ 10/21/2019

Self-Supervised Sim-to-Real Adaptation for Visual Robotic Manipulation

Collecting and automatically obtaining reward signals from real robotic ...

0 Rae Jeong, et al. ∙

research

∙ 09/26/2019

A Framework for Data-Driven Robotics

We present a framework for data-driven robotics that makes use of a larg...

0 Serkan Cabi, et al. ∙

research

∙ 04/16/2019

Temporal Cycle-Consistency Learning

We introduce a self-supervised representation learning method based on t...

2 Debidatta Dwibedi, et al. ∙

research

∙ 10/14/2018

Recipe1M: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images

In this paper, we introduce Recipe1M, a new large-scale, structured corp...

0 Javier Marin, et al. ∙

research

∙ 10/11/2018

One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL

Humans are experts at high-fidelity imitation -- closely mimicking a dem...

4 Tom Le Paine, et al. ∙

research

∙ 05/29/2018

Playing hard exploration games by watching YouTube

Deep reinforcement learning methods traditionally struggle with tasks wh...

2 Yusuf Aytar, et al. ∙

research

∙ 08/23/2017

Exploiting Convolution Filter Patterns for Transfer Learning

In this paper, we introduce a new regularization technique for transfer ...

0 Mehmet Aygün, et al. ∙

research

∙ 06/03/2017

See, Hear, and Read: Deep Aligned Representations

We capitalize on large amounts of readily-available, synchronous data to...

0 Yusuf Aytar, et al. ∙

research

∙ 03/09/2017

Face-to-BMI: Using Computer Vision to Infer Body Mass Index on Social Media

A person's weight status can have profound implications on their life, r...

0 Enes Kocabey, et al. ∙

research

∙ 02/21/2017

Is Saki #delicious? The Food Perception Gap on Instagram and Its Relation to Health

Food is an integral part of our life and what and how much we eat crucia...

0 Ferda Ofli, et al. ∙

research

∙ 10/27/2016

Cross-Modal Scene Networks

People can recognize scenes across many different modalities beyond natu...

0 Yusuf Aytar, et al. ∙

research

∙ 10/27/2016

SoundNet: Learning Sound Representations from Unlabeled Video

We learn rich natural sound representations by capitalizing on large amo...

0 Yusuf Aytar, et al. ∙

research

∙ 10/01/2016

How Transferable are CNN-based Features for Age and Gender Classification?

Age and gender are complementary soft biometric traits for face recognit...

0 Gökhan Özbulak, et al. ∙

research

∙ 07/25/2016

Learning Aligned Cross-Modal Representations from Weakly Aligned Data

People can recognize scenes across many different modalities beyond natu...

0 Lluis Castrejon, et al. ∙
