Leonid Sigal

research

∙ 07/26/2023

Uncertainty Guided Adaptive Warping for Robust and Efficient Stereo Matching

Correlation based stereo matching has achieved outstanding performance, ...

0 Junpeng Jing, et al. ∙

research

∙ 07/15/2023

INVE: Interactive Neural Video Editing

We present Interactive Neural Video Editing (INVE), a real-time video ed...

0 Jiahui Huang, et al. ∙

research

∙ 03/14/2023

Implicit and Explicit Commonsense for Multi-sentence Video Captioning

Existing dense or paragraph video captioning approaches rely on holistic...

0 Shih-Han Chou, et al. ∙

research

∙ 02/16/2023

MINOTAUR: Multi-task Video Grounding From Multimodal Queries

Video understanding tasks take many forms, from action detection to visu...

0 Raghav Goyal, et al. ∙

research

∙ 02/14/2023

Frustratingly Simple but Effective Zero-shot Detection and Segmentation: Analysis and a Strong Baseline

Methods for object detection and segmentation often require abundant ins...

0 Siddhesh Khandelwal, et al. ∙

research

∙ 02/02/2023

Self-Supervised Relation Alignment for Scene Graph Generation

The goal of scene graph generation is to predict a graph from an input i...

0 Bicheng Xu, et al. ∙

research

∙ 01/03/2023

Vocabulary-informed Zero-shot and Open-set Learning

Despite significant progress in object categorization, in recent years, ...

0 Yanwei Fu, et al. ∙

research

∙ 12/06/2022

Semantically Enhanced Global Reasoning for Semantic Segmentation

Recent advances in pixel-level tasks (e.g., segmentation) illustrate the...

0 Mir Rayat Imtiaz Hossain, et al. ∙

research

∙ 11/28/2022

GraphPNAS: Learning Distribution of Good Neural Architectures via Deep Graph Generative Models

Neural architectures can be naturally viewed as computational graphs. Mo...

0 Muchen Li, et al. ∙

research

∙ 11/23/2022

Make-A-Story: Visual Memory Conditioned Consistent Story Generation

There has been a recent explosion of impressive generative models that c...

0 Tanzila Rahman, et al. ∙

research

∙ 10/24/2022

VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge

There has been a growing interest in solving Visual Question Answering (...

0 Sahithya Ravi, et al. ∙

research

∙ 10/04/2022

Real-Time Monitoring of User Stress, Heart Rate and Heart Rate Variability on Mobile Devices

Stress is considered to be the epidemic of the 21st-century. Yet, mobile...

12 Peyman Bateni, et al. ∙

research

∙ 07/27/2022

Iterative Scene Graph Generation

The task of scene graph generation entails identifying object entities a...

0 Siddhesh Khandelwal, et al. ∙

research

∙ 07/21/2022

Generalizable Patch-Based Neural Rendering

Neural rendering has received tremendous attention since the advent of N...

0 Mohammed Suhail, et al. ∙

research

∙ 03/22/2022

Self-supervision through Random Segments with Autoregressive Coding (RandSAC)

Inspired by the success of self-supervised autoregressive representation...

1 Tianyu Hua, et al. ∙

research

∙ 01/13/2022

Beyond Simple Meta-Learning: Multi-Purpose Models for Multi-Domain, Active and Continual Few-Shot Learning

Modern deep learning requires large-scale extensively labelled datasets ...

49 Peyman Bateni, et al. ∙

research

∙ 12/17/2021

Light Field Neural Rendering

Classical light field rendering for novel view synthesis can accurately ...

0 Mohammed Suhail, et al. ∙

research

∙ 11/24/2021

Layered Controllable Video Generation

We introduce layered controllable video generation, where we, without an...

0 Jiahui Huang, et al. ∙

research

∙ 10/26/2021

TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation

The recent success of transformer models in language, such as BERT, has ...

0 Tanzila Rahman, et al. ∙

research

∙ 06/06/2021

Referring Transformer: A One-step Approach to Multi-task Visual Grounding

As an important step towards visual reasoning, visual grounding (e.g., p...

0 Muchen Li, et al. ∙

research

∙ 04/29/2021

Segmentation-grounded Scene Graph Generation

Scene graph generation has emerged as an important problem in computer v...

0 Siddhesh Khandelwal, et al. ∙

research

∙ 03/25/2021

Weakly-supervised Audio-visual Sound Source Detection and Separation

Learning how to localize and separate individual object sounds in the au...

0 Tanzila Rahman, et al. ∙

research

∙ 03/03/2021

Energy-Based Learning for Scene Graph Generation

Traditional scene graph generation methods are trained using cross-entro...

0 Mohammed Suhail, et al. ∙

research

∙ 11/04/2020

An Improved Attention for Visual Question Answering

We consider the problem of Visual Question Answering (VQA). Given an ima...

2 Tanzila Rahman, et al. ∙

research

∙ 08/28/2020

Person-in-Context Synthesis with Compositional Structural Space

Despite significant progress, controlled generation of complex images wi...

1 Weidong Yin, et al. ∙

research

∙ 08/27/2020

Attribute-guided image generation from layout

Recent approaches have achieved great success in image generation from s...

0 Ke Ma, et al. ∙

research

∙ 06/25/2020

Unsupervised Video Decomposition using Spatio-temporal Iterative Inference

Unsupervised multi-object scene decomposition is a fast-emerging problem...

0 Polina Zablotskaia, et al. ∙

research

∙ 06/23/2020

Discriminative Feature Alignment: Improving Transferability of Unsupervised Domain Adaptation by Gaussian-guided Latent Alignment

In this study, we focus on the unsupervised domain adaptation problem wh...

0 Jing Wang, et al. ∙

research

∙ 06/12/2020

Weakly-supervised Any-shot Object Detection

Methods for object detection and segmentation rely on large scale instan...

0 Siddhesh Khandelwal, et al. ∙

research

∙ 04/02/2020

Consistent Multiple Sequence Decoding

Sequence decoding is one of the core components of most visual-lingual m...

0 Bicheng Xu, et al. ∙

research

∙ 02/24/2020

Variational Hyper RNN for Sequence Modeling

In this work, we propose a novel probabilistic sequence model that excel...

23 Ruizhi Deng, et al. ∙

research

∙ 12/23/2019

Front2Back: Single View 3D Shape Reconstruction via Front to Back Prediction

Reconstruction of a 3D shape from a single 2D image is a classical compu...

0 Yuan Yao, et al. ∙

research

∙ 12/07/2019

Improved Few-Shot Visual Classification

Few-shot learning is a fundamental task in computer vision that carries ...

1 Peyman Bateni, et al. ∙

research

∙ 12/05/2019

Zero-Shot Generation of Human-Object Interaction Videos

Generation of videos of complex scenes is an important open problem in c...

9 Megha Nawhal, et al. ∙

research

∙ 11/29/2019

OptiBox: Breaking the Limits of Proposals for Visual Grounding

The problem of language grounding has attracted much attention in recent...

0 Zicong Fan, et al. ∙

research

∙ 10/21/2019

DwNet: Dense warp-based network for pose-guided human video generation

Generation of realistic high-resolution videos of human subjects is a ch...

14 Polina Zablotskaia, et al. ∙

research

∙ 09/22/2019

Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event Captioning

Multi-modal learning, particularly among imaging and linguistic modaliti...

0 Tanzila Rahman, et al. ∙

research

∙ 07/24/2019

LayoutVAE: Stochastic Scene Layout Generation from a Label Set

Recently there is an increasing interest in scene generation within the ...

3 Akash Abdu Jyothi, et al. ∙

research

∙ 05/22/2019

AttentionRNN: A Structured Spatial Attention Mechanism

Visual attention mechanisms have proven to be integrally important const...

31 Siddhesh Khandelwal, et al. ∙

research

∙ 04/05/2019

A Variational Auto-Encoder Model for Stochastic Point Processes

We propose a novel probabilistic generative model for action sequences. ...

0 Nazanin Mehrasa, et al. ∙

research

∙ 03/18/2019

Neural Sequential Phrase Grounding (SeqGROUND)

We propose an end-to-end approach for phrase grounding in images. Unlike...

8 Pelin Dogan, et al. ∙

research

∙ 12/04/2018

Walking on Thin Air: Environment-Free Physics-based Markerless Motion Capture

We propose a generative approach to physics-based motion capture. Unlike...

0 Micha Livne, et al. ∙

research

∙ 12/01/2018

Towards Traversing the Continuous Spectrum of Image Retrieval

Image retrieval is one of the most popular tasks in computer vision. How...

0 Ziad Al-Halah, et al. ∙

research

∙ 11/28/2018

Image Generation from Layout

Despite significant recent progress on generative models, controlled gen...

0 Bo Zhao, et al. ∙

research

∙ 10/28/2018

Middle-Out Decoding

Despite being virtually ubiquitous, sequence-to-sequence models are chal...

0 Shikib Mehri, et al. ∙

research

∙ 10/17/2018

Learning to Separate Domains in Generalized Zero-Shot and Open Set Learning: a probabilistic perspective

This paper studies the domain division problem, which aims to ...

0 HanZe Dong, et al. ∙

research

∙ 10/01/2018

Where and When to Look? Spatio-temporal Attention for Action Recognition in Videos

Inspired by the observation that humans are able to process videos effic...

0 Lili Meng, et al. ∙

research

∙ 04/15/2018

Semantic Feature Augmentation in Few-shot Learning

A fundamental problem with few-shot learning is the scarcity of data in ...

0 Zitian Chen, et al. ∙

research

∙ 04/13/2018

Text-to-Clip Video Retrieval with Early Fusion and Re-Captioning

We propose a novel method capable of retrieving clips from untrimmed vid...

0 Huijuan Xu, et al. ∙
