Stephen Gould

research

∙ 07/23/2023

Learning Navigational Visual Representations with Semantic Map Supervision

Being able to perceive the semantics and the spatial structure of the en...

0 Yicong Hong, et al. ∙

research

∙ 06/26/2023

PMaF: Deep Declarative Layers for Principal Matrix Features

We explore two differentiable deep declarative layers, namely least squa...

0 Zhiwei Xu, et al. ∙

research

∙ 06/24/2023

Towards Understanding Gradient Approximation in Equality Constrained Deep Declarative Networks

We explore conditions for when the gradient of a deep declarative node c...

0 Stephen Gould, et al. ∙

research

∙ 05/25/2023

Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder

Composed image retrieval aims to find an image that best matches a given...

0 Zheyuan Liu, et al. ∙

research

∙ 04/18/2023

GoferBot: A Visual Guided Human-Robot Collaborative Assembly System

The current transformation towards smart manufacturing has led to a grow...

0 Zheyu Zhuang, et al. ∙

research

∙ 03/30/2023

Adaptive Cross Batch Normalization for Metric Learning

Metric learning is a fundamental problem in computer vision whereby a mo...

9 Thalaiyasingam Ajanthan, et al. ∙

research

∙ 03/29/2023

Bi-directional Training for Composed Image Retrieval via Text Prompt Learning

Composed image retrieval searches for a target image based on a multi-mo...

0 Zheyuan Liu, et al. ∙

research

∙ 03/24/2023

Aligning Step-by-Step Instructional Diagrams to Video Demonstrations

Multimodal alignment facilitates the retrieval of instances from one mod...

0 Jiahao Zhang, et al. ∙

research

∙ 03/19/2023

Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths

This paper addresses learning end-to-end models for time series data tha...

0 Ming Xu, et al. ∙

research

∙ 03/11/2023

3DInAction: Understanding Human Actions in 3D Point Clouds

We propose a novel method for 3D point cloud action recognition. Underst...

0 Yizhak Ben-Shabat, et al. ∙

research

∙ 02/02/2023

Confidence and Dispersity Speak: Characterising Prediction Matrix for Unsupervised Accuracy Estimation

This work aims to assess how well a model performs under distribution sh...

0 Weijian Deng, et al. ∙

research

∙ 12/22/2022

Understanding and Improving the Role of Projection Head in Self-Supervised Learning

Self-supervised learning (SSL) aims to produce useful feature representa...

4 Kartik Gupta, et al. ∙

research

∙ 12/07/2022

NeRFEditor: Differentiable Style Decomposition for Full 3D Scene Editing

We present NeRFEditor, an efficient learning framework for 3D scene edit...

0 Chunyi Sun, et al. ∙

research

∙ 11/30/2022

High-Fidelity Guided Image Synthesis with Latent Diffusion Models

Controllable image synthesis with user scribbles has gained huge public ...

0 Jaskirat Singh, et al. ∙

research

∙ 07/14/2022

On the Strong Correlation Between Model Invariance and Generalization

Generalization and invariance are two essential properties of any machin...

5 Weijian Deng, et al. ∙

research

∙ 03/05/2022

Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation

Most existing works in vision-and-language navigation (VLN) focus on eit...

0 Yicong Hong, et al. ∙

research

∙ 02/24/2022

Exploiting Problem Structure in Deep Declarative Networks: Two Case Studies

Deep declarative networks and other recent related works have shown how ...

0 Stephen Gould, et al. ∙

research

∙ 12/03/2021

Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer

Recent developments in transformer models for visual data have led to si...

5 Frederic Z. Zhang, et al. ∙

research

∙ 10/06/2021

A Regularized Wasserstein Framework for Graph Kernels

We propose a learning framework for graph kernels, which is theoreticall...

0 Asiri Wijesinghe, et al. ∙

research

∙ 08/09/2021

Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models

We extend the task of composed image retrieval, where an input query con...

0 Zheyuan Liu, et al. ∙

research

∙ 06/21/2021

DiGS : Divergence guided shape implicit neural representation for unoriented point clouds

Neural shape representations have recently shown to be effective in shap...

0 Yizhak Ben-Shabat, et al. ∙

research

∙ 06/10/2021

What Does Rotation Prediction Tell Us about Classifier Accuracy under Varying Testing Environments?

Understanding classifier decision under novel environments is central to...

0 Weijian Deng, et al. ∙

research

∙ 01/02/2021

Semantics for Robotic Mapping, Perception and Interaction: A Survey

For robots to navigate and interact more richly with the world around th...

0 Sourav Garg, et al. ∙

research

∙ 12/12/2020

Fine-grained Classification via Categorical Memory Networks

Motivated by the desire to exploit patterns shared across classes, we pr...

0 Weijian Deng, et al. ∙

research

∙ 12/11/2020

Spatio-attentive Graphs for Human-Object Interaction Detection

We address the problem of detecting human–object interactions in images ...

3 Frederic Z. Zhang, et al. ∙

research

∙ 12/03/2020

Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking

Despite the recent advances in multiple object tracking (MOT), achieved ...

0 Fatemeh Saleh, et al. ∙

research

∙ 11/26/2020

A Recurrent Vision-and-Language BERT for Navigation

Accuracy of many visiolinguistic tasks has benefited significantly from ...

0 Yicong Hong, et al. ∙

research

∙ 11/25/2020

How to train your conditional GAN: An approach using geometrically structured latent manifolds

Conditional generative modeling typically requires capturing one-to-many...

0 Sameera Ramasinghe, et al. ∙

research

∙ 10/19/2020

Language and Visual Entity Relationship Graph for Agent Navigation

Vision-and-Language Navigation (VLN) requires an agent to navigate in a ...

0 Yicong Hong, et al. ∙

research

∙ 10/13/2020

DORi: Discovering Object Relationship for Moment Localization of a Natural-Language Query in Video

This paper studies the task of temporal moment localization in a long un...

0 Cristian Rodriguez Opazo, et al. ∙

research

∙ 10/07/2020

Conditional Generative Modeling via Learning the Latent Space

Although deep learning has achieved appealing results on several machine...

12 Sameera Ramasinghe, et al. ∙

research

∙ 07/29/2020

Solving the Blind Perspective-n-Point Problem End-To-End With Robust Differentiable Geometric Optimization

Blind Perspective-n-Point (PnP) is the problem of estimating the positio...

0 Dylan Campbell, et al. ∙

research

∙ 07/01/2020

The IKEA ASM Dataset: Understanding People Assembling Furniture through Actions, Objects and Pose

The availability of a large labeled dataset is a key requirement for app...

0 Yizhak Ben-Shabat, et al. ∙

research

∙ 06/22/2020

Bidirectional Self-Normalizing Neural Networks

The problem of exploding and vanishing gradients has been a long-standin...

11 Yao Lu, et al. ∙

research

∙ 05/27/2020

A Multi-modal Approach to Fine-grained Opinion Mining on Video Reviews

Despite the recent advances in opinion mining for written reviews, few w...

13 Edison Marrese-Taylor, et al. ∙

research

∙ 04/28/2020

Inferring Temporal Compositions of Actions Using Probabilistic Automata

This paper presents a framework to recognize temporal compositions of at...

6 Rodrigo Santa Cruz, et al. ∙

research

∙ 04/16/2020

ArTIST: Autoregressive Trajectory Inpainting and Scoring for Tracking

One of the core components in online multiple object tracking (MOT) fram...

0 Fatemeh Saleh, et al. ∙

research

∙ 04/06/2020

Sub-Instruction Aware Vision-and-Language Navigation

Vision-and-language navigation requires an agent to navigate through a r...

0 Yicong Hong, et al. ∙

research

∙ 03/23/2020

DeepFit: 3D Surface Fitting via Neural Network Weighted Least Squares

We propose a surface fitting method for unstructured 3D point clouds. Th...

8 Yizhak Ben-Shabat, et al. ∙

research

∙ 03/17/2020

Learning to Structure an Image with Few Colors

Color and structure are the two pillars that construct an image. Usually...

0 Yunzhong Hou, et al. ∙

research

∙ 02/26/2020

Joint Unsupervised Learning of Optical Flow and Egomotion with Bi-Level Optimization

We address the problem of joint optical flow and camera motion estimatio...

3 Shihao Jiang, et al. ∙

research

∙ 12/18/2019

Sampling Good Latent Variables via CPP-VAEs: VAEs with Condition Posterior as Prior

In practice, conditional variational autoencoders (CVAEs) perform condit...

0 Sadegh Aliakbarian, et al. ∙

research

∙ 12/04/2019

Spectral-GANs for High-Resolution 3D Point-cloud Generation

Point-clouds are a popular choice for vision and graphics tasks due to t...

20 Sameera Ramasinghe, et al. ∙

research

∙ 11/30/2019

Representation Learning on Unit Ball with 3D Roto-Translational Equivariance

Convolution is an integral operation that defines how the shape of one f...

0 Sameera Ramasinghe, et al. ∙

research

∙ 09/11/2019

Deep Declarative Networks: A New Hope

We introduce a new class of end-to-end learnable models wherein data pro...

38 Stephen Gould, et al. ∙

research

∙ 08/24/2019

Blended Convolution and Synthesis for Efficient Discrimination of 3D Shapes

Existing networks directly learn feature representations on 3D point clo...

0 Sameera Ramasinghe, et al. ∙

research

∙ 08/20/2019

Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention

This paper studies the problem of temporal moment localization in a long...

0 Cristian Rodriguez Opazo, et al. ∙

research

∙ 08/02/2019

Learning Variations in Human Motion via Mix-and-Match Perturbation

Human motion prediction is a stochastic process: Given an observed seque...

0 Mohammad Sadegh Aliakbarian, et al. ∙

research

∙ 06/14/2019

A Signal Propagation Perspective for Pruning Neural Networks at Initialization

Network pruning is a promising avenue for compressing deep neural networ...

0 Namhoon Lee, et al. ∙

research

∙ 04/29/2019

Learning to Find Common Objects Across Image Collections

We address the problem of finding a set of images containing a common, b...

28 Amirreza Shaban, et al. ∙

