Jonathan Huang

research

∙ 06/07/2023

Optimizing ViViT Training: Time and Memory Reduction for Action Recognition

In this paper, we address the challenges posed by the substantial traini...

0 Shreyank N Gowda, et al. ∙

research

∙ 06/02/2023

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model

Observing the close relationship among panoptic, semantic and instance s...

0 Xiuye Gu, et al. ∙

research

∙ 05/03/2023

Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations

This work investigates pretrained audio representations for few shot Sou...

0 Vasudha Kowtha, et al. ∙

research

∙ 05/27/2021

Doubly robust, machine learning effect estimation in real-world clinical sciences: A practical evaluation of performance in molecular epidemiology cohort settings

Modern efficient estimators such as AIPW and TMLE facilitate the applica...

0 Xiang Meng, et al. ∙

research

∙ 04/06/2021

Local Metrics for Multi-Object Tracking

This paper introduces temporally local metrics for Multi-Object Tracking...

21 Jack Valmadre, et al. ∙

research

∙ 04/01/2021

The surprising impact of mask-head architecture on novel class segmentation

Instance segmentation models today are very accurate when trained on lar...

10 Vighnesh Birodkar, et al. ∙

research

∙ 10/12/2020

The implications of outcome truncation in reproductive medicine RCTs: a simulation platform for trialists and simulation study

Randomised controlled trials in reproductive medicine are often subject ...

0 Jack Wilkinson, et al. ∙

research

∙ 09/28/2020

PERF-Net: Pose Empowered RGB-Flow Net

In recent years, many works in the video action recognition literature h...

1 Yinxiao Li, et al. ∙

research

∙ 09/17/2020

Utterance-level Intent Recognition from Keywords

This paper focuses on wake on intent (WOI) techniques for platforms with...

1 Wenda Chen, et al. ∙

research

∙ 08/11/2020

Compact Speaker Embedding: lrx-vector

Deep neural networks (DNN) have recently been widely used in speaker rec...

0 Munir Georges, et al. ∙

research

∙ 03/30/2020

RetinaTrack: Online Single Stage Joint Detection and Tracking

Traditionally multi-object tracking and object detection are performed u...

13 Zhichao Lu, et al. ∙

research

∙ 12/20/2019

Exploring Context, Attention and Audio Features for Audio Visual Scene-Aware Dialog

We are witnessing a confluence of vision, speech and dialog system techn...

0 Shachi H Kumar, et al. ∙

research

∙ 12/20/2019

Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog

With the recent advancements in Artificial Intelligence (AI), Intelligen...

0 Shachi H Kumar, et al. ∙

research

∙ 12/07/2019

Long Term Temporal Context for Per-Camera Object Detection

In static monitoring cameras, useful contextual information can stretch ...

1 Sara Beery, et al. ∙

research

∙ 10/25/2019

Structural sparsification for Far-field Speaker Recognition with GNA

Recently, deep neural networks (DNN) have been widely used in speaker re...

0 Jingchi Zhang, et al. ∙

research

∙ 12/20/2018

Context, Attention and Audio Feature Explorations for Audio Visual Scene-Aware Dialog

With the recent advancements in AI, Intelligent Virtual Assistants (IVA)...

0 Shachi H Kumar, et al. ∙

research

∙ 11/27/2018

Uncertainty aware multimodal activity recognition with Bayesian inference

Deep neural networks (DNNs) provide state-of-the-art results for a multi...

0 Mahesh Subedar, et al. ∙

research

∙ 06/07/2018

Multimodal Relational Tensor Network for Sentiment and Emotion Classification

Understanding Affect from video segments has brought researchers from th...

0 Saurav Sahay, et al. ∙

research

∙ 03/16/2018

Learning to Segment via Cut-and-Paste

This paper presents a weakly-supervised approach to object instance segm...

0 Tal Remez, et al. ∙

research

∙ 12/13/2017

Rethinking Spatiotemporal Feature Learning For Video Understanding

In this paper we study 3D convolutional networks for video understanding...

0 Saining Xie, et al. ∙

research

∙ 12/02/2017

Progressive Neural Architecture Search

We propose a method for learning CNN structures that is more efficient t...

0 Chenxi Liu, et al. ∙

research

∙ 05/30/2017

Generative Models of Visually Grounded Imagination

It is easy for people to imagine what a man with pink hair looks like, e...

0 Ramakrishna Vedantam, et al. ∙

research

∙ 05/05/2017

Motion Prediction Under Multimodality with Conditional Stochastic Networks

Given a visual history, multiple future outcomes for a video scene are e...

0 Katerina Fragkiadaki, et al. ∙

research

∙ 12/07/2016

Spatially Adaptive Computation Time for Residual Networks

This paper proposes a deep learning architecture based on Residual Netwo...

0 Michael Figurnov, et al. ∙

research

∙ 11/30/2016

Speed/accuracy trade-offs for modern convolutional object detectors

The goal of this paper is to serve as a guide for selecting a detection ...

0 Jonathan Huang, et al. ∙

research

∙ 11/09/2015

Detecting events and key actors in multi-person videos

Multi-person event recognition is a challenging task, often with many pe...

0 Vignesh Ramanathan, et al. ∙

research

∙ 11/07/2015

Generation and Comprehension of Unambiguous Object Descriptions

We propose a method that can generate an unambiguous description (known ...

0 Junhua Mao, et al. ∙

research

∙ 06/19/2015

Deep Knowledge Tracing

Knowledge tracing---where a machine models the knowledge of a student as...

1 Chris Piech, et al. ∙

research

∙ 05/22/2015

Learning Program Embeddings to Propagate Feedback on Student Code

Providing feedback, both assessing final work and giving hints to stuck ...

0 Chris Piech, et al. ∙

research

∙ 03/05/2015

What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision

We present a novel method for aligning a sequence of instructions to a v...

0 Jonathan Malmaud, et al. ∙

research

∙ 07/09/2013

Tuned Models of Peer Assessment in MOOCs

In massive open online courses (MOOCs), peer grading serves as a critica...

0 Chris Piech, et al. ∙

research

∙ 02/14/2012

Efficient Probabilistic Inference with Partial Ranking Queries

Distributions over rankings are used to model data in various settings s...

0 Jonathan Huang, et al. ∙

research

∙ 06/07/2010

Uncovering the Riffled Independence Structure of Rankings

Representing distributions over permutations can be a daunting task due ...

0 Jonathan Huang, et al. ∙

Jonathan Huang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro