Angela Yao

research

∙ 09/04/2023

Can I Trust Your Answer? Visually Grounded Video Question Answering

We study visually grounded VideoQA in response to the emerging trends of...

0 Junbin Xiao, et al. ∙

research

∙ 08/25/2023

HiFiHR: Enhancing 3D Hand Reconstruction from a Single Image via High-Fidelity Texture

We present HiFiHR, a high-fidelity hand reconstruction approach that uti...

0 Jiayin Zhu, et al. ∙

research

∙ 08/22/2023

Opening the Vocabulary of Egocentric Actions

Human actions in egocentric videos are often hand-object interactions co...

0 Dibyadip Chatterjee, et al. ∙

research

∙ 07/31/2023

Every Mistake Counts in Assembly

One promising use case of AI assistants is to help with complex procedur...

0 Guodong Ding, et al. ∙

research

∙ 05/01/2023

Overcoming the Trade-off Between Accuracy and Plausibility in 3D Hand Shape Reconstruction

Direct mesh fitting for 3D hand shape reconstruction is highly accurate....

0 Ziwei Yu, et al. ∙

research

∙ 04/29/2023

An Implicit Alignment for Video Super-Resolution

Video super-resolution commonly uses a frame-wise alignment to support t...

0 Kai Xu, et al. ∙

research

∙ 02/27/2023

Contrastive Video Question Answering via Video Graph Transformer

We propose to perform video question answering (VideoQA) in a Contrastiv...

0 Junbin Xiao, et al. ∙

research

∙ 01/25/2023

Bias-Compensated Integral Regression for Human Pose Estimation

In human and hand pose estimation, heatmaps are a crucial intermediate r...

0 Kerui Gu, et al. ∙

research

∙ 01/21/2023

Improving Deep Regression with Ordinal Entropy

In computer vision, it is often observed that formulating regression pro...

0 Shihao Zhang, et al. ∙

research

∙ 12/20/2022

C2F-TCN: A Framework for Semi and Fully Supervised Temporal Action Segmentation

Temporal action segmentation tags action labels for every frame in an in...

0 Dipika Singhania, et al. ∙

research

∙ 11/24/2022

UV-Based 3D Hand-Object Reconstruction with Grasp Optimization

We propose a novel framework for 3D hand shape reconstruction and hand-o...

0 Ziwei Yu, et al. ∙

research

∙ 10/19/2022

Temporal Action Segmentation: An Analysis of Modern Technique

Temporal action segmentation from videos aims at the dense labeling of v...

0 Guodong Ding, et al. ∙

research

∙ 08/05/2022

Perception-Distortion Balanced ADMM Optimization for Single-Image Super-Resolution

In image super-resolution, both pixel-wise accuracy and perceptual fidel...

8 Yuehan Zhang, et al. ∙

research

∙ 07/20/2022

A Generalized Robust Framework For Timestamp Supervision in Temporal Action Segmentation

In temporal action segmentation, Timestamp supervision requires only a h...

0 Rahul Rahaman, et al. ∙

research

∙ 07/20/2022

Discrete-Constrained Regression for Local Counting Models

Local counts, or the number of objects in a local area, is a continuous ...

0 Haipeng Xiong, et al. ∙

research

∙ 07/18/2022

Leveraging Action Affinity and Continuity for Semi-supervised Temporal Action Segmentation

We present a semi-supervised learning approach to the temporal action se...

0 Guodong Ding, et al. ∙

research

∙ 04/28/2022

A Closer Look at Branch Classifiers of Multi-exit Architectures

Multi-exit architectures consist of a backbone and branch classifiers th...

15 Shaohui Lin, et al. ∙

research

∙ 04/07/2022

TemporalUV: Capturing Loose Clothing with Temporally Coherent UV Coordinates

We propose a novel approach to generate temporally coherent UV coordinat...

0 You Xie, et al. ∙

research

∙ 04/06/2022

Multi-Scale Memory-Based Video Deblurring

Video deblurring has achieved remarkable progress thanks to the success ...

0 Bo Ji, et al. ∙

research

∙ 03/28/2022

Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities

Assembly101 is a new procedural activity dataset featuring 4321 videos o...

3 Fadime Sener, et al. ∙

research

∙ 02/28/2022

DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training

A standard hardware bottleneck when training deep neural networks is GPU...

0 Joya Chen, et al. ∙

research

∙ 12/13/2021

Local and Global Point Cloud Reconstruction for 3D Hand Pose Estimation

This paper addresses the 3D point cloud reconstruction and 3D pose estim...

0 Ziwei Yu, et al. ∙

research

∙ 12/12/2021

Video as Conditional Graph Hierarchy for Multi-Granular Question Answering

Video question answering requires the models to understand and reason ab...

0 Junbin Xiao, et al. ∙

research

∙ 12/02/2021

Iterative Frame-Level Representation Learning And Classification For Semi-Supervised Temporal Action Segmentation

Temporal action segmentation classifies the action of each frame in (lon...

9 Dipika Singhania, et al. ∙

research

∙ 11/15/2021

Weakly-Supervised Dense Action Anticipation

Dense anticipation aims to forecast future actions and their durations f...

18 Haotong Zhang, et al. ∙

research

∙ 08/15/2021

Temporal Action Segmentation with High-level Complex Activity Labels

Over the past few years, the success in action recognition on short trim...

0 Guodong Ding, et al. ∙

research

∙ 08/02/2021

Reliable Semantic Segmentation with Superpixel-Mix

Along with predictive performance and runtime speed, reliability is a ke...

2 Gianni Franchi, et al. ∙

research

∙ 07/26/2021

Efficient Video Object Segmentation with Compressed Video

We propose an efficient inference framework for semi-supervised video ob...

5 Kai Xu, et al. ∙

research

∙ 06/14/2021

Learning Deep Morphological Networks with Neural Architecture Search

Deep Neural Networks (DNNs) are generated by sequentially performing lin...

8 Yufei Hu, et al. ∙

research

∙ 06/06/2021

Transformed ROIs for Capturing Visual Transformations in Videos

Modeling the visual changes that an action brings to a scene is critical...

5 Abhinav Rai, et al. ∙

research

∙ 06/06/2021

Learning Video Models from Text: Zero-Shot Anticipation for Procedural Actions

Can we teach a robot to recognize and make predictions for activities th...

0 Fadime Sener, et al. ∙

research

∙ 06/06/2021

Technical Report: Temporal Aggregate Representations

This technical report extends our work presented in [9] with more experi...

21 Fadime Sener, et al. ∙

research

∙ 05/25/2021

Towards Compact Single Image Super-Resolution via Contrastive Self-distillation

Convolutional neural networks (CNNs) are highly successful for super-res...

3 Yanbo Wang, et al. ∙

research

∙ 05/23/2021

Coarse to Fine Multi-Resolution Temporal Convolutional Network

Temporal convolutional networks (TCNs) are a commonly used architecture ...

0 Dipika Singhania, et al. ∙

research

∙ 05/18/2021

NExT-QA:Next Phase of Question-Answering to Explaining Temporal Actions

We introduce NExT-QA, a rigorously designed video question answering (Vi...

13 Junbin Xiao, et al. ∙

research

∙ 10/19/2020

Multi-Stage Fusion for One-Click Segmentation

Segmenting objects of interest in an image is an essential building bloc...

4 Soumajit Majumder, et al. ∙

research

∙ 10/18/2020

Localized Interactive Instance Segmentation

In current interactive instance segmentation works, the user is granted ...

0 Soumajit Majumder, et al. ∙

research

∙ 07/22/2020

Rethinking CNN Models for Audio Classification

In this paper, we show that ImageNet-Pretrained standard deep CNN models...

0 Kamalesh Palanisamy, et al. ∙

research

∙ 06/01/2020

Temporal Aggregate Representations for Long Term Video Understanding

Future prediction requires reasoning from current and past observations ...

0 Fadime Sener, et al. ∙

research

∙ 04/20/2020

Towards deep neural network compression via learnable wavelet transforms

Wavelets are well known for data compression, yet have rarely been appli...

0 Moritz Wolter, et al. ∙

research

∙ 03/30/2020

Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction

In this work, we study how well different type of approaches generalise ...

1 Anil Armagan, et al. ∙

research

∙ 12/13/2019

Bonn Activity Maps: Dataset Description

The key prerequisite for accessing the huge potential of current machine...

28 Julian Tanke, et al. ∙

research

∙ 07/24/2019

Dual Grid Net: hand mesh vertex regression from single depth maps

We present a method for recovering the dense 3D surface of the hand by r...

1 Chengde Wan, et al. ∙

research

∙ 12/13/2018

Fourier RNNs for Sequence Analysis and Prediction

Fourier methods have a long and proven track record in as an excellent t...

0 Moritz Wolter, et al. ∙

research

∙ 12/10/2018

Supervised Deep Kriging for Single-Image Super-Resolution

We propose a novel single-image super-resolution approach based on the g...

0 Gianni Franchi, et al. ∙

research

∙ 12/09/2018

Learning Style Compatibility for Furniture

When judging style, a key question that often arises is whether or not a...

0 Divyansh Aggarwal, et al. ∙

research

∙ 12/07/2018

Scale-aware multi-level guidance for interactive instance segmentation

In interactive instance segmentation, users give feedback to iteratively...

16 Soumajit Majumder, et al. ∙

research

∙ 12/06/2018

Zero-Shot Anticipation for Instructional Activities

How can we teach a robot to predict what will happen next for an activit...

6 Fadime Sener, et al. ∙

research

∙ 12/03/2018

Disentangling Latent Hands for Image Synthesis and Pose Estimation

Hand image synthesis and pose estimation from RGB images are both highly...

0 Linlin Yang, et al. ∙

research

∙ 10/25/2018

HANDS18: Methods, Techniques and Applications for Hand Observation

This report outlines the proceedings of the Fourth International Worksho...

0 Iason Oikonomidis, et al. ∙

Angela Yao

Featured Co-authors

Sign in with Google

Consider DeepAI Pro