
-
Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context
Weakly-supervised Temporal Action Localization (WS-TAL) methods learn to...
-
ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization
The object of Weakly-supervised Temporal Action Localization (WS-TAL) is...
-
Model-based 3D Hand Reconstruction via Self-Supervised Learning
Reconstructing a 3D hand from a single-view RGB image is challenging due...
-
Track to Detect and Segment: An Online Multi-Object Tracker
Most online multi-object trackers perform object detection stand-alone i...
-
Rethinking Soft Labels for Knowledge Distillation: A Bias-Variance Tradeoff Perspective
Knowledge distillation is an effective approach to leverage a well-train...
-
SPAGAN: Shortest Path Graph Attention Network
Graph convolutional networks (GCN) have recently demonstrated their pote...
-
Interventional Domain Adaptation
Domain adaptation (DA) aims to transfer discriminative features learned ...
-
Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization
Weakly-supervised Temporal Action Localization (W-TAL) aims to classify ...
-
Attention-Aware Noisy Label Learning for Image Classification
Deep convolutional neural networks (CNNs) learned on large-scale labeled...
-
ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection
We consider the problem of Human-Object Interaction (HOI) Detection, whi...
-
Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation
Despite the previous success of object analysis, detecting and segmentin...
-
Revisiting Modified Greedy Algorithm for Monotone Submodular Maximization with a Knapsack Constraint
Monotone submodular maximization with a knapsack constraint is NP-hard. ...
-
Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene
Learning on 3D scene-based point cloud has received extensive attention ...
-
Deep Reinforcement Learning with Label Embedding Reward for Supervised Image Hashing
Deep hashing has shown promising results in image retrieval and recognit...
-
Temporal Distinct Representation Learning for Action Recognition
Motivated by the previous success of Two-Dimensional Convolutional Neura...
-
Structure-Aware Human-Action Generation
Generating long-range skeleton-based human actions has been a challengin...
-
Joint Hand-object 3D Reconstruction from a Single Image with Cross-branch Feature Fusion
Accurate 3D reconstruction of the hand and object shape from a hand-obje...
-
Towards Understanding the Adversarial Vulnerability of Skeleton-based Action Recognition
Skeleton-based action recognition has attracted increasing attention due...
-
3DV: 3D Dynamic Voxel for Action Recognition in Depth Video
To facilitate depth-based 3D action recognition, 3D dynamic voxel (3DV) ...
-
Image Co-skeletonization via Co-segmentation
Recent advances in the joint processing of images have certainly shown i...
-
Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction
In this work, we study how well different types of approaches generalise ...
-
Temporal Pulses Driven Spiking Neural Network for Fast Object Recognition in Autonomous Driving
Accurate real-time object recognition from sensory data has long been a ...
-
Learning Diverse Stochastic Human-Action Generators by Learning Smooth Latent Transitions
Human-motion generation is a long-standing challenging task due to the r...
-
A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image
For the 3D hand and body pose estimation task in depth images, a novel anchor...
-
Context-Integrated and Feature-Refined Network for Lightweight Urban Scene Parsing
Semantic segmentation for lightweight urban scene parsing is a very chal...
-
Bayesian Uncertainty Matching for Unsupervised Domain Adaptation
Domain adaptation is an important technique to alleviate performance deg...
-
Kervolutional Neural Networks
Convolutional neural networks (CNNs) have enabled the state-of-the-art p...
-
3D Hand Shape and Pose Estimation from a Single RGB Image
This work addresses a novel and challenging problem of estimating the fu...
-
Progress Regression RNN for Online Spatial-Temporal Action Localization in Unconstrained Videos
Previous spatial-temporal action localization methods commonly follow th...
-
Learning Saliency Maps for Adversarial Point-Cloud Generation
3D point-cloud recognition with deep neural network (DNN) has received r...
-
Exploiting Local Feature Patterns for Unsupervised Domain Adaptation
Unsupervised domain adaptation methods aim to alleviate performance degr...
-
Actor-Action Semantic Segmentation with Region Masks
In this paper, we study the actor-action semantic segmentation problem, ...
-
Towards Profit Maximization for Online Social Network Providers
Online Social Networks (OSNs) attract billions of users to share informa...
-
3D Hand Pose Estimation: From Current Achievements to Future Goals
In this paper, we strive to answer two questions: What is the current st...
-
Non-Iterative Localization and Fast Mapping
This paper presents a non-iterative method for dense mapping using inert...
-
Kernel Cross-Correlator
Cross-correlator plays a significant role in many visual perception task...
-
Robust 3D Hand Pose Estimation in Single Depth Images: from Single-View CNN to Multi-View CNNs
Articulated hand pose estimation plays an important role in human-comput...