
-
Structural and Functional Decomposition for Personality Image Captioning in a Communication Game
Personality image captioning (PIC) aims to describe an image with a natu...
read it
-
Detecting Hands and Recognizing Physical Contact in the Wild
We investigate a new problem of detecting hands and recognizing their ph...
read it
-
Uncertainty Estimation and Sample Selection for Crowd Counting
We present a method for image-based crowd counting, one that can predict...
read it
-
Distribution Matching for Crowd Counting
In crowd counting, each training image contains multiple people, where e...
read it
-
A Study of Human Gaze Behavior During Visual Crowd Counting
In this paper, we describe our study on how humans allocate their attent...
read it
-
Predicting Goal-directed Human Attention Using Inverse Reinforcement Learning
Being able to predict human gaze behavior has obvious importance for beh...
read it
-
Predicting Goal-directed Attention Control Using Inverse-Reinforcement Learning
Understanding how goal states control behavior is a question ripe for in...
read it
-
Visual Understanding of Multiple Attributes Learning Model of X-Ray Scattering Images
This extended abstract presents a visualization system, which is designe...
read it
-
Attentive Action and Context Factorization
We propose a method for human action recognition, one that can localize ...
read it
-
Contextual Attention for Hand Detection in the Wild
We present Hand-CNN, a novel convolutional network architecture for dete...
read it
-
Back to the Future: Knowledge Distillation for Human Action Anticipation
We consider the task of training a neural network to anticipate human ac...
read it
-
BusyHands: A Hand-Tool Interaction Database for Assembly Tasks Semantic Segmentation
Visual segmentation has seen tremendous advancement recently with ready ...
read it
-
GIF2Video: Color Dequantization and Temporal Interpolation of GIF images
Graphics Interchange Format (GIF) is a highly portable graphics format t...
read it
-
Fake Sentence Detection as a Training Task for Sentence Encoding
Sentence encoders are typically trained on language modeling tasks which...
read it
-
Iterative Crowd Counting
In this work, we tackle the problem of crowd counting in images. We pres...
read it
-
A+D-Net: Shadow Detection with Adversarial Shadow Attenuation
Single image shadow detection is a very challenging problem because of t...
read it
-
X-ray Scattering Image Classification Using Deep Learning
Visual inspection of x-ray scattering images is a powerful technique for...
read it
-
Latent Bi-constraint SVM for Video-based Object Recognition
We address the task of recognizing objects from video input. This import...
read it
-
Improving Human Action Recognition by Non-action Classification
In this paper we consider the task of recognizing human actions in reali...
read it