
-
Lexically-constrained Text Generation through Commonsense Knowledge Extraction and Injection
Conditional text generation has been a challenging task that is yet to s...
-
Exploring the Hierarchy in Relation Labels for Scene Graph Generation
By assigning each relationship a single label, current approaches formul...
-
Recognizing Video Events with Varying Rhythms
Recognizing video events in long, complex videos with multiple sub-activ...
-
PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph
Despite some exciting progress on high-quality image generation from str...
-
Disentangling Pose from Appearance in Monochrome Hand Images
Hand pose estimation from a monocular 2D image is challenging due to t...
-
Perceive Where to Focus: Learning Visibility-aware Part-level Features for Partial Person Re-identification
This paper considers a realistic problem in person re-identification (re...
-
Plan-Recognition-Driven Attention Modeling for Visual Recognition
Human visual recognition of activities or external agents involves an in...
-
Mean Local Group Average Precision (mLGAP): A New Performance Metric for Hashing-based Retrieval
The research on hashing techniques for visual data is gaining increased ...
-
Question-Guided Hybrid Convolution for Visual Question Answering
In this paper, we propose a novel Question-Guided Hybrid Convolution (QG...
-
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation
Generating a scene graph to describe all the relations inside an image gai...
-
Training Neural Networks by Using Power Linear Units (PoLUs)
In this paper, we introduce "Power Linear Unit" (PoLU) which increases t...
-
Recognizing Plans by Learning Embeddings from Observed Action Distributions
Recent advances in visual activity recognition have raised the possibili...
-
Semantically Consistent Image Completion with Fine-grained Details
Image completion has achieved significant progress due to advances in ge...
-
Visual Question Generation as Dual Task of Visual Question Answering
Recently, visual question answering (VQA) and visual question generation ...
-
Scene Graph Generation from Objects, Phrases and Region Captions
Object detection, scene graph generation and region captioning, which ar...
-
ViP-CNN: Visual Phrase Guided Convolutional Neural Network
As the intermediate level task connecting image captioning and object de...