
-
A Dynamical View on Optimization Algorithms of Overparameterized Neural Networks
When equipped with efficient optimization algorithms, the over-parameter...
read it
-
FBNetV3: Joint Architecture-Recipe Search using Neural Acquisition Function
Neural Architecture Search (NAS) yields state-of-the-art neural networks...
read it
-
CPARR: Category-based Proposal Analysis for Referring Relationships
The task of referring relationships is to localize subject and object en...
read it
-
FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions
Differentiable Neural Architecture Search (DNAS) has demonstrated great ...
read it
-
Video Object Grounding using Semantic Roles in Language Description
We explore the task of Video Object Grounding (VOG), which grounds objec...
read it
-
Zero-Shot Grounding of Objects from Natural Language Queries
A phrase grounding system localizes a particular object in an image refe...
read it
-
Cascaded Parallel Filtering for Memory-Efficient Image-Based Localization
Image-based localization (IBL) aims to estimate the 6DOF camera pose for...
read it
-
Billion-scale semi-supervised learning for image classification
This paper presents a study of semi-supervised learning with large convo...
read it
-
MAC: Mining Activity Concepts for Language-based Temporal Localization
We address the problem of language-based temporal localization in untrim...
read it
-
CTAP: Complementary Temporal Action Proposal Generation
Temporal action proposal generation is an important task, akin to object...
read it
-
Motion-Appearance Co-Memory Networks for Video Question Answering
Video Question Answering (QA) is an important task in understanding vide...
read it
-
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
Given a natural language query, a phrase grounding system aims to locali...
read it
-
Query-guided Regression Network with Context Policy for Phrase Grounding
Given a textual description of an image, phrase grounding localizes obje...
read it
-
AMC: Attention guided Multi-modal Correlation Learning for Image Search
Given a user's query, traditional image search systems rank images accor...
read it
-
TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals
Temporal Action Proposal (TAP) generation is an important problem, as fa...
read it
-
Knowledge Graph Representation with Jointly Structural and Textual Encoding
The objective of knowledge graph embedding is to encode both entities an...
read it
-
Learning Word Embeddings from Intrinsic and Extrinsic Views
While word embeddings are currently predominant for natural language pro...
read it