
-
Image Sentiment Transfer
In this work, we introduce an important but still unexplored research ta...
read it
-
TailorGAN: Making User-Defined Fashion Designs
Attribute editing has become an important and emerging topic of computer...
read it
-
UR-FUNNY: A Multimodal Language Dataset for Understanding Humor
Humor is a unique and creative communicative behavior displayed during s...
read it
-
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching
Existing image-text matching approaches typically infer the similarity o...
read it
-
Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution
In this paper, we explore the space-time video super-resolution task, wh...
read it
-
Learning a Weakly-Supervised Video Actor-Action Segmentation Model with a Wise Selection
We address weakly-supervised video actor-action segmentation (VAAS), whi...
read it
-
TransMatch: A Transfer-Learning Scheme for Semi-Supervised Few-Shot Learning
The successful application of deep learning to many visual recognition t...
read it
-
Semantic Neural Machine Translation using AMR
It is intuitive that semantic representations can be useful for machine ...
read it
-
Anatomy-aware 3D Human Pose Estimation in Videos
In this work, we propose a new solution for 3D human pose estimation in ...
read it
-
Exploiting Temporal Relationships in Video Moment Localization with Natural Language
We address the problem of video moment localization with natural languag...
read it
-
Stimulating Creativity with FunLines: A Case Study of Humor Generation in Headlines
Building datasets of creative text, such as humor, is quite challenging....
read it
-
CariGAN: Caricature Generation through Weakly Paired Adversarial Learning
Caricature generation is an interesting yet challenging task. The primar...
read it
-
Deep Audio Prior
Deep convolutional neural networks are known to specialize in distilling...
read it
-
Grounding-Tracking-Integration
In this paper, we study tracking by language that localizes the target b...
read it
-
Weakly Supervised Object Localization with Inter-Intra Regulated CAMs
Weakly supervised object localization (WSOL) aims to locate objects in i...
read it
-
Real-time Universal Style Transfer on High-resolution Images via Zero-channel Pruning
Extracting effective deep features to represent content and style inform...
read it
-
Hierarchical Cross-Modal Talking Face Generationwith Dynamic Pixel-Wise Loss
We devise a cascade GAN approach to generate talking face video, which i...
read it
-
TuiGAN: Learning Versatile Image-to-Image Translation with Two Unpaired Images
An unsupervised image-to-image translation (UI2I) task deals with learni...
read it
-
Fast Universal Style Transfer for Artistic and Photorealistic Rendering
Universal style transfer is an image editing task that renders an input ...
read it
-
M-BERT: Injecting Multimodal Information in the BERT Structure
Multimodal language analysis is an emerging research area in natural lan...
read it
-
Example-Guided Scene Image Synthesis using Masked Spatial-Channel Attention and Patch-Based Self-Supervision
Example-guided image synthesis has been recently attempted to synthesize...
read it
-
Learning from Interventions using Hierarchical Policies for Safe Learning
Learning from Demonstrations (LfD) via Behavior Cloning (BC) works well ...
read it
-
What comprises a good talking-head video generation?: A Survey and Benchmark
Over the years, performance evaluation has become essential in computer ...
read it
-
On Vocabulary Reliance in Scene Text Recognition
The pursuit of high performance on public benchmarks has been the drivin...
read it
-
SemEval-2020 Task 7: Assessing Humor in Edited News Headlines
This paper describes the SemEval-2020 shared task "Assessing Humor in Ed...
read it
-
Video Re-localization
Many methods have been developed to help people find the video contents ...
read it
-
History-Aware Question Answering in a Blocks World Dialogue System
It is essential for dialogue-based spatial reasoning systems to maintain...
read it
-
Global Image Sentiment Transfer
Transferring the sentiment of an image is an unexplored research topic i...
read it
-
Efficient non-conjugate Gaussian process factor models for spike count data using polynomial approximations
Gaussian Process Factor Analysis (GPFA) has been broadly applied to the ...
read it
-
Generative Mask Pyramid Network forCT/CBCT Metal Artifact Reduction with Joint Projection-Sinogram Correction
A conventional approach to computed tomography (CT) or cone beam CT (CBC...
read it
-
Large-scale Tag-based Font Retrieval with Generative Feature Learning
Font selection is one of the most important steps in a design workflow. ...
read it
-
Assembling Semantically-Disentangled Representations for Predictive-Generative Models via Adaptation from Synthetic Domain
Deep neural networks can form high-level hierarchical representations of...
read it
-
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing
In this paper, we introduce a new problem, named audio-visual video pars...
read it
-
Revealing patterns in HIV viral load data and classifying patients via a novel machine learning cluster summarization method
HIV RNA viral load (VL) is an important outcome variable in studies of H...
read it
-
Predicting Acute Kidney Injury at Hospital Re-entry Using High-dimensional Electronic Health Record Data
Acute Kidney Injury (AKI), a sudden decline in kidney function, is assoc...
read it
-
Navigation by Imitation in a Pedestrian-Rich Environment
Deep neural networks trained on demonstrations of human actions give rob...
read it
-
ADN: Artifact Disentanglement Network for Unsupervised Metal Artifact Reduction
Current deep neural network based approaches to computed tomography (CT)...
read it
-
Adaptive Offline Quintuplet Loss for Image-Text Matching
Existing image-text matching approaches typically leverage triplet loss ...
read it
-
Learning to Generate Time-Lapse Videos Using Multi-Stage Dynamic Generative Adversarial Networks
Taking a photo outside, can we predict the immediate future, e.g., how w...
read it
-
Vehicle Tracking in Wide Area Motion Imagery via Stochastic Progressive Association Across Multiple Frames (SPAAM)
Vehicle tracking in Wide Area Motion Imagery (WAMI) relies on associatin...
read it
-
An Interactive Greedy Approach to Group Sparsity in High Dimension
Sparsity learning with known grouping structures has received considerab...
read it
-
What the Language You Tweet Says About Your Occupation
Many aspects of people's lives are proven to be deeply connected to thei...
read it
-
On The Projection Operator to A Three-view Cardinality Constrained Set
The cardinality constraint is an intrinsic way to restrict the solution ...
read it
-
Cultural Diffusion and Trends in Facebook Photographs
Online social media is a social vehicle in which people share various mo...
read it
-
TricorNet: A Hybrid Temporal Convolutional and Recurrent Network for Video Action Segmentation
Action segmentation as a milestone towards building automatic systems to...
read it
-
A Computational Framework for Nonlinear Dimensionality Reduction of Large Data Sets: The Exploratory Inspection Machine (XIM)
In this paper, we present a novel computational framework for nonlinear ...
read it
-
The ZipML Framework for Training Models with End-to-End Low Precision: The Cans, the Cannots, and a Little Bit of Deep Learning
Recently there has been significant interest in training machine-learnin...
read it
-
Inferring Fine-grained Details on User Activities and Home Location from Social Media: Detecting Drinking-While-Tweeting Patterns in Communities
Nearly all previous work on geo-locating latent states and activities fr...
read it
-
Towards Automatic Learning of Procedures from Web Instructional Videos
The potential for agents, whether embodied or software, to learn by obse...
read it
-
Inferring Restaurant Styles by Mining Crowd Sourced Photos from User-Review Websites
When looking for a restaurant online, user uploaded photos often give pe...
read it