
-
Animating Arbitrary Objects via Deep Motion Transfer
This paper introduces a novel deep learning framework for image animatio...
read it
-
Laplace Landmark Localization
Landmark localization in images and videos is a classic problem solved i...
read it
-
3D Hand Shape and Pose Estimation from a Single RGB Image
This work addresses a novel and challenging problem of estimating the fu...
read it
-
A Caption Is Worth A Thousand Images: Investigating Image Captions for Multimodal Named Entity Recognition
Multimodal named entity recognition (MNER) requires to bridge the gap be...
read it
-
Y-Autoencoders: disentangling latent representations via sequential-encoding
In the last few years there have been important advancements in generati...
read it
-
Robust Emotion Recognition from Low Quality and Low Bit Rate Video: A Deep Learning Approach
Emotion recognition from facial expressions is tremendously useful, espe...
read it
-
Efficient Video Object Segmentation via Network Modulation
Video object segmentation targets at segmenting a specific object throug...
read it
-
Deep Regionlets for Object Detection
A key challenge in generic object detection is being to handle large var...
read it
-
Hybrid VAE: Improving Deep Generative Models using Partial Observations
Deep neural network models trained on large labeled datasets are the sta...
read it
-
Deep Reinforcement Learning-based Image Captioning with Embedding Reward
Image captioning is a challenging problem owing to the complexity in und...
read it
-
SEP-Nets: Small and Effective Pattern Networks
While going deeper has been witnessed to improve the performance of conv...
read it
-
Unsupervised Domain Adaptation for 3D Keypoint Prediction from a Single Depth Scan
In this paper, we introduce a novel unsupervised domain adaptation techn...
read it
-
Dense Captioning with Joint Inference and Visual Context
Dense captioning is a newly emerging computer vision topic for understan...
read it
-
Learning 3D-FilterMap for Deep Convolutional Neural Networks
We present a novel and compact architecture for deep Convolutional Neura...
read it
-
mvn2vec: Preservation and Collaboration in Multi-View Network Embedding
Multi-view networks are ubiquitous in real-world applications. In order ...
read it
-
Multimodal Named Entity Recognition for Short Social Media Posts
We introduce a new task called Multimodal Named Entity Recognition (MNER...
read it
-
StarMap for Category-Agnostic Keypoint and Viewpoint Estimation
Semantic keypoints provide concise abstractions for a variety of visual ...
read it
-
Semi-supervised Content-based Detection of Misinformation via Tensor Embeddings
Fake news may be intentionally created to promote economic, political an...
read it
-
Learn to Combine Modalities in Multimodal Deep Learning
Combining complementary information from multiple modalities is intuitiv...
read it
-
Differentially-Private "Draw and Discard" Machine Learning
In this work, we propose a novel framework for privacy-preserving client...
read it
-
False Discovery Rate Controlled Heterogeneous Treatment Effect Detection for Online Controlled Experiments
Online controlled experiments (a.k.a. A/B testing) have been used as the...
read it
-
Wide Activation for Efficient and Accurate Image Super-Resolution
In this report we demonstrate that with same parameters and computationa...
read it
-
YouTube-VOS: Sequence-to-Sequence Video Object Segmentation
Learning long-term spatial-temporal features are critical for many video...
read it
-
YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark
Learning long-term spatial-temporal features are critical for many video...
read it
-
Deep neural network based i-vector mapping for speaker verification using short utterances
Text-independent speaker recognition using short utterances is a highly ...
read it
-
Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection
In this paper, we propose a novel object detection algorithm named "Deep...
read it
-
Analyzing the Use of Camera Glasses in the Wild
Camera glasses enable people to capture point-of-view videos using a com...
read it
-
Singing voice conversion with non-parallel data
Singing voice conversion is a task to convert a song sang by a source si...
read it
-
Impact of Contextual Factors on Snapchat Public Sharing
Public sharing is integral to online platforms. This includes the popula...
read it
-
Train One Get One Free: Partially Supervised Neural Network for Bug Report Duplicate Detection and Clustering
Tracking user reported bugs requires considerable engineering effort in ...
read it
-
Animo: Sharing Biosignals on a Smartwatch for Lightweight Social Connection
We present Animo, a smartwatch app that enables people to share and view...
read it
-
EVA: Generating Emotional Behavior of Virtual Agents using Expressive Features of Gait and Gaze
We present a novel, real-time algorithm, EVA, for generating virtual age...
read it
-
Blocks: Collaborative and Persistent Augmented Reality Experiences
We introduce Blocks, a mobile application that enables people to co-crea...
read it
-
Anchor Tasks: Inexpensive, Shared, and Aligned Tasks for Domain Adaptation
We introduce a novel domain adaptation formulation from synthetic datase...
read it
-
SilceNDice: Mining Suspicious Multi-attribute Entity Groups with Multi-view Graphs
Given the reach of web platforms, bad actors have considerable incentive...
read it
-
SliceNDice: Mining Suspicious Multi-attribute Entity Groups with Multi-view Graphs
Given the reach of web platforms, bad actors have considerable incentive...
read it
-
Neural Rule Grounding for Low-Resource Relation Extraction
While deep neural models have gained successes on information extraction...
read it
-
I Know You'll Be Back: Interpretable New User Clustering and Churn Prediction on a Mobile Social Application
As online platforms are striving to get more users, a critical challenge...
read it
-
First Order Motion Model for Image Animation
Image animation consists of generating a video sequence so that an objec...
read it
-
Sifter: A Hybrid Workflow for Theme-based Video Curation at Scale
User-generated content platforms curate their vast repositories into the...
read it
-
HiJoD: Semi-Supervised Multi-aspect Detection of Misinformation using Hierarchical Joint Decomposition
Distinguishing between misinformation and real information is one of the...
read it
-
Knowing your FATE: Friendship, Action and Temporal Explanations for User Engagement Prediction on Social Apps
With the rapid growth and prevalence of social network applications (App...
read it
-
Revisiting visual-inertial structure from motion for odometry and SLAM initialization
In this paper, an efficient closed-form solution for the state initializ...
read it
-
Social App Accessibility for Deaf Signers
Social media platforms support the sharing of written text, video, and a...
read it
-
Renormalization for Initialization of Rolling Shutter Visual-Inertial Odometry
In this paper we deal with the initialization problem of a visual-inerti...
read it
-
Ego-Motion Alignment from Face Detections for Collaborative Augmented Reality
Sharing virtual content among multiple smart glasses wearers is an essen...
read it
-
Identifying Misinformation from Website Screenshots
Can the look and the feel of a website give information about the trustw...
read it