
-
Can Everybody Sign Now? Exploring Sign Language Video Generation from 2D Poses
Recent work has addressed the generation of human poses represented by ...
-
RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation
The task of video object segmentation with referring expressions (langua...
-
Mask-guided sample selection for Semi-Supervised Instance Segmentation
Image segmentation methods are usually trained with pixel-level annotati...
-
How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language
Sign Language is the primary means of communication for the majority of ...
-
Curriculum Learning for Recurrent Video Object Segmentation
Video object segmentation can be understood as a sequence-to-sequence ta...
-
Transcription-Enriched Joint Embeddings for Spoken Descriptions of Images and Videos
In this work, we propose an effective approach for training unique embed...
-
Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills
Acquiring abilities in the absence of a task-oriented reward function is...
-
Recurrent Instance Segmentation using Sequences of Referring Expressions
The goal of this work is to segment the objects in an image that are ref...
-
Automatic Reminiscence Therapy for Dementia
With people living longer than ever, the number of cases with dementia s...
-
Hate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation
This work addresses the challenge of hate speech detection in Internet m...
-
Assessing Knee OA Severity with CNN attention-based end-to-end architectures
This work proposes a novel end-to-end convolutional neural network (CNN)...
-
Hyperparameter-Free Losses for Model-Based Monocular Reconstruction
This work proposes novel hyperparameter-free losses for single view 3D r...
-
Temporal recurrences for video saliency prediction
This paper investigates modifying an existing neural network architectur...
-
Simple vs complex temporal recurrences for video saliency prediction
This paper investigates modifying an existing neural network architectur...
-
Budget-aware Semi-Supervised Semantic and Instance Segmentation
Methods that move towards less supervised scenarios are key for image se...
-
Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks
Speech is a rich biometric signal that contains information about the id...
-
RVOS: End-to-End Recurrent Network for Video Object Segmentation
Multiple object video object segmentation is a challenging task, special...
-
Inverse Cooking: Recipe Generation from Food Images
People enjoy food photography because they appreciate food. Behind each ...
-
Importance Weighted Evolution Strategies
Evolution Strategies (ES) emerged as a scalable alternative to popular R...
-
PathGAN: Visual Scanpath Prediction with Generative Adversarial Networks
We introduce PathGAN, a deep neural network for visual scanpath predicti...
-
Temporal Saliency Adaptation in Egocentric Videos
This work adapts a deep neural model for image saliency prediction to th...
-
Comparing Fixed and Adaptive Computation Time for Recurrent Neural Networks
Adaptive Computation Time for Recurrent Neural Networks (ACT) is one of ...
-
Online Action Detection in Untrimmed, Streaming Videos - Modeling and Evaluation
The goal of Online Action Detection (OAD) is to detect action in a timel...
-
Cross-modal Embeddings for Video and Audio Retrieval
The increasing amount of online videos brings several opportunities for ...
-
Recurrent Neural Networks for Semantic Instance Segmentation
We present a recurrent model for semantic instance segmentation that seq...
-
Detection-aided liver lesion segmentation using deep learning
A fully automatic technique for segmenting the liver and localizing its ...
-
Saliency Weighted Convolutional Features for Instance Search
This work explores attention models to weight the contribution of local ...
-
Cost-Effective Active Learning for Melanoma Segmentation
We propose a novel Active Learning framework capable of training effectivel...
-
Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks
Recurrent Neural Networks (RNNs) continue to show outstanding performanc...
-
More cat than cute? Interpretable Prediction of Adjective-Noun Pairs
The increasing availability of affect-rich multimedia resources has bols...
-
Disentangling Motion, Foreground and Background Features in Videos
This paper introduces an unsupervised framework to extract semantically ...
-
SaltiNet: Scan-path Prediction on 360 Degree Images using Saliency Volumes
We introduce SaltiNet, a deep neural network for scanpath prediction tra...
-
Class-Weighted Convolutional Features for Visual Instance Search
Image retrieval in realistic scenarios targets large dynamic datasets of...
-
SalGAN: Visual Saliency Prediction with Generative Adversarial Networks
We introduce SalGAN, a deep convolutional neural network for visual sali...
-
Hierarchical Object Detection with Deep Reinforcement Learning
We present a method for performing hierarchical object detection in imag...
-
Open-Ended Visual Question-Answering
This thesis report studies methods to solve Visual Question-Answering (V...
-
Where is my Phone ? Personal Object Retrieval from Egocentric Images
This work presents a retrieval pipeline and evaluation scheme for the pr...
-
Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks
This thesis explores different approaches using Convolutional and Recurre...
-
Bags of Local Convolutional Features for Scalable Instance Search
This work proposes a simple instance retrieval pipeline based on encodin...
-
From Pixels to Sentiment: Fine-tuning CNNs for Visual Sentiment Prediction
Visual multimedia have become an inseparable part of our digital social ...
-
Shallow and Deep Convolutional Networks for Saliency Prediction
The prediction of salient areas in images has been traditionally address...
-
End-to-end Convolutional Network for Saliency Prediction
The prediction of saliency areas in images has been traditionally addres...
-
Improving Spatial Codification in Semantic Segmentation
This paper explores novel approaches for improving the spatial codificat...
-
Visual Summary of Egocentric Photostreams by Representative Keyframes
Building a visual summary from an egocentric photostream captured by a l...
-
Quality Control in Crowdsourced Object Segmentation
This paper explores processing techniques to deal with noisy data in cro...
-
Visual Information Retrieval in Endoscopic Video Archives
In endoscopic procedures, surgeons work with live video streams from the...
-
Cultural Event Recognition with Visual ConvNets and Temporal Models
This paper presents our contribution to the ChaLearn Challenge 2015 on C...
-
Exploring EEG for Object Detection and Retrieval
This paper explores the potential for using Brain Computer Interfaces (B...
-
Object Segmentation in Images using EEG Signals
This paper explores the potential of brain-computer interfaces in segmen...