
-
Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges
In the past decade, object detection has achieved significant progress i...
read it
-
Are Top School Students More Critical of Their Professors? Mining Comments on RateMyProfessor.com
Student reviews and comments on RateMyProfessor.com reflect realistic le...
read it
-
DAIL: Dataset-Aware and Invariant Learning for Face Recognition
To achieve good performance in face recognition, a large scale training ...
read it
-
Semantic Layout Manipulation with High-Resolution Sparse Attention
We tackle the problem of semantic image layout manipulation, which aims ...
read it
-
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption
In this paper, we propose Text-Aware Pre-training (TAP) for Text-VQA and...
read it
-
Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language
We address the problem of retrieving a specific moment from an untrimmed...
read it
-
XraySyn: Realistic View Synthesis From a Single Radiograph Through CT Priors
A radiograph visualizes the internal anatomy of a patient through the us...
read it
-
Slender Object Detection: Diagnoses and Improvements
In this paper, we are concerned with the detection of a particular type ...
read it
-
Content-based Analysis of the Cultural Differences between TikTok and Douyin
Short-form video social media shifts away from the traditional media par...
read it
-
Face Off: Polarized Public Opinions on Personal Face Mask Usage during the COVID-19 Pandemic
In spite of a growing body of scientific evidence on the effectiveness o...
read it
-
Pose-based Body Language Recognition for Emotion and Psychiatric Symptom Interpretation
Inspired by the human ability to infer emotions from body language, we p...
read it
-
Understanding the Hoarding Behaviors during the COVID-19 Pandemic using Large Scale Social Media Data
The COVID-19 pandemic has affected people's lives around the world at a ...
read it
-
Predicting Parkinson's Disease with Multimodal Irregularly Collected Longitudinal Smartphone Data
Parkinsons Disease is a neurological disorder and prevalent in elderly p...
read it
-
Region Comparison Network for Interpretable Few-shot Image Classification
While deep learning has been successfully applied to many real-world com...
read it
-
Dynamic Context-guided Capsule Network for Multimodal Machine Translation
Multimodal machine translation (MMT), which mainly focuses on enhancing ...
read it
-
Learning to Localize Actions from Moments
With the knowledge of action moments (i.e., trimmed video clips that eac...
read it
-
A Smartphone-based System for Real-time Early Childhood Caries Diagnosis
Early childhood caries (ECC) is the most common, yet preventable chronic...
read it
-
Improving One-stage Visual Grounding by Recursive Sub-query Construction
We improve one-stage visual grounding by addressing current limitations ...
read it
-
Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification
Visible-infrared person re-identification (VI-ReID) is a challenging cro...
read it
-
A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation
Multi-modal neural machine translation (NMT) aims to translate source se...
read it
-
Universal Model for Multi-Domain Medical Image Retrieval
Medical Image Retrieval (MIR) helps doctors quickly find similar patient...
read it
-
Task-agnostic Temporally Consistent Facial Video Editing
Recent research has witnessed the advances in facial image editing tasks...
read it
-
Monitoring Depression Trend on Twitter during the COVID-19 Pandemic
The COVID-19 pandemic has severely affected people's daily lives and cau...
read it
-
Global Image Sentiment Transfer
Transferring the sentiment of an image is an unexplored research topic i...
read it
-
Image Sentiment Transfer
In this work, we introduce an important but still unexplored research ta...
read it
-
Real-time Universal Style Transfer on High-resolution Images via Zero-channel Pruning
Extracting effective deep features to represent content and style inform...
read it
-
Personalized Fashion Recommendation from Personal Social Media Data: An Item-to-Set Metric Learning Approach
With the growth of online shopping for fashion products, accurate fashio...
read it
-
On Vocabulary Reliance in Scene Text Recognition
The pursuit of high performance on public benchmarks has been the drivin...
read it
-
Unsupervised Real-world Low-light Image Enhancement with Decoupled Networks
Conventional learning-based approaches to low-light image enhancement ty...
read it
-
In the Eyes of the Beholder: Sentiment and Topic Analyses on Social Media Use of Neutral and Controversial Terms for COVID-19
During the COVID-19 pandemic, "Chinese Virus" emerged as a controversial...
read it
-
The Ivory Tower Lost: How College Students Respond Differently than the General Public to the COVID-19 Pandemic
Recently, the pandemic of the novel Coronavirus Disease-2019 (COVID-19) ...
read it
-
Alleviating the Incompatibility between Cross Entropy Loss and Episode Training for Few-shot Skin Disease Classification
Skin disease classification from images is crucial to dermatological dia...
read it
-
Example-Guided Image Synthesis across Arbitrary Scenes using Masked Spatial-Channel Attention and Self-Supervision
Example-guided image synthesis has recently been attempted to synthesize...
read it
-
Structured Landmark Detection via Topology-Adapting Deep Graph Learning
Image landmark detection aims to automatically identify the locations of...
read it
-
Unsupervised Learning of Landmarks based on Inter-Intra Subject Consistencies
We present a novel unsupervised learning approach to image landmark disc...
read it
-
TuiGAN: Learning Versatile Image-to-Image Translation with Two Unpaired Images
An unsupervised image-to-image translation (UI2I) task deals with learni...
read it
-
Learning a Weakly-Supervised Video Actor-Action Segmentation Model with a Wise Selection
We address weakly-supervised video actor-action segmentation (VAAS), whi...
read it
-
Video-based Person Re-Identification using Gated Convolutional Recurrent Neural Networks
Deep neural networks have been successfully applied to solving the video...
read it
-
Unifying Specialist Image Embedding into Universal Image Embedding
Deep image embedding provides a way to measure the semantic similarity o...
read it
-
Adaptive Offline Quintuplet Loss for Image-Text Matching
Existing image-text matching approaches typically leverage triplet loss ...
read it
-
Anatomy-aware 3D Human Pose Estimation in Videos
In this work, we propose a new solution for 3D human pose estimation in ...
read it
-
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching
Existing image-text matching approaches typically infer the similarity o...
read it
-
Mi YouTube es Su YouTube? Analyzing the Cultures using YouTube Thumbnails of Popular Videos
YouTube, a world-famous video sharing website, maintains a list of the t...
read it
-
#MeToo on Campus: Studying College Sexual Assault at Scale Using Data Reported on Social Media
Recently, the emergence of the #MeToo trend on social media has empowere...
read it
-
Fine-grained Image-to-Image Transformation towards Visual Recognition
Existing image-to-image transformation approaches primarily focus on syn...
read it
-
Measuring Women Representation and Impact in Films over Time
Women have always been underrepresented in movies and not until recently...
read it
-
Neural Simile Recognition with Cyclic Multitask Learning and Local Attention
Simile recognition is to detect simile sentences and to extract simile c...
read it
-
TransMatch: A Transfer-Learning Scheme for Semi-Supervised Few-Shot Learning
The successful application of deep learning to many visual recognition t...
read it
-
Iterative Dual Domain Adaptation for Neural Machine Translation
Previous studies on the domain adaptation for neural machine translation...
read it
-
Graph-based Neural Sentence Ordering
Sentence ordering is to restore the original paragraph from a set of sen...
read it