
-
The NPU System for the 2020 Personalized Voice Trigger Challenge
This paper describes the system developed by the NPU team for the 2020 p...
read it
-
EEGFuseNet: Hybrid Unsupervised Deep Feature Characterization and Fusion for High-Dimensional EEG with An Application to Emotion Recognition
How to effectively and efficiently extract valid and reliable features f...
read it
-
Failure Prediction in Production Line Based on Federated Learning: An Empirical Study
Data protection across organizations is limiting the application of cent...
read it
-
Few-shot Action Recognition with Prototype-centered Attentive Learning
Few-shot action recognition aims to recognize action classes with few tr...
read it
-
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Most recent semantic segmentation methods adopt a fully-convolutional ne...
read it
-
Sub-sampled Cross-component Prediction for Emerging Video Coding Standards
Cross-component linear model (CCLM) prediction has been repeatedly prove...
read it
-
Unifying Homophily and Heterophily Network Transformation via Motifs
Higher-order proximity (HOP) is fundamental for most network embedding m...
read it
-
Hop-Hop Relation-aware Graph Neural Networks
Graph Neural Networks (GNNs) are widely used in graph representation lea...
read it
-
A Novel 3D Non-Stationary Multi-Frequency Multi-Link Wideband MIMO Channel Model
In this paper, a multi-frequency multi-link three-dimensional (3D) non-s...
read it
-
A Systematic Literature Review on Federated Learning: From A Model Quality Perspective
As an emerging technique, Federated Learning (FL) can jointly train a gl...
read it
-
Boundary-sensitive Pre-training for Temporal Localization in Videos
Many video analysis tasks require temporal localization thus detection o...
read it
-
Direct Classification of Emotional Intensity
In this paper, we present a model that can directly predict emotion inte...
read it
-
Towards Efficient Scene Understanding via Squeeze Reasoning
Graph-based convolutional model such as non-local block has shown to be ...
read it
-
Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition
Humans can easily recognize actions with only a few examples given, whil...
read it
-
LID 2020: The Learning from Imperfect Data Challenge Results
Learning from imperfect data becomes an issue in many industrial applica...
read it
-
Towards Optimal Filter Pruning with Balanced Performance and Pruning Speed
Filter pruning has drawn more attention since resource constrained platf...
read it
-
Holistic Grid Fusion Based Stop Line Estimation
Intersection scenarios provide the most complex traffic situations in Au...
read it
-
Small but Mighty: New Benchmarks for Split and Rephrase
Split and Rephrase is a text simplification task of rewriting a complex ...
read it
-
Reasoning about Goals, Steps, and Temporal Ordering with WikiHow
We propose a suite of reasoning tasks on two types of relations between ...
read it
-
Intent Detection with WikiHow
Modern task-oriented dialog systems need to reliably understand users' i...
read it
-
Dual-constrained Deep Semi-Supervised Coupled Factorization Network with Enriched Prior
Nonnegative matrix factorization is usually powerful for learning the "s...
read it
-
Low Complexity Trellis-Coded Quantization in Versatile Video Coding
The forthcoming Versatile Video Coding (VVC) standard adopts the trellis...
read it
-
Spatial Language Representation with Multi-Level Geocoding
We present a multi-level geocoding model (MLG) that learns to associate ...
read it
-
Reversing the cycle: self-supervised deep stereo through enhanced monocular distillation
In many fields, self-supervised learning solutions are rapidly evolving ...
read it
-
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge
This paper describes the NPU system submitted to Interspeech 2020 Far-Fi...
read it
-
A Survey on Concept Factorization: From Shallow to Deep Representation Learning
The quality of learned features by representation learning determines th...
read it
-
Improving Semantic Segmentation via Decoupled Body and Edge Supervision
Existing semantic segmentation approaches either aim to improve the obje...
read it
-
A novel deep learning-based method for monochromatic image synthesis from spectral CT using photon-counting detectors
With the growing technology of photon-counting detectors (PCD), spectral...
read it
-
XingGAN for Person Image Generation
We propose a novel Generative Adversarial Network (XingGAN or CrossingGA...
read it
-
How to trust unlabeled data? Instance Credibility Inference for Few-Shot Learning
Deep learning based models have excelled in many computer vision task an...
read it
-
Egocentric Action Recognition by Video Attention and Temporal Context
We present the submission of Samsung AI Centre Cambridge to the CVPR2020...
read it
-
PriceAggregator: An Intelligent System for Hotel Price Fetching
This paper describes the hotel price aggregation system - PriceAggregato...
read it
-
Self-supervised Video Object Segmentation
The objective of this paper is self-supervised representation learning, ...
read it
-
Long-Term Cloth-Changing Person Re-identification
Person re-identification (Re-ID) aims to match a target person across ca...
read it
-
SentPWNet: A Unified Sentence Pair Weighting Network for Task-specific Sentence Embedding
Pair-based metric learning has been widely adopted to learn sentence emb...
read it
-
Style Normalization and Restitution for Generalizable Person Re-identification
Existing fully-supervised person re-identification (ReID) methods usuall...
read it
-
Neural Collaborative Filtering vs. Matrix Factorization Revisited
Embedding based models have been the state of the art in collaborative f...
read it
-
A Survey on Deep Learning for Neuroimaging-based Brain Disorder Analysis
Deep learning has been recently used for the analysis of neuroimages, su...
read it
-
3D Printed Brain-Controlled Robot-Arm Prosthetic via Embedded Deep Learning from sEMG Sensors
In this paper, we present our work on developing robot arm prosthetic vi...
read it
-
In-Vehicle Object Detection in the Wild for Driverless Vehicles
In-vehicle human object identification plays an important role in vision...
read it
-
Learning to fool the speaker recognition
Due to the widespread deployment of fingerprint/face/speaker recognition...
read it
-
Universal Adversarial Perturbations Generative Network for Speaker Recognition
Attacking deep learning based biometric systems has drawn more and more ...
read it
-
Direct Speech-to-image Translation
Direct speech-to-image translation without text is an interesting and us...
read it
-
Strip Pooling: Rethinking Spatial Pooling for Scene Parsing
Spatial pooling has been proven highly effective in capturing long-range...
read it
-
Instance Credibility Inference for Few-Shot Learning
Few-shot learning (FSL) aims to recognize new objects with extremely lim...
read it
-
What Deep CNNs Benefit from Global Covariance Pooling: An Optimization Perspective
Recent works have demonstrated that global covariance pooling (GCP) has ...
read it
-
Feedback Graph Convolutional Network for Skeleton-based Action Recognition
Skeleton-based action recognition has attracted considerable attention i...
read it
-
Superbloom: Bloom filter meets Transformer
We extend the idea of word pieces in natural language models to machine ...
read it
-
Selective Convolutional Network: An Efficient Object Detector with Ignoring Background
It is well known that attention mechanisms can effectively improve the p...
read it
-
Semantic Discord: Finding Unusual Local Patterns for Time Series
Finding anomalous subsequence in a long time series is a very important ...
read it