
-
Study On Coding Tools Beyond Av1
The Alliance for Open Media has recently initiated coding tool explorati...
read it
-
SSD-GAN: Measuring the Realness in the Spatial and Spectral Domains
This paper observes that there is an issue of high frequencies missing i...
read it
-
Fast and Robust Bin-picking System for Densely Piled Industrial Objects
Objects grasping, also known as the bin-picking, is one of the most comm...
read it
-
Overview of Screen Content Coding in Recently Developed Video Coding Standards
In recent years, screen content (SC) video including computer generated ...
read it
-
FeatherTTS: Robust and Efficient attention based Neural TTS
Attention based neural TTS is elegant speech synthesis pipeline and has ...
read it
-
Unsupervised Point Cloud Registration via Salient Points Analysis (SPA)
An unsupervised point cloud registration method, called salient points a...
read it
-
Unsupervised Feedforward Feature (UFF) Learning for Point Cloud Classification and Segmentation
In contrast to supervised backpropagation-based feature learning in deep...
read it
-
Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation
Pose-guided person image generation and animation aim to transform a sou...
read it
-
Toward Zero-Shot Unsupervised Image-to-Image Translation
Recent studies have shown remarkable success in unsupervised image-to-im...
read it
-
Learning Model-Blind Temporal Denoisers without Ground Truths
Denoisers trained with synthetic data often fail to cope with the divers...
read it
-
AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN
This paper investigates how to leverage a DurIAN-based average model to ...
read it
-
FeatherWave: An efficient high-fidelity neural vocoder with multi-band linear prediction
In this paper, we propose the FeatherWave, yet another variant of WaveRN...
read it
-
PointHop++: A Lightweight Learning Model on Point Sets for 3D Classification
The PointHop method was recently proposed by Zhang et al. for 3D point c...
read it
-
RoIMix: Proposal-Fusion among Multiple Images for Underwater Object Detection
Generic object detection algorithms have proven their excellent performa...
read it
-
Video-based compression for plenoptic point clouds
The plenoptic point cloud that has multiple colors from various directio...
read it
-
C3DVQA: Full-Reference Video Quality Assessment with 3D Convolutional Neural Network
Traditional video quality assessment (VQA) methods evaluate localized pi...
read it
-
Multi-mapping Image-to-Image Translation via Learning Disentanglement
Recent advances of image-to-image translation focus on learning the one-...
read it
-
StructureFlow: Image Inpainting via Structure-aware Appearance Flow
Image inpainting techniques have shown significant improvements by using...
read it
-
PointHop: An Explainable Machine Learning Method for Point Cloud Classification
An explainable machine learning method for point cloud classification, c...
read it
-
Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds
Point cloud is a fundamental 3D representation which is widely used in r...
read it
-
Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection
Video anomaly detection under weak labels is formulated as a typical mul...
read it
-
Occupancy-map-based rate distortion optimization for video-based point cloud compression
The state-of-the-art video-based point cloud compression scheme projects...
read it
-
Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder
Neural networks based vocoders, typically the WaveNet, have achieved spe...
read it
-
BLP - Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization
Despite tremendous progress achieved in temporal action detection, state...
read it