
-
CVC: Contrastive Learning for Non-parallel Voice Conversion
Cycle consistent generative adversarial network (CycleGAN) and variation...
read it
-
Unsupervised Monocular Depth Learning in Dynamic Scenes
We present a method for jointly training the estimation of depth, ego-mo...
read it
-
CLOUD: Contrastive Learning of Unsupervised Dynamics
Developing agents that can perform complex control tasks from high dimen...
read it
-
LID 2020: The Learning from Imperfect Data Challenge Results
Learning from imperfect data becomes an issue in many industrial applica...
read it
-
SEMI: Self-supervised Exploration via Multisensory Incongruity
Efficient exploration is a long-standing problem in reinforcement learni...
read it
-
ServiceNet: A P2P Service Network
Given a large number of online services on the Internet, from time to ti...
read it
-
Multivariate Time-series Anomaly Detection via Graph Attention Network
Anomaly detection on multivariate time-series is of great importance in ...
read it
-
TNT: Target-driveN Trajectory Prediction
Predicting the future behavior of moving agents is essential for real wo...
read it
-
Online 3D Bin Packing with Constrained Deep Reinforcement Learning
We solve a challenging yet practically useful variant of 3D Bin Packing ...
read it
-
A Data Streaming Process Framework for Autonomous Driving By Edge
In recent years, with the rapid development of sensing technology and th...
read it
-
VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation
Behavior prediction in dynamic, multi-agent systems is an important prob...
read it
-
Music Gesture for Visual Sound Separation
Recent deep learning approaches have achieved impressive performance on ...
read it
-
AlignNet: A Unifying Approach to Audio-Visual Alignment
We present AlignNet, a model that synchronizes videos with reference aud...
read it
-
Neural network with data augmentation in multi-objective prediction of multi-stage pump
A multi-objective prediction method of multi-stage pump method based on ...
read it
-
Scalability in Perception for Autonomous Driving: Waymo Open Dataset
The research community has increasing interest in autonomous driving res...
read it
-
Scalability in Perception for Autonomous Driving: An Open Dataset Benchmark
The research community has increasing interest in autonomous driving res...
read it
-
Self-supervised Moving Vehicle Tracking with Stereo Sound
Humans are able to localize objects in the environment using both visual...
read it
-
Active Scene Understanding via Online Semantic Reconstruction
We propose a novel approach to robot-operated active understanding of un...
read it
-
Self-Supervised Audio-Visual Co-Segmentation
Segmenting objects in images and separating sound sources in audio are c...
read it
-
The Sound of Motions
Sounds originate from object motions and vibrations of surrounding air. ...
read it
-
The Sound of Pixels
We introduce PixelPlayer, a system that, by leveraging large amounts of ...
read it
-
SLAC: A Sparsely Labeled Dataset for Action Classification and Localization
This paper describes a procedure for the creation of large-scale video d...
read it
-
Open Vocabulary Scene Parsing
Recognizing arbitrary objects in the wild has been a challenging problem...
read it
-
Semantic Understanding of Scenes through the ADE20K Dataset
Scene parsing, or recognizing and segmenting objects and stuff in an ima...
read it
-
Loss Functions for Neural Networks for Image Processing
Neural networks are becoming central in several areas of computer vision...
read it