
-
DialogBERT: Discourse-Aware Response Generation via Learning to Recover and Rank Utterances
Recent advances in pre-trained language models have significantly improv...
-
Context-Aware Answer Extraction in Question Answering
Extractive QA models have shown very promising performance in predicting...
-
ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language Understanding
Language model pre-training has shown promising results in various downs...
-
Which Strategies Matter for Noisy Label Classification? Insight into Loss and Uncertainty
Label noise is a critical factor that degrades the generalization perfor...
-
Self-supervised Auxiliary Learning with Meta-paths for Heterogeneous Graphs
Graph neural networks have shown superior performance in a wide range of...
-
CareCall: a Call-Based Active Monitoring Dialog Agent for Managing COVID-19 Pandemic
Tracking suspected cases of COVID-19 is crucial to suppressing the sprea...
-
Efficient Active Learning for Automatic Speech Recognition via Augmented Consistency Regularization
The cost of labeling transcriptions for large speech corpora becomes a b...
-
Slowing Down the Weight Norm Increase in Momentum-based Optimizers
Normalization techniques, such as batch normalization (BN), have led to ...
-
Graphs, Entities, and Step Mixture
Existing approaches for graph neural networks commonly suffer from the o...
-
ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact Centers
Automatic speech recognition (ASR) via call is essential for various app...
-
Modeling Musical Onset Probabilities via Neural Distribution Learning
Musical onset detection can be formulated as a time-to-event (TTE) or ti...
-
StarGAN v2: Diverse Image Synthesis for Multiple Domains
A good image-to-image translation model should learn a mapping between d...
-
Neural Approximation of an Auto-Regressive Process through Confidence Guided Sampling
We propose a generic confidence-based approximation that can be plugged ...
-
Which Ads to Show? Advertisement Image Assessment with Auxiliary Information via Multi-step Modality Fusion
Assessing aesthetic preference is a fundamental task related to human co...
-
Tripartite Heterogeneous Graph Propagation for Large-scale Social Recommendation
Graph Neural Networks (GNNs) have been emerging as a promising method fo...
-
Photorealistic Style Transfer via Wavelet Transforms
Recent style transfer models have provided promising artistic results. H...
-
Phase-aware Speech Enhancement with Deep Complex U-Net
Most deep learning-based models for speech enhancement have mainly focus...
-
Large-Scale Answerer in Questioner's Mind for Visual Dialog Question Generation
Answerer in Questioner's Mind (AQM) is an information-theoretic framewor...
-
Multi-Domain Processing via Hybrid Denoising Networks for Speech Enhancement
We present a hybrid framework that leverages the trade-off between tempo...
-
CHOPT: Automated Hyperparameter Optimization Framework for Cloud-Based Machine Learning Platforms
Many hyperparameter optimization (HyperOpt) methods assume restricted co...
-
NSML: Meet the MLaaS platform with a real-world case study
The boom of deep learning induced many industries and academies to intro...
-
Reinforcement Learning based Recommender System using Biclustering Technique
A recommender system aims to recommend items that a user is interested i...
-
NSML: A Machine Learning Platform That Enables You to Focus on Your Models
Machine learning libraries such as TensorFlow and PyTorch simplify model...
-
Automatic Music Highlight Extraction using Convolutional Recurrent Attention Networks
Music highlights are valuable contents for music services. Most methods ...
-
Highrisk Prediction from Electronic Medical Records via Deep Attention Networks
Predicting highrisk vascular diseases is a significant issue in the medi...
-
StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
Recent studies have shown remarkable success in image-to-image translati...
-
Representation Learning of Music Using Artist Labels
Recently, feature representation by learning algorithms has drawn great ...
-
Energy-Based Sequence GANs for Recommendation and Their Connection to Imitation Learning
Recommender systems aim to find an accurate and efficient mapping from h...
-
Dual Attention Networks for Multimodal Reasoning and Matching
We propose Dual Attention Networks (DANs) which jointly leverage visual ...
-
Hadamard Product for Low-rank Bilinear Pooling
Bilinear models provide rich representations compared with linear models...
-
Multimodal Residual Learning for Visual QA
Deep neural networks continue to advance the state-of-the-art of image r...