
Mediated Uncoupled Learning: Learning Functions without Direct Input-output Correspondences
Ordinary supervised learning is useful when we have paired training data...
Positive-Unlabeled Classification under Class-Prior Shift: A Prior-invariant Approach Based on Density Ratio Estimation
Learning from positive and unlabeled (PU) data is an important problem i...
Seeing Differently, Acting Similarly: Imitation Learning with Heterogeneous Observations
In many real-world imitation learning tasks, the demonstrator and the le...
Multi-Class Classification from Single-Class Data with Confidences
Can we learn a multiclass classifier from only data of a single class? ...
Probabilistic Margins for Instance Reweighting in Adversarial Training
Reweighting adversarial data during training has been recently shown to ...
On the Robustness of Average Losses for Partial-Label Learning
Partial-label (PL) learning is a typical weakly supervised classificatio...
Loss function based second-order Jensen inequality and its application to particle variational inference
Bayesian model averaging, obtained as the expectation of a likelihood fu...
Instance Correction for Learning with Open-set Noisy Labels
The problem of open-set noisy labels denotes that part of training data ...
Sample Selection with Uncertainty of Losses for Learning with Noisy Labels
In learning with noisy labels, the sample selection approach is very pop...
A unified view of likelihood ratio and reparameterization gradients
Reparameterization (RP) and likelihood ratio (LR) gradient estimators ar...
NoiLIn: Do Noisy Labels Always Hurt Adversarial Training?
Adversarial training (AT) based on minimax optimization is a popular lea...
Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization
It is well-known that stochastic gradient noise (SGN) acts as implicit r...
Approximating Instance-Dependent Noise via Instance-Confidence Embedding
Label noise in multiclass classification is a major obstacle to the depl...
Discovering Diverse Solutions in Deep Reinforcement Learning
Reinforcement learning (RL) algorithms are typically limited to learning...
Lower-bounded proper losses for weakly supervised classification
This paper discusses the problem of weakly supervised learning of classi...
LocalDrop: A Hybrid Regularization for Deep Neural Networks
In neural networks, developing regularization algorithms to settle overf...
Incorporating Causal Graphical Prior Knowledge into Predictive Modeling via Simple Data Augmentation
Causal graphs (CGs) are compact representations of the knowledge of the ...
Learning from Similarity-Confidence Data
Weakly supervised learning has drawn considerable attention recently to ...
CIFS: Improving Adversarial Robustness of CNNs via Channel-wise Importance-based Feature Selection
We investigate the adversarial robustness of CNNs from the perspective o...
Understanding the Interaction of Adversarial Training with Noisy Labels
Noisy labels (NL) and adversarial examples both undermine trained models...
Learning Noise Transition Matrix from Only Noisy Labels via Total Variation Regularization
Many weakly supervised classification methods employ a noise transition ...
Provably End-to-end Label-Noise Learning without Anchor Points
In label-noise learning, the transition matrix plays a key role in build...
Learning Diverse-Structured Networks for Adversarial Robustness
In adversarial training (AT), the main focus has been the objective and ...
Binary Classification from Multiple Unlabeled Datasets via Surrogate Set Classification
To cope with high annotation costs, training a classifier only from weak...
Source-free Domain Adaptation via Distributional Alignment by Matching Batch Normalization Statistics
In this paper, we propose a novel domain adaptation method for the sourc...
A Symmetric Loss Perspective of Reliable Machine Learning
When minimizing the empirical risk in binary classification, it is a com...
Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation
Combinatorial optimization is one of the fundamental research fields tha...
Stable Weight Decay Regularization
Weight decay is a popular regularization technique for training of deep ...
On Focal Loss for Class-Posterior Probability Estimation: A Theoretical Perspective
The focal loss has demonstrated its effectiveness in many real-world app...
Artificial Neural Variability for Deep Learning: On Overfitting, Noise Memorization, and Catastrophic Forgetting
Deep learning is often criticized for two serious issues which rarely exi...
A Survey of Label-noise Representation Learning: Past, Present and Future
Classical machine learning implicitly assumes that labels of the trainin...
Binary classification with ambiguous training data
In supervised learning, we often face ambiguous (A) samples that ar...
Classification with Rejection Based on Cost-sensitive Classification
The goal of classification with rejection is to avoid risky misclassific...
Maximum Mean Discrepancy is Aware of Adversarial Attacks
The maximum mean discrepancy (MMD) test, as a representative two-sample ...
Robust Imitation Learning from Noisy Demonstrations
Learning from noisy demonstrations is a practical but highly challenging...
Pointwise Binary Classification with Pairwise Confidence Comparisons
Ordinary (pointwise) binary classification aims to learn a binary classi...
Geometry-aware Instance-reweighted Adversarial Training
In adversarial machine learning, there was a common belief that robustne...
Provably Consistent Partial-Label Learning
Partial-label learning (PLL) is a multiclass classification problem, wh...
A One-step Approach to Covariate Shift Adaptation
A default assumption in many machine learning scenarios is that the trai...
Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels
In weakly supervised learning, unbiased risk estimator (URE) is a powerfu...
Adai: Separating the Effects of Adaptive Learning Rate and Momentum Inertia
Adaptive Momentum Estimation (Adam), which combines Adaptive Learning Ra...
Online Dense Subgraph Discovery via Blurred-Graph Feedback
Dense subgraph discovery aims to find a dense component in edge-weighted...
Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent
In continual learning settings, deep neural networks are prone to catast...
Coupling-based Invertible Neural Networks Are Universal Diffeomorphism Approximators
Invertible neural networks based on coupling flows (CF-INNs) have variou...
Analysis and Design of Thompson Sampling for Stochastic Partial Monitoring
We investigate finite stochastic partial monitoring, which is a general ...
LFD-ProtoNet: Prototypical Network Based on Local Fisher Discriminant Analysis for Few-shot Learning
The prototypical network (ProtoNet) is a few-shot learning framework tha...
Parts-dependent Label Noise: Towards Instance-dependent Label Noise
Learning with the instance-dependent label noise is challenging, because...
Dual T: Reducing Estimation Error for Transition Matrix in Label-noise Learning
The transition matrix, denoting the transition relationship from clean l...
Similarity-based Classification: Connecting Similarity Learning to Binary Classification
In real-world classification problems, pairwise supervision (i.e., a pai...
Rethinking Importance Weighting for Deep Learning under Distribution Shift
Under distribution shift (DS) where the training data distribution diffe...
Masashi Sugiyama
Director, RIKEN Center for Advanced Intelligence Project; Professor at University of Tokyo