
A One-step Approach to Covariate Shift Adaptation
A default assumption in many machine learning scenarios is that the trai...

Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels
In weakly supervised learning, unbiased risk estimator (URE) is a powerfu...

Adai: Separating the Effects of Adaptive Learning Rate and Momentum Inertia
Adaptive Momentum Estimation (Adam), which combines Adaptive Learning Ra...

Online Dense Subgraph Discovery via Blurred-Graph Feedback
Dense subgraph discovery aims to find a dense component in edge-weighted...

Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent
In continual learning settings, deep neural networks are prone to catast...

Coupling-based Invertible Neural Networks Are Universal Diffeomorphism Approximators
Invertible neural networks based on coupling flows (CF-INNs) have variou...

Analysis and Design of Thompson Sampling for Stochastic Partial Monitoring
We investigate finite stochastic partial monitoring, which is a general ...

LFD-ProtoNet: Prototypical Network Based on Local Fisher Discriminant Analysis for Few-shot Learning
The prototypical network (ProtoNet) is a few-shot learning framework tha...

Parts-dependent Label Noise: Towards Instance-dependent Label Noise
Learning with the instance-dependent label noise is challenging, because...

Dual T: Reducing Estimation Error for Transition Matrix in Label-noise Learning
The transition matrix, denoting the transition relationship from clean l...

Similarity-based Classification: Connecting Similarity Learning to Binary Classification
In real-world classification problems, pairwise supervision (i.e., a pai...

Rethinking Importance Weighting for Deep Learning under Distribution Shift
Under distribution shift (DS) where the training data distribution diffe...

Calibrated Surrogate Losses for Adversarially Robust Classification
Adversarially robust classification seeks a classifier that is insensiti...

Learning from Aggregate Observations
We study the problem of learning from aggregate observations where super...

Do Public Datasets Assure Unbiased Comparisons for Registration Evaluation?
With the increasing availability of new image registration approaches, a...

Time-varying Gaussian Process Bandit Optimization with Non-constant Evaluation Time
The Gaussian process bandit is a problem in which we want to find a maxi...

Attacks Which Do Not Kill Training Make Adversarial Learning Stronger
Adversarial training based on the minimax formulation is necessary for o...

Do We Need Zero Training Loss After Achieving Zero Training Error?
Overparameterized deep networks have the capacity to memorize training d...

Progressive Identification of True Labels for Partial-Label Learning
Partial-label learning is one of the important weakly supervised learnin...

Towards Mixture Proportion Estimation without Irreducibility
Mixture proportion estimation (MPE) is a fundamental problem of practica...

Few-shot Domain Adaptation by Causal Mechanism Transfer
We study few-shot supervised domain adaptation (DA) for regression probl...

A Diffusion Theory for Deep Learning Dynamics: Stochastic Gradient Descent Escapes From Sharp Minima Exponentially Fast
Stochastic optimization algorithms, such as Stochastic Gradient Descent ...

Learning from Noisy Similar and Dissimilar Data
With the widespread use of machine learning for classification, it becom...

Binary Classification from Positive Data with Skewed Confidence
Positive-confidence (Pconf) classification [Ishida et al., 2018] is a pr...

Confidence Scores Make Instance-dependent Label-noise Learning Possible
Learning with noisy labels has drawn a lot of attention. In this area, m...

Where is the Bottleneck of Adversarial Learning with Unlabeled Data?
Deep neural networks (DNNs) are incredibly brittle due to adversarial ex...

Scalable Evaluation and Improvement of Document Set Expansion via Neural Positive-Unlabeled Learning
We consider the situation in which a user has collected a small set of d...

Mitigating Overfitting in Supervised Classification from Two Unlabeled Datasets: A Consistent Risk Correction Approach
From two unlabeled (U) datasets with different class priors, we can trai...

A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme
Reparameterization (RP) and likelihood ratio (LR) gradient estimators ar...

Learning from Indirect Observations
Weakly-supervised learning is a paradigm for alleviating the scarcity of...

Learning Only from Relevant Keywords and Unlabeled Documents
We consider a document classification problem where document labels are ...

Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics
Many real world tasks require multiple agents to work together. Multiag...

VILD: Variational Imitation Learning with Diverse-quality Demonstrations
The goal of imitation learning (IL) is to learn a good policy from high...

Pilot Study on Verifying the Monotonic Relationship between Error and Uncertainty in Deformable Registration for Neurosurgery
In image-guided neurosurgery, deformable registration currently is not a...

Classification from Triplet Comparison Data
Learning from triplet comparison data has been extensively studied in th...

Direction Matters: On Influence-Preserving Graph Summarization and Max-cut Principle for Directed Graphs
Summarizing large-scaled directed graphs into small-scale representation...

Are Anchor Points Really Indispensable in Label-Noise Learning?
In label-noise learning, noise transition matrix, denoting the probabili...

Uncoupled Regression from Pairwise Comparison Data
Uncoupled regression is the problem to learn a model from unlabeled data...

Calibrated Surrogate Maximization of Linear-fractional Utility in Binary Classification
Complex classification performance metrics such as the F_β-measure and J...

Fast and Robust Rank Aggregation against Model Misspecification
In rank aggregation, preferences from different users are summarized int...

Solving NP-Hard Problems on Graphs by Reinforcement Learning without Domain Knowledge
We propose an algorithm based on reinforcement learning for solving NPh...

Butterfly: A Panacea for All Difficulties in Wildly Unsupervised Domain Adaptation
In unsupervised domain adaptation (UDA), classifiers for the target doma...

Butterfly: Robust One-step Approach towards Wildly-unsupervised Domain Adaptation
Unsupervised domain adaptation (UDA) trains with clean labeled data in s...

Classification from Pairwise Similarities/Dissimilarities and Unlabeled Data via Empirical Risk Minimization
Pairwise similarities and dissimilarities between data points might be e...

Zero-shot Domain Adaptation Based on Attribute Information
In this paper, we propose a novel domain adaptation method that can be a...

Polynomial-time Algorithms for Combinatorial Pure Exploration with Full-bandit Feedback
We study the problem of stochastic combinatorial pure exploration (CPE),...

Online Multiclass Classification Based on Prediction Margin for Partial Feedback
We consider the problem of online multiclass classification with partial...

Semi-Supervised Ordinal Regression Based on Empirical Risk Minimization
We consider the semi-supervised ordinal regression problem, where unlabe...

New Tricks for Estimating Gradients of Expectations
We derive a family of Monte Carlo estimators for gradients of expectatio...

On Possibility and Impossibility of Multiclass Classification with Rejection
We investigate the problem of multiclass classification with rejection, ...
Masashi Sugiyama
Director, RIKEN Center for Advanced Intelligence Project; Professor at University of Tokyo