
Binary Classification from Positive Data with Skewed Confidence
Positiveconfidence (Pconf) classification [Ishida et al., 2018] is a pr...
read it

Classification from Pairwise Similarities/Dissimilarities and Unlabeled Data via Empirical Risk Minimization
Pairwise similarities and dissimilarities between data points might be e...
read it

Online Multiclass Classification Based on Prediction Margin for Partial Feedback
We consider the problem of online multiclass classification with partial...
read it

Imitation Learning from Imperfect Demonstration
Imitation learning (IL) aims to learn an optimal policy from demonstrati...
read it

Solving NPHard Problems on Graphs by Reinforcement Learning without Domain Knowledge
We propose an algorithm based on reinforcement learning for solving NPh...
read it

A Diffusion Theory for Deep Learning Dynamics: Stochastic Gradient Descent Escapes From Sharp Minima Exponentially Fast
Stochastic optimization algorithms, such as Stochastic Gradient Descent ...
read it

Normalized Flat Minima: Exploring Scale Invariant Definition of Flat Minima for Neural Networks using PACBayesian Analysis
The notion of flat minima has played a key role in the generalization pr...
read it

A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme
Reparameterization (RP) and likelihood ratio (LR) gradient estimators ar...
read it

Timevarying Gaussian Process Bandit Optimization with Nonconstant Evaluation Time
The Gaussian process bandit is a problem in which we want to find a maxi...
read it

Pumpout: A Meta Approach for Robustly Training Deep Neural Networks with Noisy Labels
It is challenging to train deep neural networks robustly on the industri...
read it

Attacks Which Do Not Kill Training Make Adversarial Learning Stronger
Adversarial training based on the minimax formulation is necessary for o...
read it

SemiSupervised Ordinal Regression Based on Empirical Risk Minimization
We consider the semisupervised ordinal regression problem, where unlabe...
read it

Active Deep Qlearning with Demonstration
Recent research has shown that although Reinforcement Learning (RL) can ...
read it

Confidence Scores Make Instancedependent Labelnoise Learning Possible
Learning with noisy labels has drawn a lot of attention. In this area, m...
read it

Fewshot Domain Adaptation by Causal Mechanism Transfer
We study fewshot supervised domain adaptation (DA) for regression probl...
read it

Unsupervised Domain Adaptation Based on Sourceguided Discrepancy
Unsupervised domain adaptation is the problem setting where data generat...
read it

Classification from Triplet Comparison Data
Learning from triplet comparison data has been extensively studied in th...
read it

Towards Mixture Proportion Estimation without Irreducibility
Mixture proportion estimation (MPE) is a fundamental problem of practica...
read it

Hierarchical Policy Search via ReturnWeighted Density Estimation
Learning an optimal policy from a multimodal reward function is a chall...
read it

Binary Classification from PositiveConfidence Data
Reducing labeling costs in supervised learning is a critical issue in ma...
read it

Variational Inference based on Robust Divergences
Robustness to outliers is a central issue in realworld machine learning...
read it

Good Arm Identification via Bandit Feedback
In this paper, we consider and discuss a new stochastic multiarmed band...
read it

Fully adaptive algorithm for pure exploration in linear bandits
We propose the first fullyadaptive algorithm for pure exploration in li...
read it

Estimation of SquaredLoss Mutual Information from Positive and Unlabeled Data
Capturing inputoutput dependency is an important task in statistical da...
read it

ModeSeeking Clustering and Density Ridge Estimation via Direct Estimation of DensityDerivativeRatios
Modes and ridges of the probability density function behind observed dat...
read it

Expectation Propagation for tExponential Family Using QAlgebra
Exponential family distributions are highly useful in machine learning s...
read it

Deep Reinforcement Learning with Relative Entropy Stochastic Search
Many reinforcement learning methods for continuous control tasks are bas...
read it

Learning from Complementary Labels
Collecting labeled data is costly and thus a critical bottleneck in real...
read it

Bayesian Nonparametric PoissonProcess Allocation for TimeSequence Modeling
Analyzing the underlying structure of multiple timesequences provides i...
read it

SemiSupervised AUC Optimization based on PositiveUnlabeled Learning
Maximizing the area under the receiver operating characteristic curve (A...
read it

Stochastic Divergence Minimization for Biterm Topic Model
As the emergence and the thriving development of social networks, a huge...
read it

PositiveUnlabeled Learning with NonNegative Risk Estimator
From only positive (P) and unlabeled (U) data, a binary classifier could...
read it

Learning Discrete Representations via Information Maximizing SelfAugmented Training
Learning discrete representations of data is a central machine learning ...
read it

Policy Search with HighDimensional Context Variables
Direct contextual policy search methods learn to improve policy paramete...
read it

Revisiting Distributionally Robust Supervised Learning in Classification
Distributionally Robust Supervised Learning (DRSL) is necessary for buil...
read it

Classprior Estimation for Learning from Positive and Unlabeled Data
We consider the problem of estimating the class prior in an unlabeled da...
read it

Theoretical Comparisons of PositiveUnlabeled Learning against PositiveNegative Learning
In PU learning, a binary classifier is trained from positive (P) and unl...
read it

WhiteningFree LeastSquares NonGaussian Component Analysis
NonGaussian component analysis (NGCA) is an unsupervised linear dimensi...
read it

NonGaussian Component Analysis with LogDensity Gradient Estimation
NonGaussian component analysis (NGCA) is aimed at identifying a linear ...
read it

Faster Stochastic Variational Inference using ProximalGradient Methods with General Divergence Functions
Several recent works have explored stochastic gradient methods for varia...
read it

Theoretical and Experimental Analyses of TensorBased Regression and Classification
We theoretically and experimentally investigate tensorbased regression ...
read it

Direct Estimation of the Derivative of Quadratic Mutual Information with Application in Supervised Dimension Reduction
A typical goal of supervised dimension reduction is to find a lowdimens...
read it

Regularized MultiTask Learning for MultiDimensional LogDensity Gradient Estimation
Logdensity gradient estimation is a fundamental statistical problem and...
read it

Structure Learning of Partitioned Markov Networks
We learn the structure of a Markov Network between two groups of random ...
read it

Reinterpreting the Transformation Posterior in Probabilistic Image Registration
Probabilistic image registration methods estimate the posterior distribu...
read it

Support Consistency of Direct SparseChange Learning in Markov Networks
We study the problem of learning sparse structure changes between two Ma...
read it

Direct DensityDerivative Estimation and Its Application in KLDivergence Approximation
Estimation of density derivatives is a versatile tool in statistical dat...
read it

Conditional Density Estimation with Dimensionality Reduction via SquaredLoss Conditional Entropy Minimization
Regression aims at estimating the conditional mean of output given input...
read it

Clustering via Mode Seeking by Direct Estimation of the Gradient of a LogDensity
Mean shift clustering finds the modes of the data probability density by...
read it

Transductive Learning with Multiclass Volume Approximation
Given a hypothesis space, the large volume principle by Vladimir Vapnik ...
read it
Masashi Sugiyama
is this you? claim profile
Director  RIKEN Center for Advanced Intelligence Project, Professor at University of Tokyo