b'Issei Sato'

research

∙ 07/26/2023

Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators?

Existing analyses of the expressive capacity of Transformer models have ...

0 Tokio Kajitsuka, et al. ∙

research

∙ 05/26/2023

Exploring Weight Balancing on Long-Tailed Recognition Problem

Recognition problems in long-tailed data, where the sample size per clas...

0 Naoya Hasegawa, et al. ∙

research

∙ 06/02/2022

Excess risk analysis for epistemic uncertainty with application to variational inference

We analyze the epistemic uncertainty (EU) of supervised learning in Baye...

0 Futoshi Futami, et al. ∙

research

∙ 05/15/2022

Analyzing Lottery Ticket Hypothesis from PAC-Bayesian Theory Perspective

The lottery ticket hypothesis (LTH) has attracted attention because it c...

0 Keitaro Sakamoto, et al. ∙

research

∙ 04/29/2022

Goldilocks-curriculum Domain Randomization and Fractal Perlin Noise with Application to Sim2Real Pneumonia Lesion Detection

A computer-aided detection (CAD) system based on machine learning is exp...

0 Takahiro Suzuki, et al. ∙

research

∙ 04/18/2022

Empirical Evaluation and Theoretical Analysis for Representation Learning: A Survey

Representation learning enables us to automatically extract generic feat...

0 Kento Nozawa, et al. ∙

research

∙ 04/11/2022

Neural Lagrangian Schrödinger bridge

Population dynamics is the study of temporal and spatial variation in th...

7 Takeshi Koshizuka, et al. ∙

research

∙ 10/11/2021

A Closer Look at Prototype Classifier for Few-shot Image Classification

The prototypical network is a prototype classifier based on meta-learnin...

0 Mingcheng Hou, et al. ∙

research

∙ 08/31/2021

Disentanglement Analysis with Partial Information Decomposition

Given data generated from multiple factors of variation that cooperative...

0 Seiya Tokui, et al. ∙

research

∙ 06/09/2021

Loss function based second-order Jensen inequality and its application to particle variational inference

Bayesian model averaging, obtained as the expectation of a likelihood fu...

0 Futoshi Futami, et al. ∙

research

∙ 03/17/2021

Toward Neural-Network-Guided Program Synthesis and Verification

We propose a novel framework of program and invariant synthesis called n...

9 Naoki Kobayashi, et al. ∙

research

∙ 02/24/2021

Abelian Neural Networks

We study the problem of modeling a binary operation that satisfies some ...

0 Kenshin Abe, et al. ∙

research

∙ 02/13/2021

Understanding Negative Samples in Instance Discriminative Self-supervised Representation Learning

Instance discriminative self-supervised representation learning has been...

0 Kento Nozawa, et al. ∙

research

∙ 02/01/2021

Binary Classification from Multiple Unlabeled Datasets via Surrogate Set Classification

To cope with high annotation costs, training a classifier only from weak...

0 Shida Lei, et al. ∙

research

∙ 11/23/2020

Stable Weight Decay Regularization

Weight decay is a popular regularization technique for training of deep ...

9 Zeke Xie, et al. ∙

research

∙ 11/12/2020

Artificial Neural Variability for Deep Learning: On Overfitting, Noise Memorization, and Catastrophic Forgetting

Deep learning is often criticized by two serious issues which rarely exi...

1 Zeke Xie, et al. ∙

research

∙ 08/03/2020

Classification from Ambiguity Comparisons

Labeling data is an unavoidable pre-processing procedure for most machin...

8 Zhenghang Cui, et al. ∙

research

∙ 07/03/2020

Diagnostic Uncertainty Calibration: Towards Reliable Machine Predictions in Medical Domain

Label disagreement between human experts is a common issue in the medica...

68 Takahiro Mimori, et al. ∙

research

∙ 06/29/2020

Adai: Separating the Effects of Adaptive Learning Rate and Momentum Inertia

Adaptive Momentum Estimation (Adam), which combines Adaptive Learning Ra...

8 Zeke Xie, et al. ∙

research

∙ 06/15/2020

LFD-ProtoNet: Prototypical Network Based on Local Fisher Discriminant Analysis for Few-shot Learning

The prototypical network (ProtoNet) is a few-shot learning framework tha...

0 Kei Mukaiyama, et al. ∙

research

∙ 06/13/2020

γ-ABC: Outlier-Robust Approximate Bayesian Computation based on Robust Divergence Estimator

Making a reliable inference in complex models is an essential issue in s...

0 Masahiro Fujisawa, et al. ∙

research

∙ 06/11/2020

Similarity-based Classification: Connecting Similarity Learning to Binary Classification

In real-world classification problems, pairwise supervision (i.e., a pai...

6 Han Bao, et al. ∙

research

∙ 05/08/2020

Sequential Gallery for Interactive Visual Design Optimization

Visual design tasks often involve tuning many design parameters. For exa...

0 Yuki Koyama, et al. ∙

research

∙ 03/10/2020

Time-varying Gaussian Process Bandit Optimization with Non-constant Evaluation Time

The Gaussian process bandit is a problem in which we want to find a maxi...

9 Hideaki Imamura, et al. ∙

research

∙ 02/10/2020

Few-shot Domain Adaptation by Causal Mechanism Transfer

We study few-shot supervised domain adaptation (DA) for regression probl...

3 Takeshi Teshima, et al. ∙

research

∙ 02/10/2020

A Diffusion Theory for Deep Learning Dynamics: Stochastic Gradient Descent Escapes From Sharp Minima Exponentially Fast

Stochastic optimization algorithms, such as Stochastic Gradient Descent ...

15 Zeke Xie, et al. ∙

research

∙ 11/20/2019

Bayesian interpretation of SGD as Ito process

The current interpretation of stochastic gradient descent (SGD) as a sto...

20 Soma Yokoi, et al. ∙

research

∙ 07/24/2019

Classification from Triplet Comparison Data

Learning from triplet comparison data has been extensively studied in th...

1 Zhenghang Cui, et al. ∙

research

∙ 06/24/2019

Interactive Subspace Exploration on Generative Image Modelling

Generative image modeling techniques such as GAN demonstrate highly conv...

1 Toby Chong Long Hin, et al. ∙

research

∙ 05/28/2019

Solving NP-Hard Problems on Graphs by Reinforcement Learning without Domain Knowledge

We propose an algorithm based on reinforcement learning for solving NP-h...

13 Kenshin Abe, et al. ∙

research

∙ 05/02/2019

Directing DNNs Attention for Facial Attribution Classification using Gradient-weighted Class Activation Mapping

Deep neural networks (DNNs) have a high accuracy on image classification...

12 Xi Yang, et al. ∙

research

∙ 04/26/2019

Classification from Pairwise Similarities/Dissimilarities and Unlabeled Data via Empirical Risk Minimization

Pairwise similarities and dissimilarities between data points might be e...

22 Takuya Shimada, et al. ∙

research

∙ 03/22/2019

Use of Ghost Cytometry to Differentiate Cells with Similar Gross Morphologic Characteristics

Imaging flow cytometry shows significant potential for increasing our un...

0 Hiroaki Adachi, et al. ∙

research

∙ 03/14/2019

On Learning from Ghost Imaging without Imaging

Computational ghost imaging is an imaging technique with which an object...

12 Issei Sato, et al. ∙

research

∙ 03/07/2019

On Transformations in Stochastic Gradient MCMC

Stochastic gradient Langevin dynamics (SGLD) is a widely used sampler fo...

8 Soma Yokoi, et al. ∙

research

∙ 02/12/2019

PAC-Bayes Analysis of Sentence Representation

Learning sentence vectors from an unlabeled corpus has attracted attenti...

16 Kento Nozawa, et al. ∙

research

∙ 02/04/2019

Online Multiclass Classification Based on Prediction Margin for Partial Feedback

We consider the problem of online multiclass classification with partial...

20 Takuo Kaneko, et al. ∙

research

∙ 02/01/2019

Multi-level Monte Carlo Variational Inference

In many statistics and machine learning frameworks, stochastic optimizat...

16 Masahiro Fujisawa, et al. ∙

research

∙ 01/31/2019

Semi-Supervised Ordinal Regression Based on Empirical Risk Minimization

We consider the semi-supervised ordinal regression problem, where unlabe...

6 Taira Tsuchiya, et al. ∙

research

∙ 01/15/2019

Normalized Flat Minima: Exploring Scale Invariant Definition of Flat Minima for Neural Networks using PAC-Bayesian Analysis

The notion of flat minima has played a key role in the generalization pr...

12 Yusuke Tsuzuku, et al. ∙

research

∙ 09/13/2018

Clipped Matrix Completion: a Remedy for Ceiling Effects

We consider the recovery of a low-rank matrix from its clipped observati...

0 Takeshi Teshima, et al. ∙

research

∙ 09/11/2018

On the Structural Sensitivity of Deep Convolutional Networks to the Directions of Fourier Basis Functions

Data-agnostic quasi-imperceptible perturbations on inputs can severely d...

0 Yusuke Tsuzuku, et al. ∙

research

∙ 09/11/2018

Unsupervised Domain Adaptation Based on Source-guided Discrepancy

Unsupervised domain adaptation is the problem setting where data generat...

2 Seiichi Kuroki, et al. ∙

research

∙ 05/21/2018

Frank-Wolfe Stein Sampling

In Bayesian inference, the posterior distributions are difficult to obta...

0 Futoshi Futami, et al. ∙

research

∙ 03/12/2018

Variational Inference for Gaussian Process with Panel Count Data

We present the first framework for Gaussian-process-modulated Poisson pr...

0 Hongyi Ding, et al. ∙

research

∙ 02/13/2018

Analysis of Minimax Error Rate for Crowdsourcing and Its Application to Worker Clustering Model

While crowdsourcing has become an important means to label data, crowdwo...

0 Hideaki Imamura, et al. ∙

research

∙ 02/12/2018

Lipschitz-Margin Training: Scalable Certification of Perturbation Invariance for Deep Neural Networks

High sensitivity of neural networks against malicious perturbations on i...

0 Yusuke Tsuzuku, et al. ∙

research

∙ 02/12/2018

Gaussian Process Classification with Privileged Information by Soft-to-Hard Labeling Transfer

Learning using privileged information is an attractive problem setting t...

0 Ryosuke Kamesawa, et al. ∙

research

∙ 10/18/2017

Variational Inference based on Robust Divergences

Robustness to outliers is a central issue in real-world machine learning...

1 Futoshi Futami, et al. ∙

research

∙ 09/26/2017

On the Model Shrinkage Effect of Gamma Process Edge Partition Models

The edge partition model (EPM) is a fundamental Bayesian nonparametric m...

0 Iku Ohama, et al. ∙

Issei Sato

Featured Co-authors

Sign in with Google

Consider DeepAI Pro