Tetsuro Morimura

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Masashi Sugiyama
222 publications
Hisashi Kashima
40 publications
Kun Zhao
33 publications
Takayuki Osogami
16 publications
Naoto Ohsaka
16 publications
Kenshi Abe
14 publications
Riku Togashi
14 publications
Toshiyuki Tanaka
11 publications
Hirotaka Hachiya
8 publications
Yuu Jinnai
8 publications
Tatsushi Oka
5 publications

research

∙ 08/25/2023

On the Depth between Beam Search and Exhaustive Search for Text Generation

Beam search and exhaustive search are two extreme ends of text decoding ...

0 Yuu Jinnai, et al. ∙

research

∙ 07/13/2023

Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative

Dialog policies, which determine a system's action based on the current ...

0 Sho Shimoyama, et al. ∙

research

∙ 06/08/2023

Safe Collaborative Filtering

Excellent tail performance is crucial for modern machine learning tasks,...

0 Riku Togashi, et al. ∙

research

∙ 06/02/2022

Policy Gradient Algorithms with Monte-Carlo Tree Search for Non-Markov Decision Processes

Policy gradient (PG) is a reinforcement learning (RL) approach that opti...

10 Tetsuro Morimura, et al. ∙

research

∙ 07/02/2019

Visual analytics for team-based invasion sports with significant events and Markov reward process

In team-based invasion sports such as soccer and basketball, analytics i...

3 Kun Zhao, et al. ∙

research

∙ 06/16/2019

Sampler for Composition Ratio by Markov Chain Monte Carlo

Invention involves combination, or more precisely, ratios of composition...

0 Yachiko Obara, et al. ∙

research

∙ 03/15/2012

Parametric Return Density Estimation for Reinforcement Learning

Most conventional Reinforcement Learning (RL) algorithms aim to optimize...

0 Tetsuro Morimura, et al. ∙

Tetsuro Morimura

Featured Co-authors

On the Depth between Beam Search and Exhaustive Search for Text Generation

Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative

Safe Collaborative Filtering

Policy Gradient Algorithms with Monte-Carlo Tree Search for Non-Markov Decision Processes

Visual analytics for team-based invasion sports with significant events and Markov reward process

Sampler for Composition Ratio by Markov Chain Monte Carlo

Parametric Return Density Estimation for Reinforcement Learning

Sign in with Google

Consider DeepAI Pro