DeepAI AI Chat
Log In Sign Up

Towards optimized actions in critical situations of soccer games with deep reinforcement learning

by   Pegah Rahimian, et al.
Budapest University of Technology and Economics

Soccer is a sparse rewarding game: any smart or careless action in critical situations can change the result of the match. Therefore players, coaches, and scouts are all curious about the best action to be performed in critical situations, such as the times with a high probability of losing ball possession or scoring a goal. This work proposes a new state representation for the soccer game and a batch reinforcement learning to train a smart policy network. This network gets the contextual information of the situation and proposes the optimal action to maximize the expected goal for the team. We performed extensive numerical experiments on the soccer logs made by InStat for 104 European soccer matches. The results show that in all 104 games, the optimized policy obtains higher rewards than its counterpart in the behavior policy. Besides, our framework learns policies that are close to the expected behavior in the real world. For instance, in the optimized policy, we observe that some actions such as foul, or ball out can be sometimes more rewarding than a shot in specific situations.


page 4

page 12


Action valuation of on- and off-ball soccer players based on multi-agent deep reinforcement learning

Analysis of invasive sports such as soccer is challenging because the ga...

BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning

The field of Deep Reinforcement Learning (DRL) has recently seen a surge...

A framework for the fine-grained evaluation of the instantaneous expected value of soccer possessions

The expected possession value (EPV) of a soccer possession represents th...

Multi-Stage Episodic Control for Strategic Exploration in Text Games

Text adventure games present unique challenges to reinforcement learning...

Language Understanding for Text-based Games Using Deep Reinforcement Learning

In this paper, we consider the task of learning control policies for tex...