Using deep Q-learning to understand the tax evasion behavior of risk-averse firms

01/29/2018
by   Nikolaos D. Goumagias, et al.
0

Designing tax policies that are effective in curbing tax evasion and maximize state revenues requires a rigorous understanding of taxpayer behavior. This work explores the problem of determining the strategy a self-interested, risk-averse tax entity is expected to follow, as it "navigates" - in the context of a Markov Decision Process - a government-controlled tax environment that includes random audits, penalties and occasional tax amnesties. Although simplified versions of this problem have been previously explored, the mere assumption of risk-aversion (as opposed to risk-neutrality) raises the complexity of finding the optimal policy well beyond the reach of analytical techniques. Here, we obtain approximate solutions via a combination of Q-learning and recent advances in Deep Reinforcement Learning. By doing so, we i) determine the tax evasion behavior expected of the taxpayer entity, ii) calculate the degree of risk aversion of the "average" entity given empirical estimates of tax evasion, and iii) evaluate sample tax policies, in terms of expected revenues. Our model can be useful as a testbed for "in-vitro" testing of tax policies, while our results lead to various policy recommendations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/08/2017

Autonomous Braking System via Deep Reinforcement Learning

In this paper, we propose a new autonomous braking system based on deep ...
research
02/27/2023

Distributional Method for Risk Averse Reinforcement Learning

We introduce a distributional method for learning the optimal policy in ...
research
06/27/2019

Adaptive Honeypot Engagement through Reinforcement Learning of Semi-Markov Decision Processes

The honeynet is a promising active cyber defense mechanism. It reveals t...
research
06/06/2022

Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks

We consider the off-policy evaluation problem of reinforcement learning ...
research
05/02/2017

Navigating Intersections with Autonomous Vehicles using Deep Reinforcement Learning

Providing an efficient strategy to navigate safely through unsignaled in...
research
03/04/2021

On the Convergence and Optimality of Policy Gradient for Markov Coherent Risk

In order to model risk aversion in reinforcement learning, an emerging l...
research
07/21/2020

Flow Sampling: Accurate and Load-balanced Sampling Policies

Software-defined networking simplifies network monitoring by means of pe...

Please sign up or login with your details

Forgot password? Click here to reset