When Blockchain Meets AI: Optimal Mining Strategy Achieved By Machine Learning

11/29/2019
by   Taotao Wang, et al.
0

This work applies reinforcement learning (RL) from the AI machine learning field to derive an optimal blockchain mining strategy without knowing the details of the blockchain network model. Previously, the most profitable mining strategy was believed to be honest mining encoded in the default Bitcoin-like blockchain protocol. It was shown later that it is possible to gain more mining rewards by deviating from honest mining. In particular, the mining problem can be formulated as a Markov Decision Process (MDP) which can be solved to give the optimal mining strategy. However, solving the mining MDP requires knowing the values of various parameters that characterize the blockchain network model. In real blockchain networks, these parameter values are not easy to obtain and may change over time. This hinders the use of the MDP model-based solution. In this work, we employ RL to dynamically learn a mining strategy with performance approaching that of the optimal mining strategy by observing and interacting with the network. Since the mining MDP problem has a non-linear objective function (rather than linear functions of standard MDP problems), we design a new multi-dimensional RL algorithm to solve the problem. Experimental results indicate that, without knowing the parameter values of the mining MDP model, our multi-dimensional RL mining algorithm can still achieve the optimal performance over time-varying blockchain networks.

READ FULL TEXT
research
07/10/2020

Efficient MDP Analysis for Selfish-Mining in Blockchains

A proof of work (PoW) blockchain protocol distributes rewards to its par...
research
02/20/2021

Importance of Environment Design in Reinforcement Learning: A Study of a Robotic Environment

An in-depth understanding of the particular environment is crucial in re...
research
07/09/2021

Safe Exploration by Solving Early Terminated MDP

Safe exploration is crucial for the real-world application of reinforcem...
research
12/10/2021

A Validation Tool for Designing Reinforcement Learning Environments

Reinforcement learning (RL) has gained increasing attraction in the acad...
research
02/19/2020

Toward Low-Cost and Stable Blockchain Networks

Envisioned to be the future of distributed systems, blockchain networks ...
research
06/07/2021

Closed-Form Analytical Results for Maximum Entropy Reinforcement Learning

We introduce a mapping between Maximum Entropy Reinforcement Learning (M...
research
11/01/2018

A Deep Dive into Blockchain Selfish Mining

This paper studies a fundamental problem regarding the security of block...

Please sign up or login with your details

Forgot password? Click here to reset