Reinforcement Mechanism Design for e-commerce

08/25/2017
by   Qingpeng Cai, et al.
0

We study the problem of allocating impressions to sellers in e-commerce websites, such as Amazon, eBay or Taobao, aiming to maximize the total revenue generated by the platform. We employ a general framework of reinforcement mechanism design, which uses deep reinforcement learning to design efficient algorithms, taking the strategic behaviour of the sellers into account. Specifically, we model the impression allocation problem as a Markov decision process, where the states encode the history of impressions, prices, transactions and generated revenue and the actions are the possible impression allocations in each round. To tackle the problem of continuity and high-dimensionality of states and actions, we adopt the ideas of the DDPG algorithm to design an actor-critic policy gradient algorithm which takes advantage of the problem domain in order to achieve convergence and stability. We evaluate our proposed algorithm, coined IA(GRU), by comparing it against DDPG, as well as several natural heuristics, under different rationality models for the sellers - we assume that sellers follow well-known no-regret type strategies which may vary in their degree of sophistication. We find that IA(GRU) outperforms all algorithms in terms of the total revenue.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/25/2017

A deep reinforcement learning framework for allocating buyer impressions in e-commerce websites

We study the problem of allocating impressions to sellers in e-commerce ...
research
09/28/2021

Exploring More When It Needs in Deep Reinforcement Learning

We propose a exploration mechanism of policy in Deep Reinforcement Learn...
research
04/02/2022

Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation

Ads allocation, that allocates ads and organic items to limited slots in...
research
10/18/2021

An actor-critic algorithm with deep double recurrent agents to solve the job shop scheduling problem

There is a growing interest in integrating machine learning techniques a...
research
02/22/2021

Communication Efficient Parallel Reinforcement Learning

We consider the problem where M agents interact with M identical and ind...
research
07/02/2018

Speeding up the Metabolism in E-commerce by Reinforcement Mechanism Design

In a large E-commerce platform, all the participants compete for impress...

Please sign up or login with your details

Forgot password? Click here to reset