A deep reinforcement learning framework for allocating buyer impressions in e-commerce websites

08/25/2017
by   Qingpeng Cai, et al.
0

We study the problem of allocating impressions to sellers in e-commerce websites, such as Amazon, eBay or Taobao, aiming to maximize the total revenue generated by the platform. When a buyer searches for a keyword, the website presents the buyer with a list of different sellers for this item, together with the corresponding prices. This can be seen as an instance of a resource allocation problem in which the sellers choose their prices at each step and the platform decides how to allocate the impressions, based on the chosen prices and the historical transactions of each seller. Due to the complexity of the system, most e-commerce platforms employ heuristic allocation algorithms that mainly depend on the sellers' transaction records and without taking the rationality of the sellers into account, which makes them susceptible to several price manipulations. In this paper, we put forward a general framework of designing impression allocation algorithms in e-commerce websites given any behavioural model for the sellers, using deep reinforcement learning. The impression allocation problem is modeled as a Markov decision process, where the states encode the history of impressions, prices, transactions and generated revenue and the actions are the possible impression allocations at each round. To tackle the problem of continuity and high-dimensionality of states and actions, we adopt the ideas of the DDPG algorithm to design an actor-critic gradient policy algorithm which takes advantage of the problem domain in order to achieve covergence and stability. Our algorithm is compared against natural heuristics and it outperforms all of them in terms of the total revenue generated. Finally, contrary to the DDPG algorithm, our algorithm is robust to settings with variable sellers and easy to converge.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/25/2017

Reinforcement Mechanism Design for e-commerce

We study the problem of allocating impressions to sellers in e-commerce ...
research
12/05/2019

Dynamic Pricing on E-commerce Platform with Deep Reinforcement Learning

In this paper we present an end-to-end framework for addressing the prob...
research
04/02/2022

Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation

Ads allocation, that allocates ads and organic items to limited slots in...
research
10/01/2021

Dynamic CU-DU Selection for Resource Allocation in O-RAN Using Actor-Critic Learning

Recently, there has been tremendous efforts by network operators and equ...
research
12/08/2021

Application of Deep Reinforcement Learning to Payment Fraud

The large variety of digital payment choices available to consumers toda...
research
10/29/2019

Resource Allocation Using Gradient Boosting Aided Deep Q-Network for IoT in C-RANs

In this paper, we investigate dynamic resource allocation (DRA) problems...
research
01/12/2018

A Quantitative Approach in Heuristic Evaluation of E-commerce Websites

This paper presents a pilot study on developing an instrument to predict...

Please sign up or login with your details

Forgot password? Click here to reset