Speeding up the Metabolism in E-commerce by Reinforcement Mechanism Design

07/02/2018
by   Hua-Lin He, et al.
0

In a large E-commerce platform, all the participants compete for impressions under the allocation mechanism of the platform. Existing methods mainly focus on the short-term return based on the current observations instead of the long-term return. In this paper, we formally establish the lifecycle model for products, by defining the introduction, growth, maturity and decline stages and their transitions throughout the whole life period. Based on such model, we further propose a reinforcement learning based mechanism design framework for impression allocation, which incorporates the first principal component based permutation and the novel experiences generation method, to maximize short-term as well as long-term return of the platform. With the power of trial-and-error, it is possible to optimize impression allocation strategies globally which is contribute to the healthy development of participants and the platform itself. We evaluate our algorithm on a simulated environment built based on one of the largest E-commerce platforms, and a significant improvement has been achieved in comparison with the baseline solutions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/24/2021

Long-term IaaS Provider Selection using Short-term Trial Experience

We propose a novel approach to select privacy-sensitive IaaS providers f...
research
08/20/2021

Reinforcement Learning to Optimize Lifetime Value in Cold-Start Recommendation

Recommender system plays a crucial role in modern E-commerce platform. D...
research
11/01/2020

Long-term IaaS Selection using Performance Discovery

We propose a novel framework to select IaaS providers according to a con...
research
04/05/2022

Learning to Bid Long-Term: Multi-Agent Reinforcement Learning with Long-Term and Sparse Reward in Repeated Auction Games

We propose a multi-agent distributed reinforcement learning algorithm th...
research
01/01/2023

Designing organizations for bottom-up task allocation: The role of incentives

In recent years, various decentralized organizational forms have emerged...
research
11/14/2018

Predictive Modeling with Delayed Information: a Case Study in E-commerce Transaction Fraud Control

In Business Intelligence, accurate predictive modeling is the key for pr...
research
08/25/2017

Reinforcement Mechanism Design for e-commerce

We study the problem of allocating impressions to sellers in e-commerce ...

Please sign up or login with your details

Forgot password? Click here to reset