Computation Resource Allocation Solution in Recommender Systems

03/03/2021
by   Xun Yang, et al.
0

Recommender systems rely heavily on increasing computation resources to improve their business goal. By deploying computation-intensive models and algorithms, these systems are able to inference user interests and exhibit certain ads or commodities from the candidate set to maximize their business goals. However, such systems are facing two challenges in achieving their goals. On the one hand, facing massive online requests, computation-intensive models and algorithms are pushing their computation resources to the limit. On the other hand, the response time of these systems is strictly limited to a short period, e.g. 300 milliseconds in our real system, which is also being exhausted by the increasingly complex models and algorithms. In this paper, we propose the computation resource allocation solution (CRAS) that maximizes the business goal with limited computation resources and response time. We comprehensively illustrate the problem and formulate such a problem as an optimization problem with multiple constraints, which could be broken down into independent sub-problems. To solve the sub-problems, we propose the revenue function to facilitate the theoretical analysis, and obtain the optimal computation resource allocation strategy. To address the applicability issues, we devise the feedback control system to help our strategy constantly adapt to the changing online environment. The effectiveness of our method is verified by extensive experiments based on the real dataset from Taobao.com. We also deploy our method in the display advertising system of Alibaba. The online results show that our computation resource allocation solution achieves significant business goal improvement without any increment of computation cost, which demonstrates the efficacy of our method in real industrial practice.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/17/2020

DCAF: A Dynamic Computation Allocation Framework for Online Serving System

Modern large-scale systems such as recommender system and online adverti...
research
10/23/2021

Knowledge Transfer based Radio and Computation Resource Allocation for 5G RAN Slicing

To implement network slicing in 5G, resource allocation is a key functio...
research
07/15/2021

Automatic Resource Allocation in Business Processes: A Systematic Literature Survey

For delivering products or services to their clients, organizations exec...
research
09/20/2022

Synthesis of Cost-Optimal Multi-Agent Systems for Resource Allocation

Multi-agent systems for resource allocation (MRAs) have been introduced ...
research
06/03/2021

JIZHI: A Fast and Cost-Effective Model-As-A-Service System for Web-Scale Online Inference at Baidu

In modern internet industries, deep learning based recommender systems h...
research
04/19/2023

Learning policies for resource allocation in business processes

Resource allocation is the assignment of resources to activities that mu...
research
04/21/2017

A dynamic resource allocation decision model for IT security

Today, with the continued growth in using information and communication ...

Please sign up or login with your details

Forgot password? Click here to reset