Real-time Data Infrastructure at Uber

03/31/2021
by Yupeng Fu, et al.

Uber's business is highly real-time in nature. Petabytes of data are continuously collected every day from end users such as Uber drivers, riders, restaurants, and eaters. There is a lot of valuable information to be processed, and many decisions must be made in seconds for a variety of use cases such as customer incentives, fraud detection, and machine learning model prediction. In addition, there is an increasing need to expose this ability to different user categories, including engineers, data scientists, executives, and operations personnel, which adds to the complexity. In this paper, we present the overall architecture of the real-time data infrastructure and identify three scaling challenges that we need to continuously address for each component in the architecture. At Uber, we rely heavily on open-source technologies for the key areas of the infrastructure. On top of this open-source software, we add significant improvements and customizations to make the open-source solutions fit Uber's environment and to bridge the gaps in meeting Uber's unique scale and requirements. We then highlight several important use cases and show their real-time solutions and tradeoffs. Finally, we reflect on the lessons we learned as we built, operated, and scaled these systems.
