Taming Hybrid-Cloud Fast and Scalable Graph Analytics at Twitter

04/24/2022
by Chunxu Tang, et al.

Demand for graph analytics at Twitter has grown in recent years, and graph analytics has become a key part of Twitter's large-scale data analytics and machine learning for driving engagement, serving the most relevant content, and promoting healthier conversations. However, infrastructure for graph analytics has historically not been an area of investment at Twitter, so each project that deals with graphs at Twitter scale has required a long timeline and a huge engineering effort. How do we build a unified graph analytics user experience that supports modern data analytics on graphs ranging from thousands to hundreds of billions of vertices and edges? To bring fast and scalable graph analytics into production, we investigate the challenges of large-scale graph analytics at Twitter and propose a unified graph analytics platform for efficient, scalable, and reliable graph analytics across on-premises and cloud environments, meeting the requirements of diverse graph use cases and challenging scales. We also quantitatively benchmark popular graph analytics frameworks on Twitter's production-level graph use cases to validate our solution.

research
07/09/2022

Serving Hybrid-Cloud SQL Interactive Queries at Twitter

The demand for data analytics has been consistently increasing in the pa...
research
10/27/2020

In-situ data analytics for highly scalable cloud modelling on Cray machines

MONC is a highly scalable modelling tool for the investigation of atmosp...
research
07/21/2022

Templating Shuffles

Cloud data centers are rapidly evolving. At the same time, large-scale d...
research
04/15/2022

Saga: A Platform for Continuous Construction and Serving of Knowledge At Scale

We introduce Saga, a next-generation knowledge construction and serving ...
research
03/23/2018

GreyCat: Efficient What-If Analytics for Data in Motion at Scale

Over the last few years, data analytics shifted from a descriptive era, ...
research
08/22/2020

Brushing Feature Values in Immersive Graph Visualization Environment

There are a variety of graphs where multidimensional feature values are ...
research
02/01/2019

OODIDA: On-board/Off-board Distributed Data Analytics for Connected Vehicles

Connected vehicles may produce gigabytes of data per hour, which makes c...
