Journey of Migrating Millions of Queries on The Cloud

05/17/2022
by   Taro L. Saito, et al.
0

Treasure Data is processing millions of distributed SQL queries every day on the cloud. Upgrading the query engine service at this scale is challenging because we need to migrate all of the production queries of the customers to a new version while preserving the correctness and performance of the data processing pipelines. To ensure the quality of the query engines, we utilize our query logs to build customer-specific benchmarks and replay these queries with real customer data in a secure pre-production environment. To simulate millions of queries, we need effective minimization of test query sets and better reporting of the simulation results to proactively find incompatible changes and performance regression of the new version. This paper describes the overall design of our system and shares various challenges in maintaining the quality of the query engine service on the cloud.

READ FULL TEXT
research
07/06/2020

Reducing Misinformation in Query Autocompletions

Query autocompletions help users of search engines to speed up their sea...
research
04/12/2022

Forecasting SQL Query Cost at Twitter

With the advent of the Big Data era, it is usually computationally expen...
research
06/06/2018

Extended Diffix

A longstanding open problem is that of how to get high quality statistic...
research
08/09/2022

Doppler: Automated SKU Recommendation in Migrating SQL Workloads to the Cloud

Selecting the optimal cloud target to migrate SQL estates from on-premis...
research
12/20/2019

Shareable Representations for Search Query Understanding

Understanding search queries is critical for shopping search engines to ...
research
08/27/2020

Cost-based Query Rewriting Techniques for Optimizing Aggregates Over Correlated Windows

Window aggregates are ubiquitous in stream processing. In Azure Stream A...
research
11/03/2022

Introducing Hermes: Executing Clinical Quality Language (CQL) at over 66 Million Resources per Second (inexpensively)

Clinical Quality Language (CQL) has emerged as a standard for rule repre...

Please sign up or login with your details

Forgot password? Click here to reset