GeoBlocks: A Query-Driven Storage Layout for Geospatial Data

08/21/2019
by   Christian Winter, et al.
0

City authorities need to analyze urban geospatial data to improve transportation and infrastructure. Current tools do not address the exploratory and interactive nature of these analyses and in many cases consult the raw data to compute query results. While pre-aggregation and materializing intermediate query results is common practice in many OLAP settings, it is rarely used to speed up geospatial queries. We introduce GeoBlocks, a pre-aggregating, query-driven storage layout for geospatial point data that can provide approximate, yet precision-bounded aggregation results over arbitrary query polygons. GeoBlocks adapt to the skew naturally present in query workloads to improve query performance over time. In summary, GeoBlocks outperform on-the-fly aggregation by up to several orders of magnitude, providing the sub-second query latencies required for interactive analytics.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset