Positional Paper: Schema-First Application Telemetry

06/22/2022
by   Yuri Shkuro, et al.
0

Application telemetry refers to measurements taken from software systems to assess their performance, availability, correctness, efficiency, and other aspects useful to operators, as well as to troubleshoot them when they behave abnormally. Many modern observability platforms support dimensional models of telemetry signals where the measurements are accompanied by additional dimensions used to identify either the resources described by the telemetry or the business-specific attributes of the activities (e.g., a customer identifier). However, most of these platforms lack any semantic understanding of the data, by not capturing any metadata about telemetry, from simple aspects such as units of measure or data types (treating all dimensions as strings) to more complex concepts such as purpose policies. This limits the ability of the platforms to provide a rich user experience, especially when dealing with different telemetry assets, for example, linking an anomaly in a time series with the corresponding subset of logs or traces, which requires semantic understanding of the dimensions in the respective data sets. In this paper, we describe a schema-first approach to application telemetry that is being implemented at Meta. It allows the observability platforms to capture metadata about telemetry from the start and enables a wide range of functionalities, including compile-time input validation, multi-signal correlations and cross-filtering, and even privacy rules enforcement. We present a collection of design goals and demonstrate how schema-first approach provides better trade-offs than many of the existing solutions in the industry.

READ FULL TEXT

page 6

page 7

research
03/30/2020

Making Metadata Fit for Next Generation Language Technology Platforms: The Metadata Schema of the European Language Grid

The current scientific and technological landscape is characterised by t...
research
06/08/2022

Towards Schema Inference for Data Lakes

A data lake is a repository of data with potential for future analysis. ...
research
07/31/2023

A Modular Ontology for MODS – Metadata Object Description Schema

The Metadata Object Description Schema (MODS) was developed to describe ...
research
07/06/2023

JSONoid: Monoid-based Enrichment for Configurable and Scalable Data-Driven Schema Discovery

Schema discovery is an important aspect to working with data in formats ...
research
06/15/2019

A formal approach for customization of schema.org based on SHACL

Schema.org is a widely adopted vocabulary for semantic annotation of con...
research
05/05/2022

GreenDB: Toward a Product-by-Product Sustainability Database

The production, shipping, usage, and disposal of consumer goods have a s...

Please sign up or login with your details

Forgot password? Click here to reset