Approximate Edge Analytics for the IoT Ecosystem

05/15/2018
by Zhenyu Wen, et al.

IoT-enabled devices continue to generate massive amounts of data. Transforming this continuously arriving raw data into timely insights is critical for many modern online services. In such settings, traditional data analytics over the entire dataset is too limiting and expensive to support real-time stream analytics. In this work, we make a case for approximate computing for data analytics in IoT settings. Approximate computing aims for efficient execution of workflows where an approximate output is sufficient instead of the exact output. The idea behind approximate computing is to compute over a representative sample instead of the entire input dataset. Thus, approximate computing, based on the chosen sample size, can make a systematic trade-off between output accuracy and computation efficiency. This motivated the design of APPROXIOT, a data analytics system for approximate computing in IoT. To realize this idea, we designed an online hierarchical stratified reservoir sampling algorithm that uses edge computing resources to produce approximate output with rigorous error bounds. To showcase the effectiveness of our algorithm, we implemented APPROXIOT based on Apache Kafka and evaluated it using a set of microbenchmarks and real-world case studies. Our results show that APPROXIOT achieves a speedup of 1.3X-9.9X with varying sampling fraction of 80 compared to simple random sampling.
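
To make the sampling idea concrete, the following is a minimal sketch of per-stratum reservoir sampling with a weighted sum estimate. It is not the APPROXIOT algorithm itself: the hierarchical (multi-level edge) part and the error-bound computation are omitted, and the class name `StratifiedReservoir`, its methods, and the `capacity` parameter are hypothetical names chosen for illustration. Each stratum could stand for one sensor or data source.

```python
import random
from collections import defaultdict


class StratifiedReservoir:
    """Illustrative only: one fixed-size uniform reservoir per stratum."""

    def __init__(self, capacity, seed=None):
        self.capacity = capacity              # max sampled items per stratum
        self.rng = random.Random(seed)
        self.samples = defaultdict(list)      # stratum -> sampled values
        self.seen = defaultdict(int)          # stratum -> items observed so far

    def add(self, stratum, value):
        """Process one arriving item (classic reservoir sampling, Algorithm R)."""
        self.seen[stratum] += 1
        reservoir = self.samples[stratum]
        if len(reservoir) < self.capacity:
            reservoir.append(value)
        else:
            # Keep the new item with probability capacity / seen by
            # overwriting a uniformly chosen slot.
            j = self.rng.randrange(self.seen[stratum])
            if j < self.capacity:
                reservoir[j] = value

    def estimate_sum(self):
        """Estimate the sum over all items by scaling each stratum's sample."""
        total = 0.0
        for stratum, reservoir in self.samples.items():
            weight = self.seen[stratum] / len(reservoir)
            total += weight * sum(reservoir)
        return total


if __name__ == "__main__":
    sampler = StratifiedReservoir(capacity=100, seed=42)
    data_rng = random.Random(7)
    exact = 0.0
    for _ in range(10_000):
        stratum = data_rng.choice(["sensor-a", "sensor-b"])
        value = data_rng.uniform(0.0, 10.0)
        exact += value
        sampler.add(stratum, value)
    print("exact sum:    ", round(exact, 1))
    print("estimated sum:", round(sampler.estimate_sum(), 1))
```

Sampling each stratum separately keeps low-volume sources represented in the sample, and the per-stratum weight (items seen divided by items kept) scales the sampled sum back up to an estimate for the full stream. A smaller `capacity` lowers memory and compute cost at the price of a noisier estimate, which is the accuracy/efficiency trade-off described above.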


