NNStreamer: Efficient and Agile Development of On-Device AI Systems

01/16/2021
by   MyungJoo Ham, et al.
0

We propose NNStreamer, a software system that handles neural networks as filters of stream pipelines, applying the stream processing paradigm to deep neural network applications. A new trend with the wide-spread of deep neural network applications is on-device AI. It is to process neural networks on mobile devices or edge/IoT devices instead of cloud servers. Emerging privacy issues, data transmission costs, and operational costs signify the need for on-device AI, especially if we deploy a massive number of devices. NNStreamer efficiently handles neural networks with complex data stream pipelines on devices, significantly improving the overall performance with minimal efforts. Besides, NNStreamer simplifies implementations and allows reusing off-the-shelf media filters directly, which reduces developmental costs significantly. We are already deploying NNStreamer for a wide range of products and platforms, including the Galaxy series and various consumer electronic devices. The experimental results suggest a reduction in developmental costs and enhanced performance of pipeline architectures and NNStreamer. It is an open-source project incubated by Linux Foundation AI, available to the public and applicable to various hardware and software platforms.

READ FULL TEXT
research
01/12/2019

NNStreamer: Stream Processing Paradigm for Neural Networks, Toward Efficient Development and Execution of On-Device AI Applications

We propose nnstreamer, a software system that handles neural networks as...
research
01/16/2022

Toward Among-Device AI from On-Device AI with Stream Pipelines

Modern consumer electronic devices often provide intelligence services w...
research
06/09/2022

NNTrainer: Light-Weight On-Device Training Framework

Modern consumer electronic devices have adopted deep learning-based inte...
research
02/10/2022

A VM/Containerized Approach for Scaling TinyML Applications

Although deep neural networks are typically computationally expensive to...
research
03/18/2019

AttoNets: Compact and Efficient Deep Neural Networks for the Edge via Human-Machine Collaborative Design

While deep neural networks have achieved state-of-the-art performance ac...
research
02/15/2022

CQELS 2.0: Towards A Unified Framework for Semantic Stream Fusion

We present CQELS 2.0, the second version of Continuous Query Evaluation ...
research
04/21/2023

Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations

The rapid development and application of foundation models have revoluti...

Please sign up or login with your details

Forgot password? Click here to reset