GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge

01/19/2022
by   Arthi Padmanabhan, et al.
4

Video analytics pipelines have steadily shifted to edge deployments to reduce bandwidth overheads and privacy violations, but in doing so, face an ever-growing resource tension. Most notably, edge-box GPUs lack the memory needed to concurrently house the growing number of (increasingly complex) models for real-time inference. Unfortunately, existing solutions that rely on time/space sharing of GPU resources are insufficient as the required swapping delays result in unacceptable frame drops and accuracy violations. We present model merging, a new memory management technique that exploits architectural similarities between edge vision models by judiciously sharing their layers (including weights) to reduce workload memory costs and swapping delays. Our system, GEMEL, efficiently integrates merging into existing pipelines by (1) leveraging several guiding observations about per-model memory usage and inter-layer dependencies to quickly identify fruitful and accuracy-preserving merging configurations, and (2) altering edge inference schedules to maximize merging benefits. Experiments across diverse workloads reveal that GEMEL reduces memory usage by up to 60.7 relative to time/space sharing alone.

READ FULL TEXT
research
04/04/2023

MadEye: Boosting Live Video Analytics Accuracy with Adaptive Camera Configurations

Camera orientations (i.e., rotation and zoom) govern the content that a ...
research
06/29/2022

Turbo: Opportunistic Enhancement for Edge Video Analytics

Edge computing is being widely used for video analytics. To alleviate th...
research
12/19/2020

Ekya: Continuous Learning of Video Analytics Models on Edge Compute Servers

Video analytics applications use edge compute servers for the analytics ...
research
08/03/2018

Edge Based Data-Driven Pipelines (Technical Report)

This research reports investigates an edge on-device stream processing p...
research
09/28/2020

Accelerating Multi-Model Inference by Merging DNNs of Different Weights

Standardized DNN models that have been proved to perform well on machine...
research
09/09/2017

Can you tell a face from a HEVC bitstream?

Image and video analytics are being increasingly used on a massive scale...
research
05/18/2021

Towards Performance Clarity of Edge Video Analytics

Edge video analytics is becoming the solution to many safety and managem...

Please sign up or login with your details

Forgot password? Click here to reset