Dataset Culling: Towards Efficient Training Of Distillation-Based Domain Specific Models

02/01/2019
by Kentaro Yoshioka, et al.

Real-time CNN-based object detection models for applications like surveillance can achieve high accuracy but require extensive computation. Recent work has shown a 10 to 100x reduction in computation cost with domain-specific network settings. However, this prior work focused on inference only: if the domain network requires frequent retraining, training and retraining costs can become a significant bottleneck. To address training costs, we propose Dataset Culling: a pipeline that significantly reduces the training dataset size required for domain-specific models. Dataset Culling reduces the dataset size by filtering out data that is non-essential for training and by reducing the size of each image until detection degrades. Both operations use a confusion loss metric, which lets us execute the culling with minimal computation overhead. On a custom long-duration dataset, we show that Dataset Culling can reduce training costs by 47x with no accuracy loss, or even with slight improvements. Code is available at https://github.com/kentaroy47/DatasetCulling
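
The filtering step described in the abstract can be illustrated with a minimal sketch. The abstract does not define the confusion loss, so the scoring function below is an assumption for illustration only: it treats a detection as most "confusing" when its confidence is near 0.5 and keeps the images with the highest total score. The names `confusion_score`, `cull_dataset`, `power`, and `keep` are hypothetical and are not taken from the authors' repository.

```python
from typing import List, Sequence


def confusion_score(confidences: Sequence[float], power: float = 2.0) -> float:
    """Assumed per-image score (not the paper's exact metric).

    Each detection contributes 0.5**power - |c - 0.5|**power, which is
    largest at confidence 0.5 and falls to zero at confidence 0 or 1.
    """
    half = 0.5 ** power
    return sum(half - abs(c - 0.5) ** power for c in confidences)


def cull_dataset(per_image_confidences: List[Sequence[float]], keep: int) -> List[int]:
    """Return the indices of the `keep` highest-scoring images, in original order."""
    scores = [confusion_score(c) for c in per_image_confidences]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:keep])


# Hypothetical detector confidences for four frames.
frames = [[0.99, 0.98], [0.5, 0.6], [], [0.45]]
selected = cull_dataset(frames, keep=2)
print(selected)
```

Under this assumed score, frames with only confident detections or with no detections rank lowest, so they are the first to be dropped from the training set.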

research
11/06/2018

Training Domain Specific Models for Energy-Efficient Object Detection

We propose an end-to-end framework for training domain specific models (...
research
10/04/2018

Domain Specific Approximation for Object Detection

There is growing interest in object detection in advanced driver assista...
research
09/19/2023

Decoupled Training: Return of Frustratingly Easy Multi-Domain Learning

Multi-domain learning (MDL) aims to train a model with minimal average r...
research
12/28/2021

Deep-CNN based Robotic Multi-Class Under-Canopy Weed Control in Precision Farming

Smart weeding systems to perform plant-specific operations can contribut...
research
04/26/2022

Bamboo: Making Preemptible Instances Resilient for Affordable Training of Large DNNs

DNN models across many domains continue to grow in size, resulting in hi...
research
04/02/2022

Online Convolutional Re-parameterization

Structural re-parameterization has drawn increasing attention in various...
research
06/11/2020

JIT-Masker: Efficient Online Distillation for Background Matting

We design a real-time portrait matting pipeline for everyday use, partic...
