STREAMLINE: Streaming Active Learning for Realistic Multi-Distributional Settings

05/18/2023
by   Nathan Beck, et al.
0

Deep neural networks have consistently shown great performance in several real-world use cases like autonomous vehicles, satellite imaging, etc., effectively leveraging large corpora of labeled training data. However, learning unbiased models depends on building a dataset that is representative of a diverse range of realistic scenarios for a given task. This is challenging in many settings where data comes from high-volume streams, with each scenario occurring in random interleaved episodes at varying frequencies. We study realistic streaming settings where data instances arrive in and are sampled from an episodic multi-distributional data stream. Using submodular information measures, we propose STREAMLINE, a novel streaming active learning framework that mitigates scenario-driven slice imbalance in the working labeled data via a three-step procedure of slice identification, slice-aware budgeting, and data selection. We extensively evaluate STREAMLINE on real-world streaming scenarios for image classification and object detection tasks. We observe that STREAMLINE improves the performance on infrequent yet critical slices of the data over current baselines by up to 5% in terms of accuracy on our image classification tasks and by up to 8% in terms of mAP on our object detection tasks.

READ FULL TEXT

page 2

page 3

page 5

research
07/01/2021

SIMILAR: Submodular Information Measures Based Active Learning In Realistic Scenarios

Active learning has proven to be useful for minimizing labeling costs by...
research
11/30/2021

TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information

Deep neural networks based object detectors have shown great success in ...
research
06/17/2022

Active Data Discovery: Mining Unknown Data using Submodular Information Measures

Active Learning is a very common yet powerful framework for iteratively ...
research
01/16/2018

Localization-Aware Active Learning for Object Detection

Active learning - a class of algorithms that iteratively searches for th...
research
06/21/2021

Active Learning for Deep Neural Networks on Edge Devices

When dealing with deep neural network (DNN) applications on edge devices...
research
03/22/2023

Re-thinking Federated Active Learning based on Inter-class Diversity

Although federated learning has made awe-inspiring advances, most studie...
research
05/13/2022

Detecting Rumours with Latency Guarantees using Massive Streaming Data

Today's social networks continuously generate massive streams of data, w...

Please sign up or login with your details

Forgot password? Click here to reset