Data Lifecycle Management in Evolving Input Distributions for Learning-based Aerospace Applications

09/14/2022
by   Somrita Banerjee, et al.
0

As input distributions evolve over a mission lifetime, maintaining performance of learning-based models becomes challenging. This paper presents a framework to incrementally retrain a model by selecting a subset of test inputs to label, which allows the model to adapt to changing input distributions. Algorithms within this framework are evaluated based on (1) model performance throughout mission lifetime and (2) cumulative costs associated with labeling and model retraining. We provide an open-source benchmark of a satellite pose estimation model trained on images of a satellite in space and deployed in novel scenarios (e.g., different backgrounds or misbehaving pixels), where algorithms are evaluated on their ability to maintain high performance by retraining on a subset of inputs. We also propose a novel algorithm to select a diverse subset of inputs for labeling, by characterizing the information gain from an input using Bayesian uncertainty quantification and choosing a subset that maximizes collective information gain using concepts from batch active learning. We show that our algorithm outperforms others on the benchmark, e.g., achieves comparable performance to an algorithm that labels 100 while only labeling 50 over the mission lifetime.

READ FULL TEXT
research
06/07/2017

Active Learning for Structured Prediction from Partially Labeled Data

We propose a general purpose active learning algorithm for structured pr...
research
05/24/2021

Cost-Accuracy Aware Adaptive Labeling for Active Learning

Conventional active learning algorithms assume a single labeler that pro...
research
05/28/2018

Learning From Less Data: Diversified Subset Selection and Active Learning in Image Classification Tasks

Supervised machine learning based state-of-the-art computer vision techn...
research
10/27/2020

Active Learning for Noisy Data Streams Using Weak and Strong Labelers

Labeling data correctly is an expensive and challenging task in machine ...
research
12/27/2021

Active Learning with Pseudo-Labels for Multi-View 3D Pose Estimation

Pose estimation of the human body/hand is a fundamental problem in compu...
research
05/02/2022

Simple Techniques Work Surprisingly Well for Neural Network Test Prioritization and Active Learning (Replicability Study)

Test Input Prioritizers (TIP) for Deep Neural Networks (DNN) are an impo...
research
03/22/2022

Frugal Learning of Virtual Exemplars for Label-Efficient Satellite Image Change Detection

In this paper, we devise a novel interactive satellite image change dete...

Please sign up or login with your details

Forgot password? Click here to reset