Quantifying and Extrapolating Data Needs in Radio Frequency Machine Learning

05/07/2022
by   William H. Clark IV, et al.
0

Understanding the relationship between training data and a model's performance once deployed is a fundamental component in the application of machine learning. While the model's deployed performance is dependent on numerous variables within the scope of machine learning, beyond that of the training data itself, the effect of the dataset is isolated in this work to better understand the role training data plays in the problem. This work examines a modulation classification problem in the Radio Frequency domain space, attempting to answer the question of how much training data is required to achieve a desired level of performance, but the procedure readily applies to classification problems across modalities. By repurposing the metrics of transfer potential developed within transfer learning an approach to bound data quantity needs developed given a training approach and machine learning architecture; this approach is presented as a means to estimate data quantity requirements to achieve a target performance. While this approach will require an initial dataset that is germane to the problem space to act as a target dataset on which metrics are extracted, the goal is to allow for the initial data to be orders of magnitude smaller than what is required for delivering a system that achieves the desired performance. An additional benefit of the techniques presented here is that the quality of different datasets can be numerically evaluated and tied together with the quantity of data, and the performance of the system.

READ FULL TEXT
research
10/01/2020

Training Data Augmentation for Deep Learning RF Systems

Applications of machine learning are subject to three major components t...
research
10/18/2019

Machine learning Calabi-Yau metrics

We apply machine learning to the problem of finding numerical Calabi-Yau...
research
10/06/2022

Data-driven Approaches to Surrogate Machine Learning Model Development

We demonstrate the adaption of three established methods to the field of...
research
07/20/2022

Large Scale Radio Frequency Signal Classification

Existing datasets used to train deep learning models for narrowband radi...
research
11/11/2022

Striving for data-model efficiency: Identifying data externalities on group performance

Building trustworthy, effective, and responsible machine learning system...
research
12/11/2020

Data Appraisal Without Data Sharing

One of the most effective approaches to improving the performance of a m...
research
12/30/2019

Quantifying the Performance of Federated Transfer Learning

The scarcity of data and isolated data islands encourage different organ...

Please sign up or login with your details

Forgot password? Click here to reset