A hybrid model-based and learning-based approach for classification using limited number of training samples

06/25/2021
by   Alireza Nooraiepour, et al.
0

The fundamental task of classification given a limited number of training data samples is considered for physical systems with known parametric statistical models. The standalone learning-based and statistical model-based classifiers face major challenges towards the fulfillment of the classification task using a small training set. Specifically, classifiers that solely rely on the physics-based statistical models usually suffer from their inability to properly tune the underlying unobservable parameters, which leads to a mismatched representation of the system's behaviors. Learning-based classifiers, on the other hand, typically rely on a large number of training data from the underlying physical process, which might not be feasible in most practical scenarios. In this paper, a hybrid classification method – termed HyPhyLearn – is proposed that exploits both the physics-based statistical models and the learning-based classifiers. The proposed solution is based on the conjecture that HyPhyLearn would alleviate the challenges associated with the individual approaches of learning-based and statistical model-based classifiers by fusing their respective strengths. The proposed hybrid approach first estimates the unobservable model parameters using the available (suboptimal) statistical estimation procedures, and subsequently use the physics-based statistical models to generate synthetic data. Then, the training data samples are incorporated with the synthetic data in a learning-based classifier that is based on domain-adversarial training of neural networks. Specifically, in order to address the mismatch problem, the classifier learns a mapping from the training data and the synthetic data to a common feature space. Simultaneously, the classifier is trained to find discriminative features within this space in order to fulfill the classification task.

READ FULL TEXT
research
12/13/2017

GMM-Based Synthetic Samples for Classification of Hyperspectral Images With Limited Training Data

The amount of training data that is required to train a classifier scale...
research
11/05/2018

Active Deep Learning Attacks under Strict Rate Limitations for Online API Calls

Machine learning has been applied to a broad range of applications and s...
research
07/31/2019

Few-Shot Meta-Denoising

We study the problem of learning-based denoising where the training set ...
research
07/03/2021

Learning from scarce information: using synthetic data to classify Roman fine ware pottery

In this article we consider a version of the challenging problem of lear...
research
10/11/2019

Statistical Linear Models in Virus Genomic Alignment-free Classification: Application to Hepatitis C Viruses

Viral sequence classification is an important task in pathogen detection...
research
01/28/2020

Identifying Mislabeled Data using the Area Under the Margin Ranking

Not all data in a typical training set help with generalization; some sa...
research
07/27/2022

Statistical Keystroke Synthesis for Improved Bot Detection

This work proposes two statistical approaches for the synthesis of keyst...

Please sign up or login with your details

Forgot password? Click here to reset