Trustworthiness of Laser-Induced Breakdown Spectroscopy Predictions via Simulation-based Synthetic Data Augmentation and Multitask Learning

10/07/2022
by   Riccardo Finotello, et al.
0

We consider quantitative analyses of spectral data using laser-induced breakdown spectroscopy. We address the small size of training data available, and the validation of the predictions during inference on unknown data. For the purpose, we build robust calibration models using deep convolutional multitask learning architectures to predict the concentration of the analyte, alongside additional spectral information as auxiliary outputs. These secondary predictions can be used to validate the trustworthiness of the model by taking advantage of the mutual dependencies of the parameters of the multitask neural networks. Due to the experimental lack of training samples, we introduce a simulation-based data augmentation process to synthesise an arbitrary number of spectra, statistically representative of the experimental data. Given the nature of the deep learning model, no dimensionality reduction or data selection processes are required. The procedure is an end-to-end pipeline including the process of synthetic data augmentation, the construction of a suitable robust, homoscedastic, deep learning model, and the validation of its predictions. In the article, we compare the performance of the multitask model with traditional univariate and multivariate analyses, to highlight the separate contributions of each element introduced in the process.

READ FULL TEXT

page 32

page 34

research
10/29/2019

Multitask Learning On Graph Neural Networks Applied To Molecular Property Predictions

Prediction of molecular properties, including physico-chemical propertie...
research
12/14/2021

Improving COVID-19 CXR Detection with Synthetic Data Augmentation

Since the beginning of the COVID-19 pandemic, researchers have developed...
research
05/30/2023

Simulation-Aided Deep Learning for Laser Ultrasonic Visualization Testing

In recent years, laser ultrasonic visualization testing (LUVT) has attra...
research
02/25/2022

PromDA: Prompt-based Data Augmentation for Low-Resource NLU Tasks

This paper focuses on the Data Augmentation for low-resource Natural Lan...
research
12/21/2019

A Deep Learning Model for Chilean Bills Classification

Automatic bill classification is an attractive task with many potential ...
research
01/25/2021

Few-Shot Website Fingerprinting Attack

This work introduces a novel data augmentation method for few-shot websi...
research
09/28/2022

Synthesizing Annotated Image and Video Data Using a Rendering-Based Pipeline for Improved License Plate Recognition

An insufficient number of training samples is a common problem in neural...

Please sign up or login with your details

Forgot password? Click here to reset