ChemVise: Maximizing Out-of-Distribution Chemical Detection with the Novel Application of Zero-Shot Learning

02/09/2023
by   Alexander M. Moore, et al.

Accurate chemical sensors are vital in medical, military, and home safety applications. Training machine learning models to be accurate on real-world chemical sensor data requires performing many diverse, costly experiments in controlled laboratory settings to create a data set. In practice, even large, expensive data sets may be insufficient for a trained model to generalize to a real-world testing distribution. Rather than performing greater numbers of experiments requiring exhaustive mixtures of chemical analytes, this research proposes learning approximations of complex exposures from training sets of simple ones, using single-analyte exposure signals as building blocks of a multiple-analyte space. We demonstrate that this approach to generating synthetic sensor responses surprisingly improves the detection of out-of-distribution obscured chemical analytes. Further, we pair these synthetic signals with targets in an information-dense representation space that draws on a large corpus of chemistry knowledge. By using a semantically meaningful analyte representation space along with synthetic targets, we achieve rapid analyte classification in the presence of obscurants without corresponding obscured-analyte training data. Transfer learning for supervised learning with molecular representations makes assumptions about the input data. Instead, we borrow from the natural language and natural image processing literature for a novel approach to chemical sensor signal classification that uses molecular semantics and applies to arbitrary chemical sensor hardware designs.
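The two ideas in the abstract, composing multi-analyte signals from single-analyte ones and classifying by proximity in an analyte representation space, can be illustrated with a minimal sketch. This is not the authors' implementation. The weighted-sum combination rule, the cosine-similarity lookup, the function names, and the toy analyte names and vectors are all assumptions made for illustration, since the abstract does not specify them.

```python
import math


def synthesize_mixture(signals, weights):
    """Combine single-analyte signals into one synthetic multi-analyte signal.

    Assumption: a weighted sum (linear superposition). The abstract does not
    state the actual combination rule.
    """
    length = len(signals[0])
    if any(len(s) != length for s in signals):
        raise ValueError("all signals must have the same length")
    if len(weights) != len(signals):
        raise ValueError("need exactly one weight per signal")
    return [
        sum(w * s[i] for s, w in zip(signals, weights))
        for i in range(length)
    ]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def nearest_analyte(predicted_embedding, analyte_embeddings):
    """Return the analyte name whose embedding is most similar to the prediction.

    Because the lookup only needs an embedding per candidate analyte, a
    candidate can be added without any sensor training data for it, which is
    the zero-shot aspect this sketch is meant to show.
    """
    return max(
        analyte_embeddings,
        key=lambda name: cosine_similarity(
            predicted_embedding, analyte_embeddings[name]
        ),
    )


# Toy data only: hypothetical single-analyte responses and embeddings.
mixture = synthesize_mixture([[1.0, 0.0, 2.0], [0.0, 3.0, 1.0]], [0.5, 2.0])
label = nearest_analyte(
    [0.9, 0.1],
    {"ethanol": [1.0, 0.0], "acetone": [0.0, 1.0]},
)
```

In a full pipeline, a trained model would map a sensor signal such as `mixture` to the predicted embedding passed to `nearest_analyte`. That model is omitted here, so the predicted embedding is supplied by hand.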


