Iterative Peptide Modeling With Active Learning And Meta-Learning

by   Rainier Barrett, et al.

Often the development of novel materials is not amenable to high-throughput or purely computational screening methods. Instead, materials must be synthesized one at a time in a process that does not generate significant amounts of data. One way this method can be improved is by ensuring that each experiment provides the best improvement in both material properties and predictive modeling accuracy. In this work, we study the effectiveness of active learning, which optimizes the order of experiments, and meta learning, which transfers knowledge from one context to another, to reduce the number of experiments necessary to build a predictive model. We present a novel multi-task benchmark database of peptides designed to advance active, few-shot, and meta-learning methods for experimental design. Each task is binary classification of peptides represented as a sequence string. We show results of standard active learning and meta-learning methods across these datasets to assess their ability to improve predictive models with the fewest number of experiments. We find the ensemble query by committee active learning method to be effective. The meta-learning method Reptile was found to improve accuracy. The robustness of these conclusions were tested across multiple model choices.


page 1

page 2

page 3

page 4


Learning to Sample: an Active Learning Framework

Meta-learning algorithms for active learning are emerging as a promising...

Meta-Active Learning for Node Response Prediction in Graphs

Meta-learning is an important approach to improve machine learning perfo...

Meta-QSAR: a large-scale application of meta-learning to drug design and discovery

We investigate the learning of quantitative structure activity relations...

Stochastic Deep Networks with Linear Competing Units for Model-Agnostic Meta-Learning

This work addresses meta-learning (ML) by considering deep networks with...

Learn to Explore: on Bootstrapping Interactive Data Exploration with Meta-learning

Interactive data exploration (IDE) is an effective way of comprehending ...

Assessing the Frontier: Active Learning, Model Accuracy, and Multi-objective Materials Discovery and Optimization

Discovering novel materials can be greatly accelerated by iterative mach...

Enabling Reproducibility and Meta-learning Through a Lifelong Database of Experiments (LDE)

Artificial Intelligence (AI) development is inherently iterative and exp...

Please sign up or login with your details

Forgot password? Click here to reset