Machine learning can guide experimental approaches for protein digestibility estimations

11/01/2022
by   Sara Malvar, et al.
0

Food protein digestibility and bioavailability are critical aspects in addressing human nutritional demands, particularly when seeking sustainable alternatives to animal-based proteins. In this study, we propose a machine learning approach to predict the true ileal digestibility coefficient of food items. The model makes use of a unique curated dataset that combines nutritional information from different foods with FASTA sequences of some of their protein families. We extracted the biochemical properties of the proteins and combined these properties with embeddings from a Transformer-based protein Language Model (pLM). In addition, we used SHAP to identify features that contribute most to the model prediction and provide interpretability. This first AI-based model for predicting food protein digestibility has an accuracy of 90 model can eliminate the need for lengthy in-vivo or in-vitro experiments, making the process of creating new foods faster, cheaper, and more ethical.

READ FULL TEXT

page 26

page 30

research
04/09/2021

Protein sequence design with deep generative models

Protein engineering seeks to identify protein sequences with optimized p...
research
06/09/2023

PoET: A generative model of protein families as sequences-of-sequences

Generative protein language models are a natural way to design new prote...
research
11/22/2019

Machine learning for protein folding and dynamics

Many aspects of the study of protein folding and dynamics have been affe...
research
10/27/2021

MutFormer: A context-dependent transformer-based model to predict pathogenic missense mutations

A missense mutation is a point mutation that results in a substitution o...
research
01/26/2021

Transforming India's Agricultural Sector using Ontology-based Tantra Framework

Food production is a critical activity in which every nation would like ...
research
02/08/2018

mGPfusion: Predicting protein stability changes with Gaussian process kernel learning and data fusion

Proteins are commonly used by biochemical industry for numerous processe...
research
11/08/2021

AI challenges for predicting the impact of mutations on protein stability

Stability is a key ingredient of protein fitness and its modification th...

Please sign up or login with your details

Forgot password? Click here to reset