DeepProphet2 – A Deep Learning Gene Recommendation Engine

08/03/2022
by   Daniele Brambilla, et al.
0

New powerful tools for tackling life science problems have been created by recent advances in machine learning. The purpose of the paper is to discuss the potential advantages of gene recommendation performed by artificial intelligence (AI). Indeed, gene recommendation engines try to solve this problem: if the user is interested in a set of genes, which other genes are likely to be related to the starting set and should be investigated? This task was solved with a custom deep learning recommendation engine, DeepProphet2 (DP2), which is freely available to researchers worldwide via https://www.generecommender.com?utm_source=DeepProphet2_paper utm_medium=pdf. Hereafter, insights behind the algorithm and its practical applications are illustrated. The gene recommendation problem can be addressed by mapping the genes to a metric space where a distance can be defined to represent the real semantic distance between them. To achieve this objective a transformer-based model has been trained on a well-curated freely available paper corpus, PubMed. The paper describes multiple optimization procedures that were employed to obtain the best bias-variance trade-off, focusing on embedding size and network depth. In this context, the model's ability to discover sets of genes implicated in diseases and pathways was assessed through cross-validation. A simple assumption guided the procedure: the network had no direct knowledge of pathways and diseases but learned genes' similarities and the interactions among them. Moreover, to further investigate the space where the neural network represents genes, the dimensionality of the embedding was reduced, and the results were projected onto a human-comprehensible space. In conclusion, a set of use cases illustrates the algorithm's potential applications in a real word setting.

READ FULL TEXT
research
10/19/2021

MultiHead MultiModal Deep Interest Recommendation Network

With the development of information technology, human beings are constan...
research
12/15/2020

SimpleChrome: Encoding of Combinatorial Effects for Predicting Gene Expression

Due to recent breakthroughs in state-of-the-art DNA sequencing technolog...
research
03/26/2019

A Silver Standard Corpus of Human Phenotype-Gene Relations

Human phenotype-gene relations are fundamental to fully understand the o...
research
07/18/2023

PubMed and Beyond: Biomedical Literature Search in the Age of Artificial Intelligence

Biomedical research yields a wealth of information, much of which is onl...
research
09/10/2020

The use of Recommender Systems in web technology and an in-depth analysis of Cold State problem

In the WWW (World Wide Web), dynamic development and spread of data has ...
research
11/22/2019

Learning Feature Interactions with Lorentzian Factorization Machine

Learning representations for feature interactions to model user behavior...
research
10/16/2017

Convolutional neural networks for structured omics: OmicsCNN and the OmicsConv layer

Convolutional Neural Networks (CNNs) are a popular deep learning archite...

Please sign up or login with your details

Forgot password? Click here to reset