Using the Gene Ontology Hierarchy when Predicting Gene Function

05/09/2012
by Sara Mostafavi, et al.

Multilabel classification in which the labels are related through a hierarchical categorization scheme arises in many application domains, including computational biology. For example, it arises naturally when automatically assigning gene function using a controlled vocabulary such as the Gene Ontology. However, most existing approaches to gene function prediction treat each function category as a separate classification problem, predicting the genes involved in that category independently of all other categories. Here, we propose two simple methods for incorporating information about the hierarchical nature of the categorization scheme. In the first method, we use information about a gene's previous annotation to set an initial prior on its label. In the second, we extend a graph-based semi-supervised learning algorithm to predict gene function in a hierarchy, and we show that the resulting problem can be solved efficiently as a linear system of equations. We compare these approaches with a previous label reconciliation-based approach. Results show that using the hierarchy information directly, rather than through reconciliation methods, improves gene function prediction.
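To make the "linear system" remark concrete, here is a minimal sketch of generic graph-based label propagation for a single function category. It is not the hierarchical extension proposed in the paper, whose exact formulation is not given in the abstract. It assumes the common objective of minimizing sum_i (f_i - y_i)^2 + f^T L f, where L = D - W is the graph Laplacian of a gene association network W; the minimizer satisfies (I + L) f = y. The names `propagate_labels` and `solve_linear_system`, the toy network, and the label values are all hypothetical.

```python
def solve_linear_system(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [A | b]; copy rows so the inputs are not modified.
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        # Swap in the row with the largest pivot for numerical stability.
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def propagate_labels(W, y):
    """Return discriminant scores f solving (I + L) f = y, with L = D - W.

    W: symmetric, non-negative gene-gene association weights (assumed).
    y: label bias per gene, e.g. +1 positive, -1 negative, 0 unlabeled.
       A prior derived from earlier annotations could replace the 0 entries;
       how the paper sets that prior is not specified in the abstract.
    """
    n = len(W)
    A = [[-W[i][j] for j in range(n)] for i in range(n)]
    for i in range(n):
        A[i][i] += 1.0 + sum(W[i])  # identity plus the degree term of L
    return solve_linear_system(A, y)


# Toy network: four genes connected in a chain 0 - 1 - 2 - 3.
W = [
    [0.0, 1.0, 0.0, 0.0],
    [1.0, 0.0, 1.0, 0.0],
    [0.0, 1.0, 0.0, 1.0],
    [0.0, 0.0, 1.0, 0.0],
]
y = [1.0, 0.0, 0.0, -1.0]  # gene 0 positive, gene 3 negative, 1 and 2 unlabeled
scores = propagate_labels(W, y)
print(scores)
```

In this toy case the unlabeled gene next to the positive example receives a positive score and the one next to the negative example receives a negative score. For realistic network sizes a sparse iterative solver such as conjugate gradient would normally replace the dense elimination used here.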


