Supervised Feature Selection in Graphs with Path Coding Penalties and Network Flows

04/20/2012
by Julien Mairal, et al.

We consider supervised learning problems where the features are embedded in a graph, such as gene expressions in a gene network. In this context, it is of much interest to automatically select a subgraph with few connected components; by exploiting prior knowledge, one can indeed improve the prediction performance or obtain results that are easier to interpret. Regularization or penalty functions for selecting features in graphs have recently been proposed, but they raise new algorithmic challenges. For example, they typically require solving a combinatorially hard selection problem among all connected subgraphs. In this paper, we propose computationally feasible strategies to select a sparse and well-connected subset of features sitting on a directed acyclic graph (DAG). We introduce structured sparsity penalties over paths on a DAG called "path coding" penalties. Unlike existing regularization functions that model long-range interactions between features in a graph, path coding penalties are tractable. The penalties and their proximal operators involve path selection problems, which we efficiently solve by leveraging network flow optimization. We experimentally show on synthetic, image, and genomic data that our approach is scalable and leads to more connected subgraphs than other regularization functions for graphs.
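To give a feel for what a "path selection problem" on a DAG looks like, the following is a minimal sketch under stated assumptions, not the authors' algorithm: it does not compute the path coding penalties, their proximal operators, or any network flow. It only finds the single directed path whose summed node scores are largest, using dynamic programming over a topological order. The function names (`topological_order`, `best_path`) and the toy scores are hypothetical and chosen for illustration.

```python
from collections import defaultdict, deque


def topological_order(nodes, edges):
    """Kahn's algorithm: return (order, successor map) for a DAG."""
    indeg = {u: 0 for u in nodes}
    succ = defaultdict(list)
    for u, v in edges:
        succ[u].append(v)
        indeg[v] += 1
    queue = deque(u for u in nodes if indeg[u] == 0)
    order = []
    while queue:
        u = queue.popleft()
        order.append(u)
        for v in succ[u]:
            indeg[v] -= 1
            if indeg[v] == 0:
                queue.append(v)
    if len(order) != len(nodes):
        raise ValueError("graph contains a cycle; a DAG is required")
    return order, succ


def best_path(nodes, edges, score):
    """Return (total, path) for the directed path with the largest summed score.

    `score` maps each node to a real value (hypothetical feature relevance).
    A path may start and end at any node.
    """
    order, succ = topological_order(nodes, edges)
    best = {u: score[u] for u in nodes}   # best total of a path ending at u
    prev = {u: None for u in nodes}       # predecessor on that best path
    for u in order:                       # best[u] is final when u is visited
        for v in succ[u]:
            cand = best[u] + score[v]
            if cand > best[v]:
                best[v] = cand
                prev[v] = u
    end = max(nodes, key=lambda u: best[u])
    path = []
    u = end
    while u is not None:
        path.append(u)
        u = prev[u]
    return best[end], path[::-1]


# Toy DAG: node "b" has a negative score but connects two useful nodes.
nodes = ["a", "b", "c", "d", "e"]
edges = [("a", "b"), ("b", "c"), ("a", "d"), ("d", "e")]
score = {"a": 1.0, "b": -0.5, "c": 2.0, "d": 0.2, "e": 0.1}
total, path = best_path(nodes, edges, score)
```

In this toy example the selected path passes through the weakly scored node "b" because doing so links two strongly scored nodes, which loosely mirrors the preference for connected feature sets described in the abstract.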

research
03/24/2019

DSL: Discriminative Subgraph Learning via Sparse Self-Representation

The goal in network state prediction (NSP) is to classify the global sta...
research
10/01/2022

An experimental study of algorithms for obtaining a singly connected subgraph

A directed graph G = (V,E) is singly connected if for any two vertice...
research
11/11/2019

Towards Gallai's path decomposition conjecture

A path decomposition of a graph G is a collection of edge-disjoint paths...
research
06/04/2021

Top-k Regularization for Supervised Feature Selection

Feature selection identifies subsets of informative features and reduces...
research
04/22/2022

Gene Function Prediction with Gene Interaction Networks: A Context Graph Kernel Approach

Predicting gene functions is a challenge for biologists in the post geno...
research
12/27/2017

Enumerating consistent subgraphs of directed acyclic graphs: an insight into biomedical ontologies

Modern problems of concept annotation associate an object of interest (g...
research
03/20/2014

Sparse Learning over Infinite Subgraph Features

We present a supervised-learning algorithm from graph data (a set of gra...
