Neural Networks beyond explainability: Selective inference for sequence motifs

12/23/2022
by   Antoine Villié, et al.
0

Over the past decade, neural networks have been successful at making predictions from biological sequences, especially in the context of regulatory genomics. As in other fields of deep learning, tools have been devised to extract features such as sequence motifs that can explain the predictions made by a trained network. Here we intend to go beyond explainable machine learning and introduce SEISM, a selective inference procedure to test the association between these extracted features and the predicted phenotype. In particular, we discuss how training a one-layer convolutional network is formally equivalent to selecting motifs maximizing some association score. We adapt existing sampling-based selective inference procedures by quantizing this selection over an infinite set to a large but finite grid. Finally, we show that sampling under a specific choice of parameters is sufficient to characterize the composite null hypothesis typically used for selective inference-a result that goes well beyond our particular framework. We illustrate the behavior of our method in terms of calibration, power and speed and discuss its power/speed trade-off with a simpler data-split strategy. SEISM paves the way to an easier analysis of neural networks used in regulatory genomics, and to more powerful methods for genome wide association studies (GWAS).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/02/2023

Selective Conformal Inference with FCR Control

Conformal inference is a popular tool for constructing prediction interv...
research
11/16/2019

Selective sampling for accelerating training of deep neural networks

We present a selective sampling method designed to accelerate the traini...
research
11/19/2022

Gumbel-Softmax Selective Networks

ML models often operate within the context of a larger system that can a...
research
06/09/2021

Ghosts in Neural Networks: Existence, Structure and Role of Infinite-Dimensional Null Space

Overparametrization has been remarkably successful for deep learning stu...
research
07/27/2022

Conditional Versus Unconditional Approaches to Selective Inference

We investigate a class of methods for selective inference that condition...
research
12/25/2022

Exact Selective Inference with Randomization

We introduce a pivot for exact selective inference with randomization. N...
research
09/29/2020

Selective Cascade of Residual ExtraTrees

We propose a novel tree-based ensemble method named Selective Cascade of...

Please sign up or login with your details

Forgot password? Click here to reset