Selective Inference for Testing Trees and Edges in Phylogenetics

02/13/2019
by   Hidetoshi Shimodaira, et al.
0

Selective inference is considered for testing trees and edges in phylogenetic tree selection from molecular sequences. This improves the previously proposed approximately unbiased test by adjusting the selection bias when testing many trees and edges at the same time. The newly proposed selective inference p-value is useful for testing selected edges to claim that they are significantly supported if p>1-α, whereas the non-selective p-value is still useful for testing candidate trees to claim that they are rejected if p<α. The selective p-value controls the type-I error conditioned on the selection event, whereas the non-selective p-value controls it unconditionally. The selective and non-selective approximately unbiased p-values are computed from two geometric quantities called signed distance and mean curvature of the region representing tree or edge of interest in the space of probability distributions. These two geometric quantities are estimated by fitting a scaling-law model to the non-parametric multiscale bootstrap probabilities. For better understanding the geometric nature of the problem, a visualization of probability distributions is presented. Our general method is applicable to a wider class of problems; phylogenetic tree selection is an example of model selection, and it is interpreted as the variable selection of multiple regression, where each edge corresponds to each predictor. Our method is illustrated in a previously controversial phylogenetic analysis of human, rabbit and mouse.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2019

Selective inference after variable selection via multiscale bootstrap

A general resampling approach is considered for selective inference prob...
research
11/02/2017

Selective inference for the problem of regions via multiscale bootstrap

Selective inference procedures are considered for computing approximatel...
research
06/15/2021

Tree-Values: selective inference for regression trees

We consider conducting inference on the output of the Classification and...
research
04/15/2019

Cramer-Rao Bound for Estimation After Model Selection and its Application to Sparse Vector Estimation

In many practical parameter estimation problems, such as coefficient est...
research
06/24/2023

Selective inference using randomized group lasso estimators for general models

Selective inference methods are developed for group lasso estimators for...
research
09/18/2018

Testing Selective Influence Directly Using Trackball Movement Tasks

Systems factorial technology (SFT; Townsend & Nozawa, 1995) is regarded ...
research
01/15/2023

Selective Inference with Distributed Data

Nowadays, big datasets are spread over many machines which compute in pa...

Please sign up or login with your details

Forgot password? Click here to reset