Improving Accuracy of Permutation DAG Search using Best Order Score Search

08/17/2021
by   Joseph D. Ramsey, et al.
0

The Sparsest Permutation (SP) algorithm is accurate but limited to about 9 variables in practice; the Greedy Sparest Permutation (GSP) algorithm is faster but less weak theoretically. A compromise can be given, the Best Order Score Search, which gives results as accurate as SP but for much larger and denser graphs. BOSS (Best Order Score Search) is more accurate for two reason: (a) It assumes the "brute faithfuness" assumption, which is weaker than faithfulness, and (b) it uses a different traversal of permutations than the depth first traversal used by GSP, obtained by taking each variable in turn and moving it to the position in the permutation that optimizes the model score. Results are given comparing BOSS to several related papers in the literature in terms of performance, for linear, Gaussian data. In all cases, with the proper parameter settings, accuracy of BOSS is lifted considerably with respect to competing approaches. In configurations tested, models with 60 variables are feasible with large samples out to about an average degree of 12 in reasonable time, with near-perfect accuracy, and sparse models with an average degree of 4 are feasible out to about 300 variables on a laptop, again with near-perfect accuracy. Mixed continuous discrete and all-discrete datasets were also tested. The mixed data analysis showed advantage for BOSS over GES more apparent at higher depths with the same score; the discrete data analysis showed a very small advantage for BOSS over GES with the same score, perhaps not enough to prefer it.

READ FULL TEXT
research
07/28/2015

Scaling up Greedy Causal Search for Continuous Variables

As standardly implemented in R or the Tetrad program, causal search algo...
research
06/11/2022

Greedy Relaxations of the Sparsest Permutation Algorithm

There has been an increasing interest in methods that exploit permutatio...
research
09/14/2023

Statistically Valid Variable Importance Assessment through Conditional Permutations

Variable importance assessment has become a crucial step in machine-lear...
research
11/06/2020

Efficient Permutation Discovery in Causal DAGs

The problem of learning a directed acyclic graph (DAG) up to Markov equi...
research
05/10/2020

HNet: Graphical Hypergeometric Networks

Motivation: Real-world data often contain measurements with both continu...
research
12/09/2011

Information and Search in Computer Chess

The article describes a model of chess based on information theory. A ma...

Please sign up or login with your details

Forgot password? Click here to reset