Predicting conditional probability distributions of redshifts of Active Galactic Nuclei using Hierarchical Correlation Reconstruction

06/13/2022
by   Jarek Duda, et al.
0

While there is a general focus on prediction of values, real data often only allows to predict conditional probability distributions, with capabilities bounded by conditional entropy H(Y|X). If additionally estimating uncertainty, we can treat a predicted value as the center of Gaussian of Laplace distribution - idealization which can be far from complex conditional distributions of real data. This article applies Hierarchical Correlation Reconstruction (HCR) approach to inexpensively predict quite complex conditional probability distributions (e.g. multimodal): by independent MSE estimation of multiple moment-like parameters, which allow to reconstruct the conditional distribution. Using linear regression for this purpose, we get interpretable models: with coefficients describing contributions of features to conditional moments. This article extends on the original approach especially by using Canonical Correlation Analysis (CCA) for feature optimization and l1 "lasso" regularization, focusing on practical problem of prediction of redshift of Active Galactic Nuclei (AGN) based on Fourth Fermi-LAT Data Release 2 (4LAC) dataset.

READ FULL TEXT

page 1

page 2

research
11/04/2019

Modelling bid-ask spread conditional distributions using hierarchical correlation reconstruction

While we would like to predict exact values, available incomplete inform...
research
07/21/2022

Low cost prediction of probability distributions of molecular properties for early virtual screening

While there is a general focus on predictions of values, mathematically ...
research
07/11/2018

Exploiting statistical dependencies of time series with hierarchical correlation reconstruction

While we are usually focused on predicting future values of time series,...
research
10/16/2012

Spectral Estimation of Conditional Random Graph Models for Large-Scale Network Data

Generative models for graphs have been typically committed to strong pri...
research
06/16/2021

An Imprecise SHAP as a Tool for Explaining the Class Probability Distributions under Limited Training Data

One of the most popular methods of the machine learning prediction expla...
research
06/10/2019

Multimodal Data Fusion of Non-Gaussian Spatial Fields in Sensor Networks

We develop a robust data fusion algorithm for field reconstruction of mu...
research
06/08/2022

TreeFlow: Going beyond Tree-based Gaussian Probabilistic Regression

The tree-based ensembles are known for their outstanding performance for...

Please sign up or login with your details

Forgot password? Click here to reset