Informative Bayesian model selection for RR Lyrae star classifiers

05/24/2021
by   F. Pérez-Galarce, et al.

Machine learning has taken on an important role in the automatic classification of variable stars, and several classifiers have been proposed over the last decade. These classifiers have achieved impressive performance on several astronomical catalogues. However, some studies have shown that the training data contain multiple sources of bias. Hence, the performance of those classifiers on objects outside the training data is uncertain, which can lead to the selection of incorrect models and to the deployment of misleading classifiers. An example of the latter is the creation of open-source labelled catalogues with biased predictions. In this paper, we develop a method based on an informative marginal likelihood to evaluate variable star classifiers. We collect deterministic rules based on physical descriptors of RR Lyrae stars and, to mitigate the biases, introduce those rules into the marginal likelihood estimation. We perform experiments with a set of Bayesian logistic regressions trained to classify RR Lyrae stars, and we find that our method outperforms traditional non-informative cross-validation strategies, even when penalized models are assessed. Our methodology provides a more rigorous alternative for assessing machine learning models using astronomical knowledge. This approach can be extended to other classes of variable stars and refined with algorithmic improvements.
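The abstract does not spell out how the rules enter the marginal likelihood, so the following is only a rough sketch under stated assumptions, not the authors' procedure. It estimates the log marginal likelihood of a Bayesian logistic regression by simple Monte Carlo over prior samples, and it down-weights objects labelled RR Lyrae whose period falls outside an illustrative window. The function names, the 0.2 to 1.0 day window, the penalty value, and the Gaussian prior are all hypothetical choices made for illustration.

```python
import numpy as np


def rule_weights(period_days, labels, low=0.2, high=1.0, penalty=0.1):
    """Hypothetical deterministic rule: an object labelled RR Lyrae (label 1)
    whose period lies outside [low, high] days gets weight `penalty`;
    every other object keeps weight 1."""
    period_days = np.asarray(period_days, dtype=float)
    labels = np.asarray(labels)
    outside = (period_days < low) | (period_days > high)
    return np.where((labels == 1) & outside, penalty, 1.0)


def log_marginal_likelihood(X, y, weights=None, n_samples=5000,
                            prior_scale=1.0, seed=0):
    """Simple Monte Carlo estimate of log p(y | X) for logistic regression
    with an isotropic Gaussian prior (an assumed prior, not the paper's).
    Per-object log-likelihood terms are multiplied by `weights`."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, d = X.shape
    if weights is None:
        weights = np.ones(n)
    rng = np.random.default_rng(seed)
    # Draw parameter vectors from the prior.
    theta = rng.normal(0.0, prior_scale, size=(n_samples, d))
    logits = theta @ X.T                                # shape (S, n)
    # Bernoulli log-likelihood: y * logit - log(1 + exp(logit)).
    loglik = y * logits - np.logaddexp(0.0, logits)     # shape (S, n)
    total = loglik @ np.asarray(weights, dtype=float)   # shape (S,)
    # Stable log of the mean of exp(total).
    m = total.max()
    return m + np.log(np.mean(np.exp(total - m)))


# Toy usage with synthetic data (not from any catalogue).
rng = np.random.default_rng(1)
periods = rng.uniform(0.1, 2.0, size=40)
labels = (rng.uniform(size=40) < 0.5).astype(int)
X = np.column_stack([np.ones(40), periods])
w = rule_weights(periods, labels)
plain = log_marginal_likelihood(X, labels)
informed = log_marginal_likelihood(X, labels, weights=w)
```

Comparing `plain` and `informed` across candidate models would give two rankings, one ignoring the rule and one informed by it. Simple Monte Carlo from the prior is used here only because it is short; it has high variance and is not suggested as a practical estimator.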


