Exoplanet Validation with Machine Learning: 50 new validated Kepler planets

08/24/2020
by David J. Armstrong, et al.

Over 30% of the ~4000 known exoplanets to date have been discovered using 'validation', where the statistical likelihood of a transit arising from a false positive (FP), non-planetary scenario is calculated. For the large majority of these validated planets, calculations were performed using the vespa algorithm (Morton et al. 2016). Regardless of the strengths and weaknesses of vespa, it is highly desirable for the catalogue of known planets not to be dependent on a single method. We demonstrate the use of machine learning algorithms, specifically a Gaussian process classifier (GPC) reinforced by other models, to perform probabilistic planet validation incorporating prior probabilities for possible FP scenarios. The GPC can attain a mean log-loss per sample of 0.54 when separating confirmed planets from FPs in the Kepler threshold crossing event (TCE) catalogue. Our models can validate thousands of unseen candidates in seconds once applicable vetting metrics are calculated, and can be adapted to work with the active TESS mission, where the large number of observed targets necessitates the use of automated algorithms. We discuss the limitations and caveats of this methodology and, after accounting for possible failure modes, newly validate 50 Kepler candidates as planets, sanity-checking the validations by confirming them with vespa using up-to-date stellar information. Concerning discrepancies with vespa arise for many other candidates, which typically resolve in favour of our models. Given such issues, we caution against using single-method planet validation with either method until the discrepancies are fully understood.
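The "mean log-loss per sample" quoted above is a standard score for probabilistic classifiers. The sketch below shows how such a figure is typically computed for a binary planet-versus-FP problem. It is an illustration only, not the authors' code: the function name `mean_log_loss`, the label convention (1 = planet, 0 = FP), and the clipping constant `eps` are assumptions introduced here.

```python
import math


def mean_log_loss(y_true, p_planet, eps=1e-15):
    """Mean binary log-loss per sample.

    y_true   : sequence of labels, 1 = planet, 0 = false positive (assumed convention)
    p_planet : sequence of predicted probabilities that each sample is a planet
    eps      : clipping constant (assumption) to avoid log(0)
    """
    if len(y_true) != len(p_planet) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    total = 0.0
    for y, p in zip(y_true, p_planet):
        # Clip the probability so the logarithm stays finite.
        p = min(max(p, eps), 1.0 - eps)
        # Penalise the probability assigned to the wrong class.
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(y_true)


if __name__ == "__main__":
    # Toy labels and probabilities, invented purely for illustration.
    labels = [1, 0, 1, 0]
    probs = [0.9, 0.2, 0.6, 0.4]
    print(mean_log_loss(labels, probs))
```

For reference, a classifier that always outputs 0.5 scores ln 2 ≈ 0.693 on this metric, and lower values indicate better-calibrated, more confident correct predictions.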


