Finding rare objects and building pure samples: Probabilistic quasar classification from low resolution Gaia spectra

09/19/2008
by   C. A. L. Bailer-Jones, et al.
0

We develop and demonstrate a probabilistic method for classifying rare objects in surveys with the particular goal of building very pure samples. It works by modifying the output probabilities from a classifier so as to accommodate our expectation (priors) concerning the relative frequencies of different classes of objects. We demonstrate our method using the Discrete Source Classifier, a supervised classifier currently based on Support Vector Machines, which we are developing in preparation for the Gaia data analysis. DSC classifies objects using their very low resolution optical spectra. We look in detail at the problem of quasar classification, because identification of a pure quasar sample is necessary to define the Gaia astrometric reference frame. By varying a posterior probability threshold in DSC we can trade off sample completeness and contamination. We show, using our simulated data, that it is possible to achieve a pure sample of quasars (upper limit on contamination of 1 in 40,000) with a completeness of 65 G=20.0, even when quasars have a frequency of only 1 in every 2000 objects. The star sample completeness is simultaneously 99 Including parallax and proper motion in the classifier barely changes the results. We further show that not accounting for class priors in the target population leads to serious misclassifications and poor predictions for sample completeness and contamination. (Truncated)

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/07/2020

Active deep learning method for the discovery of objects of interest in large spectroscopic surveys

Current archives of the LAMOST telescope contain millions of pipeline-pr...
research
03/18/2021

A Pilot Study For Fragment Identification Using 2D NMR and Deep Learning

This paper presents a method to identify substructures in NMR spectra of...
research
08/29/2018

QuasarNET: Human-level spectral classification and redshifting with Deep Neural Networks

We introduce QuasarNET, a deep convolutional neural network that perform...
research
06/13/2016

Specialized Support Vector Machines for open-set recognition

Often, when dealing with real-world recognition problems, we do not need...
research
05/02/2023

Unpaired Downscaling of Fluid Flows with Diffusion Bridges

We present a method to downscale idealized geophysical fluid simulations...
research
02/11/2018

Band Target Entropy Minimization and Target Partial Least Squares for Spectral Recovery and Calibration

The resolution and calibration of pure spectra of minority components in...
research
02/11/2018

Band Target Entropy Minimization and Target Partial Least Squares for Single Target Multivariate Curve Resolution and Calibration

Band target entropy minimization (BTEM) and target partial least squares...

Please sign up or login with your details

Forgot password? Click here to reset