Imbalance Learning for Variable Star Classification

02/27/2020
by   Zafiirah Hosenie, et al.
0

The accurate automated classification of variable stars into their respective sub-types is difficult. Machine learning based solutions often fall foul of the imbalanced learning problem, which causes poor generalisation performance in practice, especially on rare variable star sub-types. In previous work, we attempted to overcome such deficiencies via the development of a hierarchical machine learning classifier. This 'algorithm-level' approach to tackling imbalance, yielded promising results on Catalina Real-Time Survey (CRTS) data, outperforming the binary and multi-class classification schemes previously applied in this area. In this work, we attempt to further improve hierarchical classification performance by applying 'data-level' approaches to directly augment the training data so that they better describe under-represented classes. We apply and report results for three data augmentation methods in particular: Randomly Augmented Sampled Light curves from magnitude Error (RASLE), augmenting light curves with Gaussian Process modelling (GpFit) and the Synthetic Minority Over-sampling Technique (SMOTE). When combining the 'algorithm-level' (i.e. the hierarchical scheme) together with the 'data-level' approach, we further improve variable star classification accuracy by 1-4%. We found that a higher classification rate is obtained when using GpFit in the hierarchical model. Further improvement of the metric scores requires a better standard set of correctly identified variable stars and, perhaps enhanced features are needed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/18/2019

Comparing Multi-class, Binary and Hierarchical Machine Learning Classification schemes for variable stars

Upcoming synoptic surveys are set to generate an unprecedented amount of...
research
07/23/2020

SeismoGlow – Data augmentation for the class imbalance problem

In several application areas, such as medical diagnosis, spam filtering,...
research
02/03/2020

Scalable End-to-end Recurrent Neural Network for Variable star classification

During the last decade, considerable effort has been made to perform aut...
research
09/10/2019

Photometric light curves classification with machine learning

The Large Synoptic Survey Telescope will complete its survey in 2022 and...
research
12/04/2019

Streaming Classification of Variable Stars

In the last years, automatic classification of variable stars has receiv...
research
07/07/2022

Algebraic and machine learning approach to hierarchical triple-star stability

We present two approaches to determine the dynamical stability of a hier...
research
05/01/2020

Automatic Catalog of RRLyrae from ∼ 14 million VVV Light Curves: How far can we go with traditional machine-learning?

The creation of a 3D map of the bulge using RRLyrae (RRL) is one of the ...

Please sign up or login with your details

Forgot password? Click here to reset