Pulsars Detection by Machine Learning with Very Few Features

02/20/2020
by   Haitao Lin, et al.
0

It is an active topic to investigate the schemes based on machine learning (ML) methods for detecting pulsars as the data volume growing exponentially in modern surveys. To improve the detection performance, input features into an ML model should be investigated specifically. In the existing pulsar detection researches based on ML methods, there are mainly two kinds of feature designs: the empirical features and statistical features. Due to the combinational effects from multiple features, however, there exist some redundancies and even irrelevant components in the available features, which can reduce the accuracy of a pulsar detection model. Therefore, it is essential to select a subset of relevant features from a set of available candidate features and known as feature selection. In this work, two feature selection algorithms —-Grid Search (GS) and Recursive Feature Elimination (RFE)—- are proposed to improve the detection performance by removing the redundant and irrelevant features. The algorithms were evaluated on the Southern High Time Resolution University survey (HTRU-S) with five pulsar detection models. The experimental results verify the effectiveness and efficiency of our proposed feature selection algorithms. By the GS, a model with only two features reach a recall rate as high as 99% and a false positive rate (FPR) as low as 0.65%; By the RFE, another model with only three features achieves a recall rate 99% and an FPR of 0.16% in pulsar candidates classification. Furthermore, this work investigated the number of features required as well as the misclassified pulsars by our models.

READ FULL TEXT

page 8

page 9

page 12

research
03/30/2022

IGRF-RFE: A Hybrid Feature Selection Method for MLP-based Network Intrusion Detection on UNSW-NB15 Dataset

The effectiveness of machine learning models is significantly affected b...
research
11/23/2021

Filter Methods for Feature Selection in Supervised Machine Learning Applications – Review and Benchmark

The amount of data for machine learning (ML) applications is constantly ...
research
05/04/2021

Drifting Features: Detection and evaluation in the context of automatic RRLs identification in VVV

As most of the modern astronomical sky surveys produce data faster than ...
research
10/05/2021

Feature Selection by a Mechanism Design

In constructing an econometric or statistical model, we pick relevant fe...
research
08/19/2023

Utilizing Semantic Textual Similarity for Clinical Survey Data Feature Selection

Survey data can contain a high number of features while having a compara...
research
03/14/2016

Rapid building detection using machine learning

This work describes algorithms for performing discrete object detection,...
research
09/23/2021

Federated Feature Selection for Cyber-Physical Systems of Systems

Autonomous systems generate a huge amount of multimodal data that are co...

Please sign up or login with your details

Forgot password? Click here to reset