Drifting Features: Detection and evaluation in the context of automatic RRLs identification in VVV

05/04/2021
by J. B. Cabral, et al.

Because most modern astronomical sky surveys produce data faster than humans can analyze it, Machine Learning (ML) has become a central tool in astronomy. Modern ML methods are highly resistant to some experimental errors. However, small changes in the data over long distances or long periods of time, which cannot be easily detected by statistical methods, can harm these methods. We develop a new strategy to cope with this problem, using ML methods in an innovative way to identify these potentially harmful features. We introduce and discuss the notion of Drifting Features, related to small changes in the properties measured by the data features. Building on an earlier work on the identification of RR Lyrae stars (RRLs) in the VVV survey, we introduce a method for detecting Drifting Features. Our method forces a classifier to learn the tile of origin of diverse sources (mostly stellar 'point sources') and selects the features most relevant to that task as candidate Drifting Features. We show that this method can efficiently identify a reduced set of features that contains useful information about the tile of origin of the sources. For our particular example of detecting RRLs in VVV, we find that Drifting Features are mostly related to color indices. On the other hand, we show that, even though our problem has a clear set of Drifting Features, the identification of RRLs is mostly insensitive to them. In short, Drifting Features can be efficiently identified using ML methods, but in our example removing them does not improve the identification of RRLs.
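The tile-of-origin idea can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not name the classifier they use, so a one-threshold decision stump per feature stands in for it here. The data are synthetic, and the feature names (`period`, `amplitude`, `color_J_Ks`), the two-tile setup, and the size of the injected offset are all hypothetical choices made for illustration. A feature that lets even this weak classifier recover the tile well above chance is flagged as a candidate Drifting Feature.

```python
import random


def stump_accuracy(values, tiles):
    """Best in-sample accuracy of a one-threshold rule predicting a binary tile label."""
    pairs = sorted(zip(values, tiles))
    n = len(pairs)
    total_ones = sum(tile for _, tile in pairs)
    best = max(total_ones, n - total_ones) / n  # majority-class baseline
    ones_left = 0
    for i in range(1, n):
        ones_left += pairs[i - 1][1]
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold fits between two equal values
        zeros_left = i - ones_left
        ones_right = total_ones - ones_left
        # Rule A: left side -> tile 0, right side -> tile 1.
        # The mirrored rule gets exactly the remaining sources right.
        correct_a = zeros_left + ones_right
        best = max(best, correct_a / n, (n - correct_a) / n)
    return best


def rank_drift_candidates(features, tiles):
    """Sort features by how well each one alone predicts the tile of origin."""
    scores = {name: stump_accuracy(vals, tiles) for name, vals in features.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


def make_synthetic_sources(n_per_tile=500, seed=0):
    """Hypothetical data: two tiles, with an offset injected into one color index."""
    rng = random.Random(seed)
    tiles = [0] * n_per_tile + [1] * n_per_tile
    features = {
        # Same distribution in both tiles: should carry no tile information.
        "period": [rng.gauss(0.55, 0.10) for _ in tiles],
        "amplitude": [rng.gauss(0.30, 0.05) for _ in tiles],
        # Assumed drift: the mean shifts by 0.20 between tile 0 and tile 1.
        "color_J_Ks": [rng.gauss(0.40 + 0.20 * t, 0.10) for t in tiles],
    }
    return features, tiles


features, tiles = make_synthetic_sources()
ranking = rank_drift_candidates(features, tiles)
for name, accuracy in ranking:
    print(f"{name}: tile-of-origin accuracy {accuracy:.2f}")
```

Accuracies near 0.5 mean a feature says nothing about the tile, while values well above 0.5 mark it as a drift candidate. The scores are computed in-sample for brevity; a more careful version would score on held-out sources and use a multi-feature classifier with many tiles.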

