Photometric Redshift Estimation with Convolutional Neural Networks and Galaxy Images: A Case Study of Resolving Biases in Data-Driven Methods

02/21/2022
by Q. Lin, et al.

Deep learning models have been increasingly exploited in astrophysical studies, yet such data-driven algorithms are prone to producing biased outputs that are detrimental to subsequent analyses. In this work, we investigate two major forms of bias, i.e., class-dependent residuals and mode collapse, in a case study of estimating photometric redshifts as a classification problem using Convolutional Neural Networks (CNNs) and galaxy images with spectroscopic redshifts. We focus on point estimates and propose a set of consecutive steps for resolving the two biases based on CNN models, involving representation learning with multi-channel outputs, balancing the training data, and leveraging soft labels. The residuals can be viewed as a function of spectroscopic redshifts or photometric redshifts; the biases with respect to these two definitions are incompatible and should be treated separately. We suggest that resolving biases in the spectroscopic space is a prerequisite for resolving biases in the photometric space. Experiments show that our methods control biases better than benchmark methods and are robust under varying implementation and training conditions, provided that high-quality data are available. Our methods hold promise for future cosmological surveys that require tight constraints on biases, and may be applied to regression problems and other studies that make use of data-driven models. Nonetheless, the bias-variance trade-off and the demand for sufficient statistics suggest the need for developing better methodologies and optimizing data usage strategies.
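
To make the two residual definitions concrete, here is a minimal NumPy sketch on toy data. It is not the authors' CNN pipeline: the helper names, the redshift range and bin width, the Gaussian form of the soft labels, the inverse-frequency balancing weights, the residual convention (z_photo - z_spec) / (1 + z_spec), and the toy scatter model are all assumptions made for illustration. It shows how an estimator whose mean residual is roughly zero when binned by spectroscopic redshift can still show a systematic tilt when binned by photometric redshift.

```python
import numpy as np

def bin_index(z, bin_edges):
    """Index of the redshift bin containing each z (out-of-range values are clipped)."""
    return np.clip(np.digitize(z, bin_edges) - 1, 0, len(bin_edges) - 2)

def soft_labels(z_spec, bin_edges, sigma):
    """Gaussian-smoothed class targets over redshift bins (assumed form of soft label)."""
    centers = 0.5 * (bin_edges[:-1] + bin_edges[1:])
    w = np.exp(-0.5 * ((centers[None, :] - z_spec[:, None]) / sigma) ** 2)
    return w / w.sum(axis=1, keepdims=True)

def balance_weights(z_spec, bin_edges):
    """Inverse-frequency sample weights that flatten the z_spec histogram."""
    idx = bin_index(z_spec, bin_edges)
    counts = np.bincount(idx, minlength=len(bin_edges) - 1)
    return 1.0 / counts[idx]

def mean_residual_per_bin(z_photo, z_spec, by, bin_edges):
    """Mean of (z_photo - z_spec) / (1 + z_spec) in bins of `by`; NaN for empty bins."""
    resid = (z_photo - z_spec) / (1.0 + z_spec)
    idx = bin_index(by, bin_edges)
    n_bins = len(bin_edges) - 1
    sums = np.bincount(idx, weights=resid, minlength=n_bins)
    counts = np.bincount(idx, minlength=n_bins)
    with np.errstate(invalid="ignore", divide="ignore"):
        return np.where(counts > 0, sums / counts, np.nan)

# Toy data (assumption): a peaked z_spec distribution and a "photometric"
# estimate that scatters symmetrically around the true value.
rng = np.random.default_rng(0)
edges = np.linspace(0.0, 0.4, 21)  # 20 bins of width 0.02
z_spec = rng.normal(0.2, 0.05, size=20000)
z_photo = z_spec + rng.normal(0.0, 0.03, size=z_spec.size)

labels = soft_labels(z_spec, edges, sigma=0.01)
weights = balance_weights(z_spec, edges)

# Same predictions, two ways of binning the residuals.
bias_vs_spec = mean_residual_per_bin(z_photo, z_spec, z_spec, edges)
bias_vs_photo = mean_residual_per_bin(z_photo, z_spec, z_photo, edges)
```

In this toy setup `bias_vs_spec` stays close to zero in well-populated bins, while `bias_vs_photo` is negative below the peak of the distribution and positive above it, because objects scattered into a given photometric bin come preferentially from the more populated side. This is one simple way to see why the two definitions cannot be treated as interchangeable.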


