Finding the Perfect Fit: Applying Regression Models to ClimateBench v1.0

08/23/2023
by Anmol Chaure, et al.

Climate projection with data-driven machine learning models acting as emulators is an active area of research that aims to help policymakers make informed decisions. Using machine learning emulators as surrogates for computationally heavy GCM simulators reduces both computation time and carbon footprint. In this direction, ClimateBench [1] is a recently curated benchmarking dataset for evaluating the performance of machine learning emulators designed for climate data. Recent studies have reported that regression models, although often regarded as basic, offer several advantages for climate emulation. In particular, by leveraging the kernel trick, regression models can capture complex non-linear relationships and improve their predictive capabilities. This study evaluates non-linear regression models on the ClimateBench dataset. Specifically, we compare the emulation capabilities of three non-linear regression models. Among them, the Gaussian Process Regressor achieves the best performance on the standard evaluation metrics used in climate field emulation studies. However, Gaussian Process Regression is resource hungry in terms of both space and time complexity. Support Vector and Kernel Ridge models also deliver competitive results, but they involve certain trade-offs that need to be addressed. Additionally, we are actively investigating composite kernels and techniques such as variational inference to further improve the regression models and to model complex non-linear patterns effectively, including phenomena such as precipitation.
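As a rough illustration of the kernel trick mentioned in the abstract, the sketch below fits a kernel ridge regressor with an RBF kernel using only NumPy and compares it with a plain linear fit. It is not the paper's setup: the data are a synthetic 1-D toy problem rather than ClimateBench, and the function names, kernel choice, and hyperparameters (`lam`, `length_scale`) are assumptions made for illustration.

```python
import numpy as np


def rbf_kernel(A, B, length_scale=1.0):
    """Squared-exponential kernel matrix between rows of A and rows of B."""
    sq_dists = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-0.5 * np.maximum(sq_dists, 0.0) / length_scale**2)


def fit_kernel_ridge(X, y, lam=1e-2, length_scale=1.0):
    """Solve (K + lam * I) alpha = y for the dual coefficients alpha.

    The n x n kernel matrix needs O(n^2) memory and the dense solve
    costs O(n^3) time, which is the usual scaling bottleneck of
    kernel methods on large training sets.
    """
    K = rbf_kernel(X, X, length_scale)
    return np.linalg.solve(K + lam * np.eye(len(X)), y)


def predict_kernel_ridge(X_train, alpha, X_new, length_scale=1.0):
    """Predict as a kernel-weighted sum over the training points."""
    return rbf_kernel(X_new, X_train, length_scale) @ alpha


# Synthetic toy data (an assumption for illustration, NOT ClimateBench).
rng = np.random.default_rng(0)
X_train = rng.uniform(-3.0, 3.0, size=(80, 1))
y_train = np.sin(2.0 * X_train[:, 0]) + 0.05 * rng.normal(size=80)
X_test = np.linspace(-2.5, 2.5, 50)[:, None]
y_test = np.sin(2.0 * X_test[:, 0])

# Non-linear fit through the kernel trick.
alpha = fit_kernel_ridge(X_train, y_train, lam=1e-2, length_scale=0.5)
y_kernel = predict_kernel_ridge(X_train, alpha, X_test, length_scale=0.5)
rmse_kernel = float(np.sqrt(np.mean((y_kernel - y_test) ** 2)))

# Linear least-squares baseline (slope plus intercept) for comparison.
A_train = np.hstack([X_train, np.ones((len(X_train), 1))])
A_test = np.hstack([X_test, np.ones((len(X_test), 1))])
w = np.linalg.lstsq(A_train, y_train, rcond=None)[0]
rmse_linear = float(np.sqrt(np.mean((A_test @ w - y_test) ** 2)))

print(f"kernel ridge RMSE: {rmse_kernel:.4f}")
print(f"linear fit RMSE:   {rmse_linear:.4f}")
```

The model stays linear in the dual coefficients `alpha`, while the kernel supplies the non-linearity. The dense solve on the kernel matrix is also where the space and time costs noted for Gaussian Process Regression arise.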


