Deep vs. Shallow Learning: A Benchmark Study in Low Magnitude Earthquake Detection

05/01/2022
by   Akshat Goel, et al.

While deep learning models have seen high recent uptake in the geosciences and are appealing for their ability to learn from minimally processed input data, as black box models they offer no easy means of understanding how a decision is reached, which can be problematic, especially in safety-critical tasks. An alternative route is to use simpler, more transparent white box models, in which task-specific feature construction replaces the more opaque feature discovery performed automatically within deep learning models. Using data from the Groningen Gas Field in the Netherlands, we build on an existing logistic regression model by adding four further features discovered using elastic net driven data mining within the catch22 time series analysis package. We then evaluate the performance of the augmented logistic regression model relative to a deep (CNN) model, pre-trained on the Groningen data, at progressively increasing noise-to-signal ratios. We find that, for each ratio, our logistic regression model correctly detects every earthquake, while the deep model fails to detect nearly 20% of them, justifying at least a degree of caution in the application of deep models, especially to data with higher noise-to-signal ratios.
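To make the idea of evaluating at progressively increasing noise-to-signal ratios concrete, the following is a minimal sketch of how a waveform might be degraded to a chosen ratio before being passed to a detector. It is an illustration only, not the procedure used in the study: the helper names `rms` and `add_noise`, the use of Gaussian noise, the toy waveform, and the definition of the ratio as a quotient of RMS amplitudes are all assumptions.

```python
import math
import random


def rms(values):
    """Root-mean-square amplitude of a sequence of samples."""
    return math.sqrt(sum(v * v for v in values) / len(values))


def add_noise(signal, noise_to_signal_ratio, seed=0):
    """Return (noisy_signal, noise) with noise scaled to a target ratio.

    Assumption: the ratio is defined as rms(noise) / rms(signal);
    the study may define noise-to-signal ratio differently.
    """
    rng = random.Random(seed)
    raw_noise = [rng.gauss(0.0, 1.0) for _ in signal]
    # Rescale so that rms(noise) / rms(signal) matches the requested ratio.
    scale = noise_to_signal_ratio * rms(signal) / rms(raw_noise)
    noise = [scale * n for n in raw_noise]
    noisy = [s + n for s, n in zip(signal, noise)]
    return noisy, noise


# Hypothetical toy "event": a decaying 5 Hz sinusoid, 2 s at 100 samples/s.
signal = [
    math.exp(-2.0 * t / 100) * math.sin(2 * math.pi * 5 * t / 100)
    for t in range(200)
]

# Sweep over increasing ratios and report the ratio actually achieved.
for ratio in (0.5, 1.0, 2.0):
    noisy, noise = add_noise(signal, ratio)
    print(ratio, round(rms(noise) / rms(signal), 6))
```

Each noisy trace produced this way could then be given to both classifiers, so that any difference in missed detections can be compared at the same noise level.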


