One-Step or Two-Step Optimization and the Overfitting Phenomenon: A Case Study on Time Series Classification

07/16/2014
by   Muhammad Marwan Muhammad Fuad, et al.
0

For the last few decades, optimization has been developing at a fast rate. Bio-inspired optimization algorithms are metaheuristics inspired by nature. These algorithms have been applied to solve different problems in engineering, economics, and other domains. Bio-inspired algorithms have also been applied in different branches of information technology such as networking and software engineering. Time series data mining is a field of information technology that has its share of these applications too. In previous works we showed how bio-inspired algorithms such as the genetic algorithms and differential evolution can be used to find the locations of the breakpoints used in the symbolic aggregate approximation of time series representation, and in another work we showed how we can utilize the particle swarm optimization, one of the famous bio-inspired algorithms, to set weights to the different segments in the symbolic aggregate approximation representation. In this paper we present, in two different approaches, a new meta optimization process that produces optimal locations of the breakpoints in addition to optimal weights of the segments. The experiments of time series classification task that we conducted show an interesting example of how the overfitting phenomenon, a frequently encountered problem in data mining which happens when the model overfits the training set, can interfere in the optimization process and hide the superior performance of an optimization algorithm.

READ FULL TEXT

Authors

page 1

page 2

page 3

page 4

12/06/2013

Particle Swarm Optimization of Information-Content Weighting of Symbolic Aggregate Approximation

Bio-inspired optimization algorithms have been gaining more popularity r...
12/24/2021

TSAX is Trending

Time series mining is an important branch of data mining, as time series...
05/01/2019

A Novel Trend Symbolic Aggregate Approximation for Time Series

Symbolic Aggregate approximation (SAX) is a classical symbolic approach ...
01/29/2015

Particle swarm optimization for time series motif discovery

Efficiently finding similar segments or motifs in time series data is a ...
05/25/2022

Towards Symbolic Time Series Representation Improved by Kernel Density Estimators

This paper deals with symbolic time series representation. It builds up ...
04/14/2020

Co-eye: A Multi-resolution Symbolic Representation to TimeSeries Diversified Ensemble Classification

Time series classification (TSC) is a challenging task that attracted ma...
10/02/2020

Modifying the Symbolic Aggregate Approximation Method to Capture Segment Trend Information

The Symbolic Aggregate approXimation (SAX) is a very popular symbolic di...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.