From BOP to BOSS and Beyond: Time Series Classification with Dictionary Based Classifiers

09/18/2018
by James Large, et al.

A family of algorithms for time series classification (TSC) involves running a sliding window across each series, discretising the window to form a word, forming a histogram of word counts over the dictionary, then constructing a classifier on the histograms. A recent evaluation of two algorithms of this type, Bag of Patterns (BOP) and Bag of Symbolic Fourier Approximation Symbols (BOSS), found a significant difference in accuracy between these seemingly similar algorithms. We investigate this phenomenon by deconstructing the classifiers and measuring the relative importance of the four key components of BOP and BOSS. We find that whilst ensembling is a key component for both algorithms, the effect of the other components is mixed and more complex. We conclude that BOSS represents the state of the art for dictionary based TSC. Both BOP and BOSS can be classed as bag of words approaches, which are particularly popular in Computer Vision for tasks such as image classification. Adapting approaches from vision to time series requires careful engineering. We adapt three techniques used in Computer Vision for TSC: Scale Invariant Feature Transform; Spatial Pyramids; and Histogram Intersection. We find that using Spatial Pyramids in conjunction with BOSS (SP) produces a significantly more accurate classifier. SP is significantly more accurate than standard benchmarks and the original BOSS algorithm. It is not significantly worse than the best shapelet based approach, and is only outperformed by HIVE-COTE, an ensemble that includes BOSS as a constituent module.
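
The sliding-window pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a SAX-style discretisation (z-normalise, average over segments, map to a 4-letter alphabet using Gaussian quartile breakpoints), which is in the spirit of BOP, whereas BOSS derives its words from Fourier coefficients. The function names, the window and word lengths, and the 1-nearest-neighbour classifier with histogram intersection as the similarity are all illustrative assumptions.

```python
from collections import Counter
import math

# Assumption: 4-letter alphabet with Gaussian quartile breakpoints (SAX-style).
BREAKPOINTS = (-0.6745, 0.0, 0.6745)
ALPHABET = "abcd"


def znorm(window):
    """Z-normalise a window; a constant window maps to all zeros."""
    mean = sum(window) / len(window)
    std = math.sqrt(sum((x - mean) ** 2 for x in window) / len(window))
    if std < 1e-8:
        return [0.0] * len(window)
    return [(x - mean) / std for x in window]


def window_to_word(window, word_length):
    """Discretise one window into a word (requires word_length <= len(window))."""
    w = znorm(window)
    n = len(w)
    letters = []
    for i in range(word_length):
        start, end = i * n // word_length, (i + 1) * n // word_length
        seg_mean = sum(w[start:end]) / (end - start)
        # The letter index is the number of breakpoints below the segment mean.
        letters.append(ALPHABET[sum(seg_mean > b for b in BREAKPOINTS)])
    return "".join(letters)


def bag_of_words(series, window, word_length):
    """Slide a window across the series and count the words it produces."""
    return Counter(
        window_to_word(series[s:s + window], word_length)
        for s in range(len(series) - window + 1)
    )


def histogram_intersection(a, b):
    """Similarity of two histograms: sum over words of the smaller count."""
    return sum((a & b).values())


def classify(query_bag, train_bags, train_labels):
    """1-NN: return the label of the most similar training histogram."""
    sims = [histogram_intersection(query_bag, bag) for bag in train_bags]
    return train_labels[sims.index(max(sims))]


if __name__ == "__main__":
    up = [float(x) for x in range(16)]
    down = [float(x) for x in range(16, 0, -1)]
    train_bags = [bag_of_words(s, 8, 4) for s in (up, down)]
    query = bag_of_words([2.0 * x + 5.0 for x in range(16)], 8, 4)
    print(classify(query, train_bags, ["up", "down"]))
```

Because each window is z-normalised before discretisation, the rescaled and shifted ramp in the usage example produces the same words as the plain rising ramp, so its histogram overlaps only with the "up" training series.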

research
05/09/2021

The Temporal Dictionary Ensemble (TDE) Classifier for Time Series Classification

Using bag of words representations of time series is a popular approach ...
research
07/26/2019

Scalable Dictionary Classifiers for Time Series Classification

Dictionary based classifiers are a family of algorithms for time series ...
research
11/27/2019

A tale of two toolkits, report the second: bake off redux. Chapter 1. dictionary based classifiers

Time series classification (TSC) is the problem of learning labels from ...
research
04/15/2021

HIVE-COTE 2.0: a new meta ensemble for time series classification

The Hierarchical Vote Collective of Transformation-based Ensembles (HIVE...
research
03/25/2022

HYDRA: Competing convolutional kernels for fast and accurate time series classification

We demonstrate a simple connection between dictionary methods for time s...
research
01/24/2023

WEASEL 2.0 – A Random Dilated Dictionary Transform for Fast, Accurate and Memory Constrained Time Series Classification

A time series is a sequence of sequentially ordered real values in time....
research
08/12/2018

Interpretable Time Series Classification using All-Subsequence Learning and Symbolic Representations in Time and Frequency Domains

The time series classification literature has expanded rapidly over the ...