Subject Cross Validation in Human Activity Recognition

04/04/2019
by Akbar Dehghani, et al.

K-fold Cross Validation is commonly used to evaluate classifiers and tune their hyperparameters. However, it assumes that data points are Independent and Identically Distributed (i.i.d.), so that samples used in the training and test sets can be selected randomly and uniformly. In Human Activity Recognition datasets, we note that the samples produced by the same subjects are likely to be correlated due to diverse factors. Hence, k-fold cross validation may overestimate the performance of activity recognizers, in particular when overlapping sliding windows are used. In this paper, we investigate the effect of Subject Cross Validation on the performance of Human Activity Recognition, both with non-overlapping and with overlapping sliding windows. Results show that k-fold cross validation artificially increases the performance of recognizers by about 10%. In addition, we do not observe any performance gain from the use of overlapping windows. We conclude that Human Activity Recognition systems should be evaluated by Subject Cross Validation, and that overlapping windows are not worth their extra computational cost.
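
The difference between the two evaluation schemes can be illustrated with a small sketch. The code below is not the authors' implementation; it is a minimal, standard-library illustration under stated assumptions: the helper names (`sliding_windows`, `kfold_splits`, `subject_splits`, `shares_subject`) are hypothetical, windows are cut from a 1-D signal, and Subject Cross Validation is shown in its leave-one-subject-out form, which may differ from the exact protocol used in the paper.

```python
import random
from collections import defaultdict


def sliding_windows(signal, size, overlap=0.0):
    """Cut a 1-D signal into fixed-size windows; overlap is a fraction in [0, 1)."""
    step = max(1, int(size * (1.0 - overlap)))
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, step)]


def kfold_splits(n_samples, k, seed=0):
    """Record-wise k-fold: windows are shuffled without regard to subject."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, test


def subject_splits(subjects):
    """Leave-one-subject-out: all windows of one subject form the test set."""
    by_subject = defaultdict(list)
    for i, s in enumerate(subjects):
        by_subject[s].append(i)
    for held_out in sorted(by_subject):
        test = by_subject[held_out]
        train = [i for s, ix in by_subject.items() if s != held_out for i in ix]
        yield train, test


def shares_subject(train, test, subjects):
    """True if some subject contributes windows to both train and test."""
    return bool({subjects[i] for i in train} & {subjects[i] for i in test})


# Toy data (assumed for illustration): 3 subjects, 20 samples each,
# windows of 4 samples with 50% overlap.
subjects = []
for name in ("A", "B", "C"):
    windows = sliding_windows(list(range(20)), size=4, overlap=0.5)
    subjects.extend([name] * len(windows))

for train, test in kfold_splits(len(subjects), k=5):
    print("k-fold leaks subject:", shares_subject(train, test, subjects))
for train, test in subject_splits(subjects):
    print("subject CV leaks subject:", shares_subject(train, test, subjects))
```

In this toy setup every k-fold test set contains windows from subjects that also appear in the training set, whereas the subject-wise splits never do. With overlapping windows, adjacent windows also share raw samples, so a record-wise split can place nearly identical data on both sides of the train/test boundary.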

