Bag-of-Words Method Applied to Accelerometer Measurements for the Purpose of Classification and Energy Estimation

04/05/2017
by   Kevin M. Amaral, et al.
0

Accelerometer measurements are the prime type of sensor information most think of when seeking to measure physical activity. On the market, there are many fitness measuring devices which aim to track calories burned and steps counted through the use of accelerometers. These measurements, though good enough for the average consumer, are noisy and unreliable in terms of the precision of measurement needed in a scientific setting. The contribution of this paper is an innovative and highly accurate regression method which uses an intermediary two-stage classification step to better direct the regression of energy expenditure values from accelerometer counts. We show that through an additional unsupervised layer of intermediate feature construction, we can leverage latent patterns within accelerometer counts to provide better grounds for activity classification than expert-constructed timeseries features. For this, our approach utilizes a mathematical model originating in natural language processing, the bag-of-words model, that has in the past years been appearing in diverse disciplines outside of the natural language processing field such as image processing. Further emphasizing the natural language connection to stochastics, we use a gaussian mixture model to learn the dictionary upon which the bag-of-words model is built. Moreover, we show that with the addition of these features, we're able to improve regression root mean-squared error of energy expenditure by approximately 1.4 units over existing state-of-the-art methods.

READ FULL TEXT
research
06/05/2022

Near-Term Advances in Quantum Natural Language Processing

This paper describes experiments showing that some problems in natural l...
research
11/10/2019

Un systeme de lemmatisation pour les applications de TALN

This paper presents a method of stemming for the Arabian texts based on ...
research
07/18/2017

Spherical Paragraph Model

Representing texts as fixed-length vectors is central to many language p...
research
03/17/2021

UniParma @ SemEval 2021 Task 5: Toxic Spans Detection Using CharacterBERT and Bag-of-Words Model

With the ever-increasing availability of digital information, toxic cont...
research
11/30/2022

Measurement of Investment activity in China based on Natural language processing technology

The purpose of this study is to propose a new index to measure and refle...

Please sign up or login with your details

Forgot password? Click here to reset