Augmenting Physiological Time Series Data: A Case Study for Sleep Apnea Detection

05/22/2019
by   Konstantinos Nikolaidis, et al.
0

Supervised machine learning applications in the health domain often face the problem of insufficient training datasets. The quantity of labelled data is small due to privacy concerns and the cost of data acquisition and labelling by a medical expert. Furthermore, it is quite common that collected data are unbalanced and getting enough data to personalize models for individuals is very expensive or even infeasible. This paper addresses these problems by (1) designing a recurrent Generative Adversarial Network to generate realistic synthetic data and to augment the original dataset, (2) enabling the generation of balanced datasets based on heavily unbalanced dataset, and (3) to control the data generation in such a way that the generated data resembles data from specific individuals. We apply these solutions for sleep apnea detection and study in the evaluation the performance of four well-known techniques, i.e., K-Nearest Neighbour, Random Forest, Multi-Layer Perceptron, and Support Vector Machine. All classifiers exhibit in the experiments a consistent increase in sensitivity and a kappa statistic increase by between 0.007 and 0.182.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2017

Real-valued (Medical) Time Series Generation with Recurrent Conditional GANs

Generative Adversarial Networks (GANs) have shown remarkable success as ...
research
01/27/2022

FinGAN: Generative Adversarial Network for Analytical Customer Relationship Management in Banking and Insurance

Churn prediction in credit cards, fraud detection in insurance, and loan...
research
11/14/2019

Synthetic Event Time Series Health Data Generation

Synthetic medical data which preserves privacy while maintaining utility...
research
10/13/2020

Similarity Based Stratified Splitting: an approach to train better classifiers

We propose a Similarity-Based Stratified Splitting (SBSS) technique, whi...
research
12/15/2020

Detection of Anomalies in a Time Series Data using InfluxDB and Python

Analysis of water and environmental data is an important aspect of many ...
research
03/08/2023

ATM Fraud Detection using Streaming Data Analytics

Gaining the trust and confidence of customers is the essence of the grow...
research
02/17/2020

A survey of statistical learning techniques as applied to inexpensive pediatric Obstructive Sleep Apnea data

Pediatric obstructive sleep apnea affects an estimated 1-5 elementary-sc...

Please sign up or login with your details

Forgot password? Click here to reset