Nutrition and Health Data for Cost-Sensitive Learning

02/19/2019
by   Mohammad Kachuee, et al.
0

Traditionally, machine learning algorithms have been focused on modeling dynamics of a certain dataset at hand for which all features are available for free. However, there are many concerns such as monetary data collection costs, patient discomfort in medical procedures, and privacy impacts of data collection that require careful consideration in any health analytics system. An efficient solution would only acquire a subset of features based on the value it provides whilst considering acquisition costs. Moreover, datasets that provide feature costs are very limited, especially in healthcare. In this paper, we provide a health dataset as well as a method for assigning feature costs based on the total level of inconvenience asking for each feature entails. Furthermore, based on the suggested dataset, we provide a comparison of recent and state-of-the-art approaches to cost-sensitive feature acquisition and learning. Specifically, we analyze the performance of major sensitivity-based and reinforcement learning based methods in the literature on three different problems in the health domain, including diabetes, heart disease, and hypertension classification.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/17/2019

Cost-Sensitive Feature-Value Acquisition Using Feature Relevance

In many real-world machine learning problems, feature values are not rea...
research
07/31/2023

Generative models for wearables data

Data scarcity is a common obstacle in medical research due to the high c...
research
01/02/2019

Opportunistic Learning: Budgeted Cost-Sensitive Learning from Data Streams

In many real-world learning scenarios, features are only acquirable at a...
research
02/27/2023

Acquisition Conditioned Oracle for Nongreedy Active Feature Acquisition

We develop novel methodology for active feature acquisition (AFA), the s...
research
05/23/2023

Happy or Evil Laughter? Analysing a Database of Natural Audio Samples

We conducted a data collection on the basis of the Google AudioSet datab...
research
01/02/2019

Cost-sensitive Selection of Variables by Ensemble of Model Sequences

Many applications require the collection of data on different variables ...
research
01/16/2014

Value of Information Lattice: Exploiting Probabilistic Independence for Effective Feature Subset Acquisition

We address the cost-sensitive feature acquisition problem, where misclas...

Please sign up or login with your details

Forgot password? Click here to reset