Using system context information to complement weakly labeled data

07/19/2021
by Matthias Meyer, et al.

Real-world datasets collected with sensor networks often contain incomplete and uncertain labels as well as artefacts arising from the system environment. Complete and reliable labeling is often infeasible for large-scale and long-term sensor network deployments due to the labor and time overhead, the limited availability of experts, and missing ground truth. In addition, if the machine learning method used for analysis is sensitive to certain features of a deployment, labeling and learning need to be repeated for every new deployment. To address these challenges, we propose to make use of system context information formalized in an information graph and to embed it in the learning process via contrastive learning. Based on real-world data, we show that this approach increases accuracy on weakly labeled data and improves the robustness and transferability of the classifier to new sensor locations.
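
The abstract does not spell out how the information graph enters the contrastive objective, so the following is only a minimal sketch of one common way to do it, not the authors' implementation. It assumes a hypothetical binary adjacency matrix `adjacency` in which an edge means two samples share some system context, and it uses a generic InfoNCE-style loss that pulls linked samples together in embedding space. The function name, the `temperature` value, and the toy data are all illustrative assumptions.

```python
import numpy as np


def graph_contrastive_loss(embeddings, adjacency, temperature=0.1):
    """Hypothetical InfoNCE-style loss (an assumption, not the paper's loss).

    Samples that are linked in `adjacency` are treated as positives for
    each other; every other sample in the batch acts as a negative.
    """
    # L2-normalize so that dot products are cosine similarities.
    z = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sim = z @ z.T / temperature

    n = sim.shape[0]
    self_mask = np.eye(n, dtype=bool)
    # Exclude each sample's similarity with itself from the softmax.
    sim = np.where(self_mask, -np.inf, sim)

    # Numerically stable row-wise log-softmax.
    row_max = sim.max(axis=1, keepdims=True)
    log_norm = np.log(np.exp(sim - row_max).sum(axis=1, keepdims=True))
    log_prob = sim - row_max - log_norm

    # Positives are graph neighbours, never the sample itself.
    pos = adjacency.astype(bool) & ~self_mask
    pos_count = pos.sum(axis=1)
    has_pos = pos_count > 0
    if not has_pos.any():
        return 0.0  # no edges in this batch, nothing to contrast

    # Average negative log-probability of the positives per anchor.
    pos_log_prob = np.where(pos, log_prob, 0.0).sum(axis=1)
    per_anchor = -pos_log_prob[has_pos] / pos_count[has_pos]
    return float(per_anchor.mean())


# Toy batch: samples 0 and 1 are similar, as are samples 2 and 3.
toy_embeddings = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
# Hypothetical context graph linking 0-1 and 2-3.
toy_adjacency = np.array([[0, 1, 0, 0],
                          [1, 0, 0, 0],
                          [0, 0, 0, 1],
                          [0, 0, 1, 0]])
toy_loss = graph_contrastive_loss(toy_embeddings, toy_adjacency)
```

In this sketch the loss is small when context-linked samples already lie close together and grows when the graph links samples whose embeddings are far apart, which is the signal a training loop would minimize alongside the loss on the weak labels.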

