openXDATA: A Tool for Multi-Target Data Generation and Missing Label Completion

07/27/2020
by   Felix Weninger, et al.
0

A common problem in machine learning is to deal with datasets with disjoint label spaces and missing labels. In this work, we introduce the openXDATA tool that completes the missing labels in partially labelled or unlabelled datasets in order to generate multi-target data with labels in the joint label space of the datasets. To this end, we designed and implemented the cross-data label completion (CDLC) algorithm that uses a multi-task shared-hidden-layer DNN to iteratively complete the sparse label matrix of the instances from the different datasets. We apply the new tool to estimate labels across four emotion datasets: one labeled with discrete emotion categories (e.g., happy, sad, angry), one labeled with continuous values along arousal and valence dimensions, one with both kinds of labels, and one unlabeled. Testing with drop-out of true labels, we show the ability to estimate both categories and continuous labels for all of the datasets, at rates that approached the ground truth values. openXDATA is available under the GNU General Public License from https://github.com/fweninger/openXDATA.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/29/2022

Unifying the Discrete and Continuous Emotion labels for Speech Emotion Recognition

Traditionally, in paralinguistic analysis for emotion detection from spe...
research
03/17/2020

Partial Multi-label Learning with Label and Feature Collaboration

Partial multi-label learning (PML) models the scenario where each traini...
research
02/25/2023

Partial Label Learning for Emotion Recognition from EEG

Fully supervised learning has recently achieved promising performance in...
research
03/19/2022

Font Generation with Missing Impression Labels

Our goal is to generate fonts with specific impressions, by training a g...
research
05/31/2019

Max-MIG: an Information Theoretic Approach for Joint Learning from Crowds

Eliciting labels from crowds is a potential way to obtain large labeled ...
research
09/10/2023

Anatomy Completor: A Multi-class Completion Framework for 3D Anatomy Reconstruction

In this paper, we introduce a completion framework to reconstruct the ge...
research
10/28/2020

An Approach for GCI Fusion With Labeled Multitarget Densities

This paper addresses the Generalized Covariance Intersection (GCI) fusio...

Please sign up or login with your details

Forgot password? Click here to reset