Semi-supervised Learning with the EM Algorithm: A Comparative Study between Unstructured and Structured Prediction

08/28/2020
by Wenchong He, et al.

Semi-supervised learning aims to learn prediction models from both labeled and unlabeled samples, and there has been extensive research in this area. Among existing work, generative mixture models trained with Expectation-Maximization (EM) are a popular approach because of their clear statistical properties. However, the existing literature on EM-based semi-supervised learning largely focuses on unstructured prediction, which assumes that samples are independent and identically distributed. Studies of EM-based semi-supervised approaches to structured prediction are limited. This paper aims to fill the gap through a comparative study of unstructured and structured methods in EM-based semi-supervised learning. Specifically, we compare their theoretical properties and find that both methods can be viewed as generalizations of self-training with soft class assignment of unlabeled samples, but the structured method additionally applies structural constraints in the soft class assignment. We conducted a case study on real-world flood mapping datasets to compare the two methods. The results show that, in the flood mapping application, structured EM is more robust to class confusion caused by noise and obstacles in the features.
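To make the idea of "self-training with soft class assignment" concrete, the following is a minimal sketch of the unstructured (i.i.d.) case only. It is not the authors' implementation: the function name `semi_supervised_em`, the choice of one diagonal-covariance Gaussian per class, and all parameters are assumptions made for illustration. Labeled samples keep fixed one-hot class assignments, while unlabeled samples receive posterior class probabilities that are re-estimated in every E-step.

```python
import numpy as np

def semi_supervised_em(X_lab, y_lab, X_unl, n_classes, n_iter=50, var_floor=1e-6):
    """Illustrative semi-supervised EM with one diagonal Gaussian per class.

    Assumes every class has at least one labeled sample and that y_lab
    holds integer class indices in [0, n_classes).
    """
    n_lab = X_lab.shape[0]
    X = np.vstack([X_lab, X_unl])
    # Responsibilities: one-hot for labeled samples, zero for unlabeled ones,
    # so the first M-step uses labeled data only.
    R = np.vstack([np.eye(n_classes)[y_lab],
                   np.zeros((X_unl.shape[0], n_classes))])
    for _ in range(n_iter):
        # M-step: weighted maximum-likelihood estimates of the parameters.
        weight = R.sum(axis=0)
        prior = weight / weight.sum()
        mean = (R.T @ X) / weight[:, None]
        var = np.maximum((R.T @ X**2) / weight[:, None] - mean**2, var_floor)
        # E-step: soft class assignment of the unlabeled samples only.
        diff = X_unl[:, None, :] - mean[None, :, :]
        log_lik = -0.5 * (diff**2 / var[None] + np.log(2 * np.pi * var[None])).sum(axis=2)
        log_post = np.log(prior)[None, :] + log_lik
        log_post -= log_post.max(axis=1, keepdims=True)  # numerical stability
        post = np.exp(log_post)
        post /= post.sum(axis=1, keepdims=True)
        R[n_lab:] = post  # labeled rows stay one-hot
    return prior, mean, var, post
```

Replacing the soft posteriors with their arg-max would give a hard-assignment self-training loop. In this sketch each unlabeled sample is assigned independently; a structured variant would instead compute the E-step posteriors jointly under a dependency structure among samples, which is not shown here.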


research
11/01/2022

On the Semi-supervised Expectation Maximization

The Expectation Maximization (EM) algorithm is widely used as an iterati...
research
06/19/2019

Semi-supervised Logistic Learning Based on Exponential Tilt Mixture Models

Consider semi-supervised learning for classification, where both labeled...
research
12/05/2022

Rethinking Backdoor Data Poisoning Attacks in the Context of Semi-Supervised Learning

Semi-supervised learning methods can train high-accuracy machine learnin...
research
10/02/2020

Deep Expectation-Maximization for Semi-Supervised Lung Cancer Screening

We present a semi-supervised algorithm for lung cancer screening in whic...
research
06/28/2022

Semi-supervised Contrastive Outlier removal for Pseudo Expectation Maximization (SCOPE)

Semi-supervised learning is the problem of training an accurate predicti...
research
06/27/2012

A Convex Relaxation for Weakly Supervised Classifiers

This paper introduces a general multi-class approach to weakly supervise...
research
02/06/2013

An Information-Theoretic Analysis of Hard and Soft Assignment Methods for Clustering

Assignment methods are at the heart of many algorithms for unsupervised ...
