Fractionally-Supervised Classification

07/13/2013
by   Irene Vrbik, et al.
0

Traditionally, there are three species of classification: unsupervised, supervised, and semi-supervised. Supervised and semi-supervised classification differ by whether or not weight is given to unlabelled observations in the classification procedure. In unsupervised classification, or clustering, all observations are unlabeled and hence full weight is given to unlabelled observations. When some observations are unlabelled, it can be very difficult to a priori choose the optimal level of supervision, and the consequences of a sub-optimal choice can be non-trivial. A flexible fractionally-supervised approach to classification is introduced, where any level of supervision --- ranging from unsupervised to supervised --- can be attained. Our approach uses a weighted likelihood, wherein weights control the relative role that labelled and unlabelled data have in building a classifier. A comparison between our approach and the traditional species is presented using simulated and real data. Gaussian mixture models are used as a vehicle to illustrate our fractionally-supervised classification approach; however, it is broadly applicable and variations on the postulated model can be easily made.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/13/2018

Clustering and Semi-Supervised Classification for Clickstream Data via Mixture Models

Finite mixture models have been used for unsupervised learning for over ...
research
09/24/2017

On Fractionally-Supervised Classification: Weight Selection and Extension to the Multivariate t-Distribution

Recent work on fractionally-supervised classification (FSC), an approach...
research
10/08/2021

Automated Feature-Specific Tree Species Identification from Natural Images using Deep Semi-Supervised Learning

Prior work on plant species classification predominantly focuses on buil...
research
05/03/2017

Semi-supervised cross-entropy clustering with information bottleneck constraint

In this paper, we propose a semi-supervised clustering method, CEC-IB, t...
research
11/20/2019

Parsimonious Mixtures of Matrix Variate Bilinear Factor Analyzers

Over the years, data have become increasingly higher dimensional, which ...
research
10/15/2020

Semi-supervised NMF Models for Topic Modeling in Learning Tasks

We propose several new models for semi-supervised nonnegative matrix fac...
research
02/10/2021

Dynamic β-VAEs for quantifying biodiversity by clustering optically recorded insect signals

While insects are the largest and most diverse group of animals, constit...

Please sign up or login with your details

Forgot password? Click here to reset