Learning with Wasserstein barycenters and applications

12/26/2019
by   G. Domazakis, et al.
14

In this work, learning schemes for measure-valued data are proposed, i.e. data that their structure can be more efficiently represented as probability measures instead of points on ^d, employing the concept of probability barycenters as defined with respect to the Wasserstein metric. Such type of learning approaches are highly appreciated in many fields where the observational/experimental error is significant (e.g. astronomy, biology, remote sensing, etc.) or the data nature is more complex and the traditional learning algorithms are not applicable or effective to treat them (e.g. network data, interval data, high frequency records, matrix data, etc.). Under this perspective, each observation is identified by an appropriate probability measure and the proposed statistical learning schemes rely on discrimination criteria that utilize the geometric structure of the space of probability measures through core techniques from the optimal transport theory. The discussed approaches are implemented in two real world applications: (a) clustering eurozone countries according to their observed government bond yield curves and (b) classifying the areas of a satellite image to certain land uses categories which is a standard task in remote sensing. In both case studies the results are particularly interesting and meaningful while the accuracy obtained is high.

READ FULL TEXT

page 15

page 16

page 20

research
01/03/2022

Transport type metrics on the space of probability measures involving singular base measures

We develop the theory of a metric, which we call the ν-based Wasserstein...
research
02/22/2022

The Winning Solution to the iFLYTEK Challenge 2021 Cultivated Land Extraction from High-Resolution Remote Sensing Image

Extracting cultivated land accurately from high-resolution remote images...
research
09/21/2022

Quantitative Stability of Barycenters in the Wasserstein Space

Wasserstein barycenters define averages of probability measures in a geo...
research
04/15/2013

A new Bayesian ensemble of trees classifier for identifying multi-class labels in satellite images

Classification of satellite images is a key component of many remote sen...
research
03/01/2022

Exploring Wilderness Using Explainable Machine Learning in Satellite Imagery

Wilderness areas offer important ecological and social benefits, and the...
research
12/29/2013

Probabilistic Archetypal Analysis

Archetypal analysis represents a set of observations as convex combinati...

Please sign up or login with your details

Forgot password? Click here to reset