Clustering above Exponential Families with Tempered Exponential Measures

11/04/2022
by   Ehsan Amid, et al.

The link with exponential families has allowed k-means clustering to be generalized to a wide variety of data-generating distributions in exponential families and to clustering distortions among Bregman divergences. Getting the framework to work above exponential families is important to lift roadblocks such as the lack of robustness of some population minimizers, which is carved into their axiomatization. Current generalizations of exponential families, like q-exponential families or even deformed exponential families, fail to achieve this goal. In this paper, we provide a new attempt at getting the complete framework, grounded in a new generalization of exponential families that we introduce: tempered exponential measures (TEMs). TEMs keep the maximum entropy axiomatization framework of q-exponential families but, instead of normalizing the measure, normalize a dual called a co-distribution. Numerous interesting properties arise for clustering, such as improved and controllable robustness for population minimizers that keep a simple analytic form.
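
For readers unfamiliar with the classical setting the abstract starts from, the following is a minimal sketch of k-means-style hard clustering with a Bregman divergence. It is not the TEM-based method of the paper. The function names (`bregman_divergence`, `bregman_kmeans`, `sq_norm`, `sq_norm_grad`) and the choice of generator are assumptions made for illustration. It relies on the standard fact that, for any Bregman divergence taken in this argument order, the population minimizer of a cluster is its arithmetic mean.

```python
import numpy as np

def bregman_divergence(phi, grad_phi, x, mu):
    """D_phi(x, mu) = phi(x) - phi(mu) - <x - mu, grad_phi(mu)>, row-wise over x."""
    return phi(x) - phi(mu) - np.sum((x - mu) * grad_phi(mu), axis=-1)

# Illustrative generator: phi(x) = 0.5 * ||x||^2, which yields half the
# squared Euclidean distance, i.e. ordinary k-means.
def sq_norm(x):
    return 0.5 * np.sum(x * x, axis=-1)

def sq_norm_grad(x):
    return x

def bregman_kmeans(X, init_centers, phi, grad_phi, n_iter=20):
    """Lloyd-style alternation: assign by divergence, then re-center by the mean."""
    X = np.asarray(X, dtype=float)
    centers = np.array(init_centers, dtype=float)
    labels = np.zeros(len(X), dtype=int)
    for _ in range(n_iter):
        # Divergence of every point to every center, shape (n_points, n_centers).
        d = np.stack(
            [bregman_divergence(phi, grad_phi, X, c) for c in centers], axis=1
        )
        labels = np.argmin(d, axis=1)
        for j in range(len(centers)):
            members = X[labels == j]
            if len(members) > 0:
                # The minimizer of the average divergence is the arithmetic mean.
                centers[j] = members.mean(axis=0)
    return centers, labels

X = np.array([[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]])
centers, labels = bregman_kmeans(X, [[0.0, 0.0], [10.0, 10.0]], sq_norm, sq_norm_grad)
```

Because the re-centering step is always an arithmetic mean, a single distant outlier can move a center arbitrarily far. This is one way to read the robustness roadblock the abstract refers to.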


