Kernel Deformed Exponential Families for Sparse Continuous Attention

11/01/2021
by Alexander Moreno, et al.

Attention mechanisms take an expectation of a data representation with respect to probability weights, producing summary statistics that focus on important features. Recently, Martins et al. (2020, 2021) proposed continuous attention mechanisms, focusing on unimodal attention densities from the exponential and deformed exponential families; the latter have sparse support. Farinhas et al. (2021) extended this approach to Gaussian mixture attention densities, a flexible class with dense support. In this paper, we extend continuous attention to two general, flexible classes: kernel exponential families and our new sparse counterpart, kernel deformed exponential families. Theoretically, we show new existence results for both kernel exponential and kernel deformed exponential families, and we show that the deformed case has approximation capabilities similar to those of kernel exponential families. Experiments show that kernel deformed exponential families can attend to multiple compact regions of the data domain.
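To make the idea of sparse, multimodal continuous attention concrete, the following is a minimal numerical sketch and not the authors' implementation. It assumes a one-dimensional domain [0, 1], a Gaussian (RBF) kernel, a hand-picked kernel expansion f, and an unnormalized density of the form [1 + (α − 1) f(t)]_+^{1/(α − 1)} that is normalized by a Riemann sum. The names `deformed_exp`, `rbf_kernel`, and `attention_density`, as well as all parameter values, are hypothetical and chosen only for illustration.

```python
import numpy as np


def deformed_exp(u, alpha=2.0):
    """Deformed exponential [1 + (alpha - 1) u]_+^(1 / (alpha - 1)), for alpha > 1.

    The clipping at zero is what produces exactly-zero density values.
    """
    return np.maximum(1.0 + (alpha - 1.0) * u, 0.0) ** (1.0 / (alpha - 1.0))


def rbf_kernel(t, centers, bandwidth):
    """Gaussian kernel matrix between grid points t and kernel centers."""
    diff = t[:, None] - centers[None, :]
    return np.exp(-0.5 * (diff / bandwidth) ** 2)


def attention_density(t, centers, weights, bandwidth=0.05, offset=-2.0, alpha=2.0):
    """Sparse density on a uniform grid t, normalized numerically (assumed form)."""
    # f is a kernel expansion plus a constant offset (illustrative choice).
    f = rbf_kernel(t, centers, bandwidth) @ weights + offset
    unnorm = deformed_exp(f, alpha)
    dt = t[1] - t[0]
    return unnorm / (unnorm.sum() * dt)


# Uniform grid over the domain and a toy two-dimensional value function V(t).
t = np.linspace(0.0, 1.0, 1001)
dt = t[1] - t[0]
V = np.stack([np.sin(2 * np.pi * t), np.cos(2 * np.pi * t)], axis=1)

# Two kernel centers give two separate compact regions of nonzero attention.
centers = np.array([0.25, 0.75])
weights = np.array([2.0, 1.5])
p = attention_density(t, centers, weights)

# Attention output: expectation of V under p, approximated by a Riemann sum.
context = (p[:, None] * V).sum(axis=0) * dt
```

In this sketch the density is exactly zero away from the two kernel centers, so the resulting context vector depends only on values of V inside two disjoint intervals. The grid-based normalization is a convenience for illustration; it is an assumption here rather than something stated in the abstract.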


