An information theoretic model for summarization, and some basic results

01/18/2019
by   Eric Graves, et al.
0

A basic information theoretic model for summarization is formulated. Here summarization is considered as the process of taking a report of v binary objects, and producing from it a j element subset that captures most of the important features of the original report, with importance being defined via an arbitrary set function endemic to the model. The loss of information is then measured by a weight average of variational distances, which we term the semantic loss. Our results include both cases where the probability distribution generating the v-length reports are known and unknown. In the case where it is known, our results demonstrate how to construct summarizers which minimize the semantic loss. For the case where the probability distribution is unknown, we show how to construct summarizers whose semantic loss when averaged uniformly over all possible distribution converges to the minimum.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/20/2020

Audio Summarization with Audio Features and Probability Distribution Divergence

The automatic summarization of multimedia sources is an important task t...
research
04/08/2021

An Information-Theoretic Proof of a Finite de Finetti Theorem

A finite form of de Finetti's representation theorem is established usin...
research
09/23/2016

Estimating Probability Distributions using "Dirac" Kernels (via Rademacher-Walsh Polynomial Basis Functions)

In many applications (in particular information systems, such as pattern...
research
06/09/2017

Assessing the Performance of Deep Learning Algorithms for Newsvendor Problem

In retailer management, the Newsvendor problem has widely attracted atte...
research
05/23/2019

On the Average Case of MergeInsertion

MergeInsertion, also known as the Ford-Johnson algorithm, is a sorting a...
research
11/12/2018

Joint Probability Distribution of Prediction Errors of ARIMA

Producing probabilistic guarantee for several steps of a predicted signa...
research
12/06/2021

Prototypical Model with Novel Information-theoretic Loss Function for Generalized Zero Shot Learning

Generalized zero shot learning (GZSL) is still a technical challenge of ...

Please sign up or login with your details

Forgot password? Click here to reset