Learning Determinantal Point Processes

02/14/2012
by   Alex Kulesza, et al.
0

Determinantal point processes (DPPs), which arise in random matrix theory and quantum physics, are natural models for subset selection problems where diversity is preferred. Among many remarkable properties, DPPs offer tractable algorithms for exact inference, including computing marginal probabilities and sampling; however, an important open question has been how to learn a DPP from labeled training data. In this paper we propose a natural feature-based parameterization of conditional DPPs, and show how it leads to a convex and efficient learning formulation. We analyze the relationship between our model and binary Markov random fields with repulsive potentials, which are qualitatively similar but computationally intractable. Finally, we apply our approach to the task of extractive summarization, where the goal is to choose a small subset of sentences conveying the most important information from a set of documents. In this task there is a fundamental tradeoff between sentences that are highly relevant to the collection as a whole, and sentences that are diverse and not repetitive. Our parameterization allows us to naturally balance these two characteristics. We evaluate our system on data from the DUC 2003/04 multi-document summarization task, achieving state-of-the-art results.

READ FULL TEXT
research
07/25/2012

Determinantal point processes for machine learning

Determinantal point processes (DPPs) are elegant probabilistic models of...
research
10/19/2016

Learning Determinantal Point Processes in Sublinear Time

We propose a new class of determinantal point processes (DPPs) which can...
research
09/08/2023

Unsupervised Multi-document Summarization with Holistic Inference

Multi-document summarization aims to obtain core information from a coll...
research
05/31/2019

Improving the Similarity Measure of Determinantal Point Processes for Extractive Multi-Document Summarization

The most important obstacles facing multi-document summarization include...
research
08/05/2017

Extractive Multi Document Summarization using Dynamical Measurements of Complex Networks

Due to the large amount of textual information available on Internet, it...
research
05/28/2021

Towards Deterministic Diverse Subset Sampling

Determinantal point processes (DPPs) are well known models for diverse s...
research
10/24/2019

Multi-Document Summarization with Determinantal Point Processes and Contextualized Representations

Emerged as one of the best performing techniques for extractive summariz...

Please sign up or login with your details

Forgot password? Click here to reset