Decomposable Families of Itemsets

06/16/2020
by   Nikolaj Tatti, et al.
0

The problem of selecting a small, yet high quality subset of patterns from a larger collection of itemsets has recently attracted lot of research. Here we discuss an approach to this problem using the notion of decomposable families of itemsets. Such itemset families define a probabilistic model for the data from which the original collection of itemsets has been derived from. Furthermore, they induce a special tree structure, called a junction tree, familiar from the theory of Markov Random Fields. The method has several advantages. The junction trees provide an intuitive representation of the mining results. From the computational point of view, the model provides leverage for problems that could be intractable using the entire collection of itemsets. We provide an efficient algorithm to build decomposable itemset families, and give an application example with frequency bound querying using the model. Empirical results show that our algorithm yields high quality results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/20/2022

Pragmatic isomorphism proofs between Coq representations: application to lambda-term families

There are several ways to formally represent families of data, such as l...
research
12/28/2018

HUOPM: High Utility Occupancy Pattern Mining

Mining useful patterns from varied types of databases is an important re...
research
09/02/2021

Quantum algorithm for structure learning of Markov Random Fields

Markov random fields (MRFs) appear in many problems in machine learning ...
research
02/13/2018

Some Information Inequalities for Statistical Inference

In this paper, we first describe the generalized notion of Cramer-Rao lo...
research
02/18/2002

Nonmonotonic Reasoning, Preferential Models and Cumulative Logics

Many systems that exhibit nonmonotonic behavior have been described and ...
research
12/25/2022

Estimator selection for regression functions in exponential families with application to changepoint detection

We observe n independent pairs of random variables (W_i, Y_i) for which ...
research
03/13/2014

Scalable and Robust Construction of Topical Hierarchies

Automated generation of high-quality topical hierarchies for a text coll...

Please sign up or login with your details

Forgot password? Click here to reset