Pre and Post Counting for Scalable Statistical-Relational Model Discovery

10/19/2021
by   Richard Mar, et al.
0

Statistical-Relational Model Discovery aims to find statistically relevant patterns in relational data. For example, a relational dependency pattern may stipulate that a user's gender is associated with the gender of their friends. As with propositional (non-relational) graphical models, the major scalability bottleneck for model discovery is computing instantiation counts: the number of times a relational pattern is instantiated in a database. Previous work on propositional learning utilized pre-counting or post-counting to solve this task. This paper takes a detailed look at the memory and speed trade-offs between pre-counting and post-counting strategies for relational learning. A pre-counting approach computes and caches instantiation counts for a large set of relational patterns before model search. A post-counting approach computes an instantiation count dynamically on-demand for each candidate pattern generated during the model search. We describe a novel hybrid approach, tailored to relational data, that achieves a sweet spot with pre-counting for patterns involving positive relationships (e.g. pairs of users who are friends) and post-counting for patterns involving negative relationships (e.g. pairs of users who are not friends). Our hybrid approach scales model discovery to millions of data facts.

READ FULL TEXT

page 6

page 7

research
09/10/2018

Neural Latent Relational Analysis to Capture Lexical Semantic Relations in a Vector Space

Capturing the semantic relations of words in a vector space contributes ...
research
03/23/2021

On Counting Propositional Logic

We study counting propositional logic as an extension of propositional l...
research
07/28/2015

Projected Model Counting

Model counting is the task of computing the number of assignments to var...
research
10/28/2022

Some Remarks on Counting Propositional Logic

Counting propositional logic was recently introduced in relation to rand...
research
06/01/2011

The Good Old Davis-Putnam Procedure Helps Counting Models

As was shown recently, many important AI problems require counting the n...
research
09/18/2017

Relational Marginal Problems: Theory and Estimation

In the propositional setting, the marginal problem is to find a (maximum...
research
04/11/2022

Segmentation-Consistent Probabilistic Lesion Counting

Lesion counts are important indicators of disease severity, patient prog...

Please sign up or login with your details

Forgot password? Click here to reset