Model-based Exception Mining for Object-Relational Data

07/01/2018
by   Fatemeh Riahi, et al.
0

This paper is based on a previous publication [29]. Our work extends exception mining and outlier detection to the case of object-relational data. Object-relational data represent a complex heterogeneous network [12], which comprises objects of different types, links among these objects, also of different types, and attributes of these links. This special structure prohibits a direct vectorial data representation. We follow the well-established Exceptional Model Mining framework, which leverages machine learning models for exception mining: A object is exceptional to the extent that a model learned for the object data differs from a model learned for the general population. Exceptional objects can be viewed as outliers. We apply state of-the-art probabilistic modelling techniques for object-relational data that construct a graphical model (Bayesian network), which compactly represents probabilistic associations in the data. A new metric, derived from the learned object-relational model, quantifies the extent to which the individual association pattern of a potential outlier deviates from that of the whole population. The metric is based on the likelihood ratio of two parameter vectors: One that represents the population associations, and another that represents the individual associations. Our method is validated on synthetic datasets and on real-world data sets about soccer matches and movies. Compared to baseline methods, our novel transformed likelihood ratio achieved the best detection accuracy on all datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/05/2012

Compiling Relational Database Schemata into Probabilistic Graphical Models

Instead of requiring a domain expert to specify the probabilistic depend...
research
06/15/2013

Outlying Property Detection with Numerical Attributes

The outlying property detection problem is the problem of discovering th...
research
04/21/2022

Fluctuation-based Outlier Detection

Outlier detection is an important topic in machine learning and has been...
research
06/15/2021

Robust Out-of-Distribution Detection on Deep Probabilistic Generative Models

Out-of-distribution (OOD) detection is an important task in machine lear...
research
05/10/2020

HNet: Graphical Hypergeometric Networks

Motivation: Real-world data often contain measurements with both continu...
research
07/09/2021

Redescription Model Mining

This paper introduces Redescription Model Mining, a novel approach to id...
research
11/04/2022

Neural RELAGGS

Multi-relational databases are the basis of most consolidated data colle...

Please sign up or login with your details

Forgot password? Click here to reset