The role of semantics in mining frequent patterns from knowledge bases in description logics with rules

03/13/2010
by   Joanna Jozefowska, et al.
0

We propose a new method for mining frequent patterns in a language that combines both Semantic Web ontologies and rules. In particular we consider the setting of using a language that combines description logics with DL-safe rules. This setting is important for the practical application of data mining to the Semantic Web. We focus on the relation of the semantics of the representation formalism to the task of frequent pattern discovery, and for the core of our method, we propose an algorithm that exploits the semantics of the combined knowledge base. We have developed a proof-of-concept data mining implementation of this. Using this we have empirically shown that using the combined knowledge base to perform semantic tests can make data mining faster by pruning useless candidate patterns before their evaluation. We have also shown that the quality of the set of patterns produced may be improved: the patterns are more compact, and there are fewer patterns. We conclude that exploiting the semantics of a chosen representation formalism is key to the design and application of (onto-)relational frequent pattern discovery methods. Note: To appear in Theory and Practice of Logic Programming (TPLP)

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/10/2012

Abstract Representations and Frequent Pattern Discovery

We discuss the frequent pattern mining problem in a general setting. Fro...
research
08/17/2017

When data mining meets optimization: A case study on the quadratic assignment problem

This paper presents a hybrid approach called frequent pattern based sear...
research
10/26/2021

cgSpan: Pattern Mining in Conceptual Graphs

Conceptual Graphs (CGs) are a graph-based knowledge representation forma...
research
04/27/2018

Modified Apriori Graph Algorithm for Frequent Pattern Mining

Web Usage Mining is an application of Data Mining Techniques to discover...
research
01/11/2021

The Semantic Adjacency Criterion in Time Intervals Mining

Frequent temporal patterns discovered in time-interval-based multivariat...
research
10/20/2017

Verb Pattern: A Probabilistic Semantic Representation on Verbs

Verbs are important in semantic understanding of natural language. Tradi...
research
06/24/2019

Query-driven PAC-Learning for Reasoning

We consider the problem of learning rules from a data set that support a...

Please sign up or login with your details

Forgot password? Click here to reset