Statistically Significant Discriminative Patterns Searching

06/02/2019
by   Hoang Son Pham, et al.

Discriminative pattern mining is an essential data mining task. It aims to discover patterns that occur more frequently in one class than in the other classes of a class-labeled dataset. Such patterns are valuable in domains such as bioinformatics and data classification. In this paper, we propose a novel algorithm, named SSDPS, to discover such patterns in two-class datasets. SSDPS owes its efficiency to an original pattern enumeration strategy, which allows it to exploit some degree of anti-monotonicity in the measures of discriminance and statistical significance. Experimental results demonstrate that SSDPS performs better than the algorithms it is compared with and generates far fewer patterns. Experiments on real data also show that SSDPS efficiently detects combinations of multiple SNPs in genetic data.
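
To make the notion of a discriminative, statistically significant pattern concrete, the following minimal Python sketch scores a single candidate itemset on a toy two-class dataset. It is not the SSDPS algorithm and does not implement its enumeration strategy. The toy transactions, the helper names `support` and `fisher_one_sided`, and the choice of a one-sided Fisher exact test as the significance measure are all assumptions made for illustration; the abstract does not say which measures SSDPS uses.

```python
from math import comb

def support(pattern, transactions):
    """Count the transactions that contain every item of the pattern."""
    return sum(1 for t in transactions if pattern <= t)

def fisher_one_sided(a, b, c, d):
    """One-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]].

    a = class-1 transactions with the pattern, b = class-1 without,
    c = class-2 transactions with the pattern, d = class-2 without.
    Returns P(X >= a) under the hypergeometric null hypothesis.
    """
    n1, n2, k = a + b, c + d, a + c
    total = comb(n1 + n2, k)
    upper = min(n1, k)
    tail = sum(comb(n1, x) * comb(n2, k - x) for x in range(a, upper + 1))
    return tail / total

# Hypothetical toy data: each transaction is a set of items.
class1 = [{"a", "b", "c"}, {"a", "b"}, {"a", "b", "d"}, {"b", "c"}]
class2 = [{"a", "c"}, {"b", "d"}, {"c", "d"}, {"a", "b"}]

pattern = frozenset({"a", "b"})  # candidate pattern to evaluate

s1 = support(pattern, class1)  # occurrences in class 1
s2 = support(pattern, class2)  # occurrences in class 2

# Relative frequency of the pattern in each class.
freq1 = s1 / len(class1)
freq2 = s2 / len(class2)

# Significance of the over-representation in class 1 (illustrative choice).
p_value = fisher_one_sided(s1, len(class1) - s1, s2, len(class2) - s2)

print(f"support: class1={s1}, class2={s2}")
print(f"frequency: class1={freq1:.2f}, class2={freq2:.2f}")
print(f"one-sided Fisher p-value: {p_value:.4f}")
```

In this sketch a pattern would be reported only if its frequency in class 1 clearly exceeds its frequency in class 2 and the p-value falls below a chosen threshold. Scoring every candidate this way is exponential in the number of items, which is the cost that an enumeration strategy exploiting anti-monotonicity is meant to reduce.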


