Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining

02/15/2016
by   Kazuya Nakagawa, et al.
0

In this paper we study predictive pattern mining problems where the goal is to construct a predictive model based on a subset of predictive patterns in the database. Our main contribution is to introduce a novel method called safe pattern pruning (SPP) for a class of predictive pattern mining problems. The SPP method allows us to efficiently find a superset of all the predictive patterns in the database that are needed for the optimal predictive model. The advantage of the SPP method over existing boosting-type method is that the former can find the superset by a single search over the database, while the latter requires multiple searches. The SPP method is inspired by recent development of safe feature screening. In order to extend the idea of safe feature screening into predictive pattern mining, we derive a novel pruning rule called safe pattern pruning (SPP) rule that can be used for searching over the tree defined among patterns in the database. The SPP rule has a property that, if a node corresponding to a pattern in the database is pruned out by the SPP rule, then it is guaranteed that all the patterns corresponding to its descendant nodes are never needed for the optimal predictive model. We apply the SPP method to graph mining and item-set mining problems, and demonstrate its computational advantage.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/23/2023

Efficient Model Selection for Predictive Pattern Mining Model by Safe Pattern Pruning

Predictive pattern mining is an approach used to construct prediction mo...
research
06/26/2015

Safe Feature Pruning for Sparse High-Order Interaction Models

Taking into account high-order interactions among covariates is valuable...
research
10/03/2018

Learning sparse optimal rule fit by safe screening

In this paper, we consider linear prediction models in the form of a spa...
research
01/25/2019

An Optimized Pattern Recognition Algorithm for Anomaly Detection in IoT Environment

With the advent of large-scale heterogeneous search engines comes the pr...
research
04/17/2020

Efficient Constrained Pattern Mining Using Dynamic Item Ordering for Explainable Classification

Learning of interpretable classification models has been attracting much...
research
02/15/2016

Selective Inference Approach for Statistically Sound Predictive Pattern Mining

Discovering statistically significant patterns from databases is an impo...
research
04/26/2018

Extended Vertical Lists for Temporal Pattern Mining from Multivariate Time Series

Temporal Pattern Mining (TPM) is the problem of mining predictive comple...

Please sign up or login with your details

Forgot password? Click here to reset