Efficient Model Selection for Predictive Pattern Mining Model by Safe Pattern Pruning

06/23/2023
by   Takumi Yoshida, et al.
0

Predictive pattern mining is an approach used to construct prediction models when the input is represented by structured data, such as sets, graphs, and sequences. The main idea behind predictive pattern mining is to build a prediction model by considering substructures, such as subsets, subgraphs, and subsequences (referred to as patterns), present in the structured data as features of the model. The primary challenge in predictive pattern mining lies in the exponential growth of the number of patterns with the complexity of the structured data. In this study, we propose the Safe Pattern Pruning (SPP) method to address the explosion of pattern numbers in predictive pattern mining. We also discuss how it can be effectively employed throughout the entire model building process in practical data analysis. To demonstrate the effectiveness of the proposed method, we conduct numerical experiments on regression and classification problems involving sets, graphs, and sequences.

READ FULL TEXT

page 1

page 2

page 8

page 9

page 10

page 11

page 25

page 26

research
02/15/2016

Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining

In this paper we study predictive pattern mining problems where the goal...
research
11/26/2011

Pattern-Based Classification: A Unifying Perspective

The use of patterns in predictive models is a topic that has received a ...
research
01/23/2022

Dichotomic Pattern Mining with Applications to Intent Prediction from Semi-Structured Clickstream Datasets

We introduce a pattern mining framework that operates on semi-structured...
research
02/24/2021

HiPaR: Hierarchical Pattern-aided Regression

We introduce HiPaR, a novel pattern-aided regression method for tabular ...
research
12/19/2019

FIBS: A Generic Framework for Classifying Interval-based Temporal Sequences

We study the problem of classification of interval-based temporal sequen...
research
06/17/2022

KitBit: A New AI Model for Solving Intelligence Tests and Numerical Series

The resolution of intelligence tests, in particular numerical sequences,...
research
12/17/2021

cgSpan: Closed Graph-Based Substructure Pattern Mining

gSpan is a popular algorithm for mining frequent subgraphs. cgSpan (clos...

Please sign up or login with your details

Forgot password? Click here to reset