A Pipeline for Business Intelligence and Data-Driven Root Cause Analysis on Categorical Data

11/12/2022
by   Shubham Thakar, et al.

Business intelligence (BI) is any knowledge derived from existing data that can be applied strategically within a business. Data mining is a set of techniques for extracting BI from data using statistical data modeling. Relationships or correlations found among the collected data items can be used to improve business performance or, at the very least, to better understand what is happening. Root cause analysis (RCA) is the process of discovering the root causes of problems or events in order to identify appropriate solutions. RCA can show why an event occurred, which helps prevent the issue from recurring. This paper proposes a new pipeline that combines clustering with association rule mining to extract business insights from data. The pipeline outputs association rules, each consisting of antecedents and consequents, along with various metrics for evaluating the rules. These results can help anchor important business decisions, and data scientists can use them when updating existing models or developing new ones. Because the antecedents of a generated rule explain the occurrence of the event in its consequent, the output can also support data-driven root cause analysis.
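The abstract does not name the specific clustering algorithm, rule miner, or metrics, so the following is only a minimal sketch of what a clustering plus association rule mining pipeline on categorical data might look like. It assumes a k-modes-style clustering with Hamming distance, brute-force itemset counting, and the common support, confidence, and lift metrics. The toy records, the attribute names, and the helpers `k_modes`, `mine_rules`, and `cluster_then_mine` are all hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy records; every attribute is categorical.
ROWS = [
    {"plan": "basic", "tickets": "high", "churn": "yes"},
    {"plan": "basic", "tickets": "high", "churn": "yes"},
    {"plan": "basic", "tickets": "high", "churn": "yes"},
    {"plan": "basic", "tickets": "low", "churn": "no"},
    {"plan": "premium", "tickets": "low", "churn": "no"},
    {"plan": "premium", "tickets": "low", "churn": "no"},
    {"plan": "premium", "tickets": "low", "churn": "no"},
    {"plan": "premium", "tickets": "high", "churn": "yes"},
]


def hamming(a, b):
    """Number of attributes on which two records disagree."""
    return sum(a[key] != b[key] for key in a)


def k_modes(rows, k, iters=10):
    """Assumed clustering step: k-modes seeded with the first k distinct rows."""
    modes = []
    for row in rows:
        if row not in modes:
            modes.append(dict(row))
        if len(modes) == k:
            break
    labels = []
    for _ in range(iters):
        # Assign each record to the nearest mode.
        labels = [min(range(len(modes)), key=lambda j: hamming(row, modes[j]))
                  for row in rows]
        # Recompute each mode as the per-attribute most frequent value.
        new_modes = []
        for j, old in enumerate(modes):
            members = [r for r, lab in zip(rows, labels) if lab == j]
            if not members:
                new_modes.append(old)
                continue
            new_modes.append({key: Counter(m[key] for m in members).most_common(1)[0][0]
                              for key in old})
        if new_modes == modes:
            break
        modes = new_modes
    return labels


def mine_rules(rows, min_support=0.3, min_confidence=0.7, max_items=3):
    """Brute-force rules with a single consequent item, scored by three metrics."""
    n = len(rows)
    counts = Counter()
    for row in rows:
        items = sorted(row.items())  # each item is an (attribute, value) pair
        for size in range(1, max_items + 1):
            for itemset in combinations(items, size):
                counts[frozenset(itemset)] += 1
    rules = []
    for itemset, count in counts.items():
        support = count / n
        if len(itemset) < 2 or support < min_support:
            continue
        for item in itemset:
            antecedent = itemset - {item}
            confidence = count / counts[antecedent]
            lift = confidence / (counts[frozenset([item])] / n)
            if confidence >= min_confidence:
                rules.append({"antecedent": antecedent, "consequent": item,
                              "support": support, "confidence": confidence,
                              "lift": lift})
    return rules


def cluster_then_mine(rows, k):
    """Pipeline sketch: cluster the records, then mine rules inside each cluster."""
    labels = k_modes(rows, k)
    return {lab: mine_rules([r for r, l in zip(rows, labels) if l == lab])
            for lab in sorted(set(labels))}


if __name__ == "__main__":
    for cluster, rules in cluster_then_mine(ROWS, k=2).items():
        print(cluster, len(rules), "rules")
```

Read as a root cause analysis, a rule such as `tickets=high -> churn=yes` points to its antecedent as a candidate explanation for the event in its consequent. On a real dataset, a dedicated frequent-itemset algorithm would likely replace the brute-force enumeration used here.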


