Human-Guided Data Exploration

04/09/2018
by Andreas Henelius, et al.

The outcome of the explorative data analysis (EDA) phase is vital for successful data analysis. EDA is more effective when the user interacts with the system used to carry out the exploration. In the recently proposed paradigm of iterative data mining, the user controls the exploration by inputting knowledge in the form of patterns observed during the process. The system then shows the user views of the data that are maximally informative given the user's current knowledge. Although this scheme is good at showing the user surprising views of the data, it has a clear shortcoming: the user cannot steer the process. In many real cases, we want to focus on investigating specific questions concerning the data. This paper presents the Human-Guided Data Exploration framework, which generalises previous research. The framework allows the user to incorporate existing knowledge into the exploration process, focus on exploring a subset of the data, and compare different complex hypotheses concerning relations in the data. The framework utilises a computationally efficient constrained randomisation scheme. To showcase the framework, we developed a free open-source tool and used it to carry out an empirical evaluation on real-world datasets. Our evaluation shows that the ability to focus on particular subsets and the ability to compare hypotheses are important additions to the interactive iterative data mining process.

research
05/20/2018

Human-guided data exploration using randomisation

An explorative data analysis system should be aware of what the user alr...
research
05/07/2019

Guided Visual Exploration of Relations in Data Sets

Efficient explorative data analysis systems must take into account both ...
research
02/07/2017

Learning what matters - Sampling interesting patterns

In the field of exploratory data mining, local structure in data can be ...
research
10/23/2017

Interactive Visual Data Exploration with Subjective Feedback: An Information-Theoretic Approach

Visual exploration of high-dimensional real-valued datasets is a fundame...
research
09/30/2022

A Functional Model For Information Exploration Systems

Information exploration tasks are inherently complex, ill-structured, an...
research
07/09/2019

Computer-Aided Data Mining: Automating a Novel Knowledge Discovery and Data Mining Process Model for Metabolomics

This work presents MeKDDaM-SAGA, computer-aided automation software for ...
research
08/19/2022

CohortVA: A Visual Analytic System for Interactive Exploration of Cohorts based on Historical Data

In history research, cohort analysis seeks to identify social structures...
