Visualization in the preprocessing phase: an interview study with enterprise professionals

08/21/2019
by   Alessandra Milani, et al.
0

The current information age has increasingly required organizations to become data-driven. However, analyzing and managing raw data is still a challenging part of the data mining process. Even though we can find interview studies proposing design implications or recommendations for future visualization solutions in the data mining scope, they cover the entire workflow and do not fully focus on the challenges during the preprocessing phase and on how visualization can support it. Moreover, they do not organize a final list of insights consolidating the findings of other related studies. Hence, to better understand the current practice of enterprise professionals in data mining workflows, in particular during the preprocessing phase, and how visualization supports this process, we conducted semi-structured interviews with thirteen data analysts. The discussion about the challenges and opportunities based on the responses of the interviewees resulted in a list of ten insights. This list was compared with the closest related works, improving the reliability of our findings and providing background, as a consolidated set of requirements, for future visualization research papers applied to visual data exploration in data mining. Furthermore, we provide greater details on the profile of the data analysts, the main challenges they face, and the opportunities that arise while they are engaged in data mining projects in diverse organizational areas.

READ FULL TEXT

page 3

page 8

page 9

research
03/11/2021

Data Mining and Visualization to Understand Accident-prone Areas

In this study, we present both data mining and information visualization...
research
03/28/2011

Visualization techniques for data mining of Latur district satellite imagery

This study presents a new visualization tool for classification of satel...
research
01/28/2022

3D Visualization and Spatial Data Mining for Analysis of LULC Images

The present study is an attempt made to create a new tool for the analys...
research
06/20/2019

Preprocessing Methods and Pipelines of Data Mining: An Overview

Data mining is about obtaining new knowledge from existing datasets. How...
research
03/29/2017

Bringing Salary Transparency to the World: Computing Robust Compensation Insights via LinkedIn Salary

The recently launched LinkedIn Salary product has been designed with the...
research
07/16/2008

Text Data Mining: Theory and Methods

This paper provides the reader with a very brief introduction to some of...
research
05/28/2017

Subject Specific Stream Classification Preprocessing Algorithm for Twitter Data Stream

Micro-blogging service Twitter is a lucrative source for data mining app...

Please sign up or login with your details

Forgot password? Click here to reset