Visual Feature Fusion and its Application to Support Unsupervised Clustering Tasks

01/16/2019
by Gladys Hilasaca, et al.

In visual analytics applications, the concept of putting the user in the loop refers to the ability to replace heuristics with user knowledge in machine learning and data mining tasks. In supervised tasks, user engagement occurs through the manipulation of the training data. In unsupervised tasks, however, user involvement is limited to changes in the algorithm parametrization or in the input data representation, also known as features. Depending on the application domain, different types of features can be extracted from the raw data, so the result of an unsupervised algorithm depends heavily on the type of feature employed. Since there is no perfect feature extractor, combining different features has been explored in a process called feature fusion. Feature fusion is straightforward when the machine learning or data mining task has a cost function. When no such function exists, however, user support for the combination must be provided; otherwise the process is impractical. In this paper, we present a novel feature fusion approach that uses small data samples to allow users not only to effortlessly control the combination of different feature sets but also to interpret the attained results. The effectiveness of our approach is confirmed by a comprehensive set of qualitative and quantitative tests, opening up user-guided analytical scenarios not yet covered. The ability of our approach to provide real-time feedback for the feature fusion is exploited in the context of unsupervised clustering techniques, where the composed groups reflect the semantics of the feature combination.
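To make the idea of user-controlled feature fusion concrete, the sketch below shows one generic way to combine feature sets: standardize each set, scale it by the square root of a user-chosen weight, and concatenate. This is an illustrative assumption, not the method of the paper, whose algorithm the abstract does not describe. The names `standardize`, `fuse`, and `nearest`, and the toy `color` and `texture` features, are hypothetical.

```python
import math


def standardize(rows):
    """Scale each column to zero mean and unit variance (constant columns become 0)."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    stds = [
        math.sqrt(sum((v - m) ** 2 for v in c) / len(c))
        for c, m in zip(cols, means)
    ]
    return [
        [(v - m) / s if s > 0 else 0.0 for v, m, s in zip(row, means, stds)]
        for row in rows
    ]


def fuse(feature_sets, weights):
    """Concatenate standardized feature sets, scaling set k by sqrt(w_k / sum(w)).

    With this scaling, the squared Euclidean distance in the fused space is the
    weighted sum of the squared distances in each individual feature space.
    """
    if len(feature_sets) != len(weights):
        raise ValueError("one weight per feature set is required")
    total = sum(weights)
    if total <= 0 or any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative and not all zero")
    scaled_sets = []
    for rows, w in zip(feature_sets, weights):
        scale = math.sqrt(w / total)
        scaled_sets.append([[scale * v for v in row] for row in standardize(rows)])
    # Join the per-set rows of each item into one fused vector.
    return [sum(parts, []) for parts in zip(*scaled_sets)]


def nearest(vectors, i):
    """Index of the item closest to item i (Euclidean distance), excluding i."""
    others = [j for j in range(len(vectors)) if j != i]
    return min(others, key=lambda j: math.dist(vectors[i], vectors[j]))


# Toy data: four items described by two hypothetical one-dimensional feature sets.
color = [[0.0], [0.1], [1.0], [0.9]]
texture = [[0.0], [1.0], [0.1], [0.9]]

by_color = fuse([color, texture], [1.0, 0.0])    # only color matters
by_texture = fuse([color, texture], [0.0, 1.0])  # only texture matters
```

Changing the weights changes which items count as neighbors, and therefore which groups a distance-based clustering algorithm would form on the fused vectors. In this toy data, item 0 is closest to item 1 under the color-only weighting and to item 2 under the texture-only weighting.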


