Integrating Prior Knowledge in Mixed Initiative Social Network Clustering

05/06/2020
by   Alexis Pister, et al.
0

We propose a new paradigm—called PK-clustering—to help social scientists create meaningful clusters in social networks. Many clustering algorithms exist but most social scientists find them difficult to understand, and tools do not provide any guidance to choose algorithms, or to evaluate results taking into account the prior knowledge of the scientists. Our work introduces a new clustering paradigm and a visual analytics user interface that address this issue. It is based on a process that 1) captures the prior knowledge of the scientists as a set of incomplete clusters, 2) runs multiple clustering algorithms (similarly to clustering ensemble methods), 3) visualizes the results of all the algorithms ranked and summarized by how well each algorithm matches the prior knowledge, 5) evaluates the consensus between user-selected algorithms and 6) allows users to review details and iteratively update the acquired knowledge. We describe our paradigm using an initial functional prototype, then provide two examples of use and early feedback from social scientists. We believe our clustering paradigm offers a novel constructive method to iteratively build knowledge while avoiding being overly influenced by the results of often-randomly selected black-box clustering algorithms.

READ FULL TEXT
research
11/20/2019

Discovering New Intents via Constrained Deep Adaptive Clustering with Cluster Refinement

Identifying new user intents is an essential task in the dialogue system...
research
07/12/2018

Decentralized Clustering on Compressed Data without Prior Knowledge of the Number of Clusters

In sensor networks, it is not always practical to set up a fusion center...
research
04/09/2018

Clustrophile 2: Guided Visual Clustering Analysis

Data clustering is a common unsupervised learning method frequently used...
research
11/19/2018

An Influence-based Clustering Model on Twitter

This paper introduces a temporal framework for detecting and clustering ...
research
04/21/2015

Visual analytics in FCA-based clustering

Visual analytics is a subdomain of data analysis which combines both hum...
research
06/24/2021

A review of systematic selection of clustering algorithms and their evaluation

Data analysis plays an indispensable role for value creation in industry...
research
01/25/2022

SQRQuerier: A Visual Querying Framework for Cross-national Survey Data Recycling

Public opinion surveys constitute a powerful tool to study peoples' atti...

Please sign up or login with your details

Forgot password? Click here to reset