Tangles: a new paradigm for clusters and types

06/03/2020
by   Reinhard Diestel, et al.
0

Traditional clustering identifies groups of objects that share certain qualities. Tangles do the converse: they identify groups of qualities that often occur together. They can thereby discover, relate, and structure types: of behaviour, political views, texts, or viruses. If desired, tangles can also be used for direct clustering of objects. They offer a precise, quantitative paradigm suited particularly to fuzzy clusters, since they do not require any `hard' assignments of objects to the clusters they collectively form. This is a draft of the introductory chapter of a book I am preparing on the application of tangles in the empirical sciences. The purpose of posting this draft early is to give authors of tangle application papers a generic reference for the basic guiding principles underlying tangle applications outside mathematics, so that in their own papers they can concentrate on the ideas specific to their particular application rather than having to repeat the generic story each time. The text starts with three separate generic introductions to tangles in the natural sciences, in the social sciences, and in data science including machine learning. It then gives a short informal description of the abstract notion of tangles that encompasses all these potential applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/01/2018

MaxMin Linear Initialization for Fuzzy C-Means

Clustering is an extensive research area in data science. The aim of clu...
research
02/17/2015

Computational Curiosity (A Book Draft)

This book discusses computational curiosity, from the psychology of curi...
research
04/12/2018

Clustering via Boundary Erosion

Clustering analysis identifies samples as groups based on either their m...
research
09/29/2020

Towards decolonising computational sciences

This article sets out our perspective on how to begin the journey of dec...
research
08/06/2010

Data visualization in political and social sciences

The basic objective of data visualization is to provide an efficient gra...
research
02/23/2021

CAC: A Clustering Based Framework for Classification

In data containing heterogeneous subpopulations, classification performa...
research
12/17/2019

Garbage In, Garbage Out? Do Machine Learning Application Papers in Social Computing Report Where Human-Labeled Training Data Comes From?

Many machine learning projects for new application areas involve teams o...

Please sign up or login with your details

Forgot password? Click here to reset