Ultrametric Component Analysis with Application to Analysis of Text and of Emotion

09/14/2013
by Fionn Murtagh, et al.

We review the theory and practice of determining which parts of a data set are ultrametric. We assume that the data set is initially endowed with a metric, and we discuss how a metric can be obtained when only a dissimilarity is available. Whether part of the metric-endowed data set is ultrametric is determined by considering triplets of the observables (vectors). We develop a novel consensus of hierarchical clusterings in order to provide a framework, including visualization and support for interpretation, for the parts of the data that are determined to be ultrametric. A further major objective is to determine locally ultrametric relationships, as opposed to non-local ultrametric relationships. As part of this work, we also study a particular property of our ultrametricity coefficient, namely that it is a function of the difference between the base angles of the isosceles triangle. The work is completed by a review of related work on consensus hierarchies, and by a major new application: quantifying and interpreting the emotional content of narrative.
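
The triplet-based idea in the abstract can be illustrated with a small sketch. In an ultrametric space every triangle is isosceles with a small base, or equilateral, so one plausible check is to see whether the two largest angles of each triangle (the base angles) are nearly equal. The code below is only an illustration under stated assumptions and is not the authors' implementation: the function names, the Euclidean distance, the 2-degree tolerance, and the handling of coincident points are all hypothetical choices made here.

```python
import itertools
import math


def triangle_angles(a, b, c):
    """Interior angles (degrees) opposite sides a, b, c, via the law of cosines."""
    def angle_opposite(opp, s1, s2):
        cos_val = (s1 * s1 + s2 * s2 - opp * opp) / (2.0 * s1 * s2)
        # Clamp to [-1, 1] to guard against floating-point drift.
        return math.degrees(math.acos(max(-1.0, min(1.0, cos_val))))
    return angle_opposite(a, b, c), angle_opposite(b, a, c), angle_opposite(c, a, b)


def is_ultrametric_triplet(x, y, z, tol_deg=2.0):
    """True if the triangle x, y, z is (approximately) isosceles with a small
    base, or equilateral. tol_deg is an assumed tolerance, not a value taken
    from the abstract."""
    a, b, c = math.dist(y, z), math.dist(x, z), math.dist(x, y)
    if min(a, b, c) == 0.0:
        return False  # coincident points are simply rejected (an assumption)
    # The smallest angle is at the apex; the two largest are the base angles.
    _, middle, largest = sorted(triangle_angles(a, b, c))
    return (largest - middle) <= tol_deg


def ultrametricity_coefficient(points, tol_deg=2.0):
    """Fraction of all point triplets that pass the base-angle test."""
    triplets = list(itertools.combinations(points, 3))
    if not triplets:
        raise ValueError("need at least three points")
    hits = sum(is_ultrametric_triplet(x, y, z, tol_deg) for x, y, z in triplets)
    return hits / len(triplets)
```

Under these assumptions, an equilateral triangle passes the test, while three collinear, equally spaced points do not. Enumerating all triplets costs O(n^3), so for larger data sets one would presumably sample triplets instead.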

research
01/04/2022

The cluster structure function

For each partition of a data set into a given number of parts there is a...
research
04/03/2022

On Angles in Higher Order Brillouin Tessellations and Related Tilings in the Plane

For a locally finite set in ℝ^2, the order-k Brillouin tessellations for...
research
11/03/2021

Local Structure and effective Dimensionality of Time Series Data Sets

The goal of this paper is to develop novel tools for understanding the l...
research
06/23/2020

ABID: Angle Based Intrinsic Dimensionality

The intrinsic dimensionality refers to the “true” dimensionality of the ...
research
09/13/2021

Multiple Linear Regression and Correlation: A Geometric Analysis

In this review article we consider linear regression analysis from a geo...
research
02/23/2021

The SmartSHARK Repository Mining Data

The SmartSHARK repository mining data is a collection of rich and detail...
research
05/21/2018

A Text Analysis of Federal Reserve meeting minutes

Recent developments in monetary policy by the Federal Reserve has create...