Visualizing class specific heterogeneous tendencies in categorical data

11/05/2018
by   Mariko Takagishi, et al.
0

In multiple correspondence analysis, both individuals (observations) and categories can be represented in a biplot. In this biplot, relationships between categories, between individuals, as well as the associations between individuals and categories, are depicted jointly. It can be useful to add information regarding the individuals to enhance interpretation. Such additional information can consist, for example, of a set of categorical variables for which the interdependencies are not of immediate concern, but that might assist in interpreting the plot, and in particular, with respect to the relationships between individuals and categories. In this paper, we propose a new method for adding such additional information. We introduce a multiple set cluster correspondence analysis approach that finds clusters specific for classes, defined as subsets of the data corresponding to the categories of the additional variables. Our method can be used to construct a biplot that visualizes heterogeneous tendencies of the individuals, as well as their relationship with respect to the original categorical variables. We investigate the performance of the proposed method through a simulation study and we apply it to a data set regarding road accidents in the United Kingdom.

READ FULL TEXT
research
02/28/2007

Consumer Profile Identification and Allocation

We propose an easy-to-use methodology to allocate one of the groups whic...
research
08/28/2023

Categorical data analysis using discretization of continuous variables to investigate associations in marine ecosystems

Understanding and predicting interactions between predators and prey and...
research
07/12/2023

Multiple Correspondence and Proportional Analysis of Vaccination Rate Among Healthcare Personnel of MINSA

DataProAnalytica is a powerful application for analyzing vaccination dat...
research
08/26/2019

Sufficient Representations for Categorical Variables

Many learning algorithms require categorical data to be transformed into...
research
09/27/2020

A grammar of graphics framework for generalized parallel coordinate plots

Parallel coordinate plots (PCP) are a useful tool in exploratory data an...
research
11/13/2019

Generating Stereotypes Automatically For Complex Categorical Features

In the context of stereotypes creation for recommender systems, we found...
research
07/29/2019

ICE: An Interactive Configuration Explorer for High Dimensional Categorical Parameter Spaces

There are many applications where users seek to explore the impact of th...

Please sign up or login with your details

Forgot password? Click here to reset