Clustering of football players based on performance data and aggregated clustering validity indexes

04/20/2022
by   Serhat Akhanli, et al.
0

We analyse football (soccer) player performance data with mixed type variables from the 2014-15 season of eight European major leagues. We cluster these data based on a tailor-made dissimilarity measure. In order to decide between the many available clustering methods and to choose an appropriate number of clusters, we use the approach by Akhanli and Hennig (2020). This is based on several validation criteria that refer to different desirable characteristics of a clustering. These characteristics are chosen based on the aim of clustering, and this allows to define a suitable validation index as weighted average of calibrated individual indexes measuring the desirable features. We derive two different clusterings. The first one is a partition of the data set into major groups of essentially different players, which can be used for the analysis of a team's composition. The second one divides the data set into many small clusters (with 10 players on average), which can be used for finding players with a very similar profile to a given player. It is discussed in depth what characteristics are desirable for these clusterings. Weighting the criteria for the second clustering is informed by a survey of football experts.

READ FULL TEXT
research
02/05/2020

Comparing clusterings and numbers of clusters by aggregation of calibrated clustering validity indexes

A key issue in cluster analysis is the choice of an appropriate clusteri...
research
12/13/2016

Defensive Player Classification in the National Basketball Association

The National Basketball Association(NBA) has expanded their data gatheri...
research
09/09/2016

Measuring Player's Behaviour Change over Time in Public Goods Game

An important issue in public goods game is whether player's behaviour ch...
research
06/19/2016

Clustering with a Reject Option: Interactive Clustering as Bayesian Prior Elicitation

A good clustering can help a data analyst to explore and understand a da...
research
10/24/2019

Characterization and Development of Average Silhouette Width Clustering

The purpose of this paper is to introduced a new clustering methodology....
research
07/08/2016

Document Clustering Games in Static and Dynamic Scenarios

In this work we propose a game theoretic model for document clustering. ...
research
02/14/2018

PlayeRank: Multi-dimensional and role-aware rating of soccer player performance

The problem of rating the performance of soccer players is attracting th...

Please sign up or login with your details

Forgot password? Click here to reset