Heaps' Law and Vocabulary Richness in the History of Classical Music Harmony

04/09/2021
by Marc Serra-Peralta, et al.

Music is a fundamental human construct, and harmony provides the building blocks of musical language. Using the Kunstderfuge corpus of classical music, we analyze the historical evolution of the richness of the harmonic vocabulary of 76 classical composers, spanning almost six centuries. This corpus comprises about 9500 pieces, yielding more than 5 million tokens of music codewords. Heaps' law holds for the relation between the size of a composer's harmonic vocabulary (in codeword types) and the total length of the composer's works (in codeword tokens), with an exponent around 0.35. This allows us to define a relative measure of vocabulary richness that has a transparent interpretation. Applied to this corpus, the measure lets us quantify harmonic richness across centuries, unveiling a clear increasing linear trend. We can thus rank the composers by richness of vocabulary, as can also be done with related metrics such as entropy; we find that entropy is particularly highly correlated with our measure of richness. Our approach is not specific to music and can be applied to other systems built from tokens of different types, for instance natural language.
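
As a rough illustration of the idea in the abstract, the sketch below fits Heaps' law, V = K * L^alpha, by least squares in log-log space and then scores each composer by how far the observed vocabulary sits above or below the fitted curve. The abstract does not give the paper's exact definition of relative richness, so the log-ratio used here is an assumption, as are the function names `fit_heaps` and `relative_richness`. The numbers are synthetic, generated only to exercise the code; they are not taken from the Kunstderfuge corpus.

```python
import math

def fit_heaps(token_counts, type_counts):
    """Least-squares fit of log V = log K + alpha * log L.

    token_counts: total length L of each composer's works (codeword tokens)
    type_counts:  vocabulary size V of each composer (codeword types)
    Returns (K, alpha).
    """
    xs = [math.log(n) for n in token_counts]
    ys = [math.log(v) for v in type_counts]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    alpha = sxy / sxx
    K = math.exp(mean_y - alpha * mean_x)
    return K, alpha

def relative_richness(n_tokens, n_types, K, alpha):
    """Assumed richness score: log of observed vocabulary over the
    Heaps-law prediction. Zero means 'exactly on the fitted curve'."""
    return math.log(n_types / (K * n_tokens ** alpha))

# Synthetic composers lying exactly on V = 10 * L**0.35 (illustrative only).
lengths = [1_000, 5_000, 20_000, 100_000, 400_000]
vocab = [10 * L ** 0.35 for L in lengths]

K, alpha = fit_heaps(lengths, vocab)

# A hypothetical composer with twice the vocabulary predicted for 50,000 tokens.
rich = relative_richness(50_000, 2 * K * 50_000 ** alpha, K, alpha)
```

Dividing by the fitted curve removes the dependence on the amount of surviving music, so a prolific composer and a sparsely documented one can be compared on the same scale. That is the motivation the abstract gives for a relative measure.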

