Checklist Models for Improved Output Fluency in Piano Fingering Prediction

09/12/2022
by Nikita Srivatsan, et al.

In this work we present a new approach to predicting fingerings for piano music. While prior neural approaches have often treated this as a sequence tagging problem with independent predictions, we put forward a checklist system, trained via reinforcement learning, that maintains a representation of recent predictions in addition to a hidden state, allowing it to learn soft constraints on output structure. We also demonstrate that modifying the input representation yields much better performance: prior neural models have often used one-hot encodings over individual keys on the piano, whereas we encode each note's position on the keyboard relative to the prior note. Additionally, we reassess the use of raw per-note labeling precision as an evaluation metric, noting that it does not adequately measure the fluency, i.e. human playability, of a model's output. To this end, we compare methods across several statistics that track the frequency of adjacent finger predictions which, while independently reasonable, would be physically challenging to perform in sequence, and we implement a reinforcement learning strategy to minimize these as part of our training loss. Finally, through human expert evaluation, we demonstrate significant gains in performability directly attributable to improvements with respect to these metrics.
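
As a rough illustration of two ideas in the abstract, the sketch below computes a relative-position input (semitone offset from the prior note) and one possible adjacent-finger statistic. Both function names, the use of MIDI pitch numbers, and the specific "same finger on a changed pitch" statistic are assumptions made for illustration; the abstract does not specify the paper's actual encoding or which statistics it tracks.

```python
from typing import List


def relative_offsets(pitches: List[int]) -> List[int]:
    """Offset in semitones of each note from the note before it.

    Assumption: notes are given as MIDI pitch numbers. The first note has
    no predecessor, so its offset is set to 0 here.
    """
    if not pitches:
        return []
    return [0] + [cur - prev for prev, cur in zip(pitches, pitches[1:])]


def same_finger_repeat_rate(pitches: List[int], fingers: List[int]) -> float:
    """Fraction of adjacent note pairs where the pitch changes but the
    predicted finger does not.

    This is a hypothetical fluency statistic, not necessarily one used
    in the paper.
    """
    if len(pitches) != len(fingers):
        raise ValueError("pitches and fingers must have the same length")
    n_pairs = len(pitches) - 1
    if n_pairs <= 0:
        return 0.0
    awkward = sum(
        1
        for i in range(n_pairs)
        if pitches[i] != pitches[i + 1] and fingers[i] == fingers[i + 1]
    )
    return awkward / n_pairs
```

A statistic of this kind scores pairs of adjacent predictions rather than individual labels, which is what lets it capture sequences that per-note precision would miss.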


