Human Evaluation and Correlation with Automatic Metrics in Consultation Note Generation

04/01/2022
by Francesco Moramarco, et al.

In recent years, machine learning models have rapidly become better at generating clinical consultation notes; yet there is little work on how to properly evaluate the generated notes to understand the impact they may have on both the clinician using them and the patient's clinical safety. To address this, we present an extensive human evaluation study of consultation notes in which 5 clinicians (i) listen to 57 mock consultations, (ii) write their own notes, (iii) post-edit a number of automatically generated notes, and (iv) extract all the errors, both quantitative and qualitative. We then carry out a correlation study between 18 automatic quality metrics and the human judgements. We find that a simple, character-based Levenshtein distance metric performs on par with, if not better than, common model-based metrics like BERTScore. All our findings and annotations are open-sourced.
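To make the character-based Levenshtein metric and the correlation step concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the function names, the normalisation by the longer string's length, and the choice of Pearson's coefficient are all assumptions, since the abstract does not specify them.

```python
from math import sqrt


def levenshtein(a: str, b: str) -> int:
    """Character-level edit distance: the minimum number of single-character
    insertions, deletions, and substitutions needed to turn `a` into `b`."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop to save memory
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]


def normalised_similarity(generated: str, reference: str) -> float:
    """Assumed normalisation: 1 - distance / length of the longer string,
    so identical notes score 1.0 and fully different notes score 0.0."""
    longest = max(len(generated), len(reference))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(generated, reference) / longest


def pearson(xs: list[float], ys: list[float]) -> float:
    """Pearson correlation between metric scores and human judgements.
    The coefficient used in the study is not stated in the abstract;
    Pearson is used here only as an example."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)
```

In use, one would compute `normalised_similarity` for each generated note against a clinician-written reference, then pass the resulting metric scores and the corresponding human judgements to `pearson`.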

