CUED at ProbSum 2023: Hierarchical Ensemble of Summarization Models

06/08/2023
by Potsawee Manakul, et al.

In this paper, we consider the challenge of summarizing patients' medical progress notes in a limited data setting. For the Problem List Summarization task (shared task 1A) at the BioNLP Workshop 2023, we demonstrate that Clinical-T5 fine-tuned on 765 medical clinic notes outperforms other extractive, abstractive and zero-shot baselines, yielding reasonable baseline systems for medical note summarization. Further, we introduce the Hierarchical Ensemble of Summarization Models (HESM), consisting of token-level ensembles of diverse fine-tuned Clinical-T5 models, followed by Minimum Bayes Risk (MBR) decoding. Our HESM approach led to a considerable boost in summarization performance and, when evaluated on held-out challenge data, achieved a ROUGE-L of 32.77, the best-performing system on the shared task leaderboard.
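The two stages named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: it assumes that a token-level ensemble averages the next-token probability distributions of several models at each decoding step, and that MBR decoding picks the candidate summary with the highest total similarity to the other candidates. The function names (`average_token_distributions`, `rouge_l_f1`, `mbr_select`), the whitespace tokenization, and the choice of an LCS-based ROUGE-L-style F1 as the similarity are all illustrative assumptions.

```python
from typing import Callable, List, Sequence


def average_token_distributions(distributions: List[List[float]]) -> List[float]:
    """Assumed form of a token-level ensemble: the arithmetic mean of the
    next-token distributions produced by each model over a shared vocabulary."""
    n_models = len(distributions)
    return [sum(column) / n_models for column in zip(*distributions)]


def lcs_length(a: Sequence[str], b: Sequence[str]) -> int:
    """Length of the longest common subsequence of two token sequences."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(candidate: str, reference: str) -> float:
    """Simplified ROUGE-L F1 on lowercased whitespace tokens (illustrative only)."""
    c, r = candidate.lower().split(), reference.lower().split()
    if not c or not r:
        return 0.0
    lcs = lcs_length(c, r)
    if lcs == 0:
        return 0.0
    precision, recall = lcs / len(c), lcs / len(r)
    return 2 * precision * recall / (precision + recall)


def mbr_select(
    candidates: List[str],
    similarity: Callable[[str, str], float] = rouge_l_f1,
) -> str:
    """Return the candidate with the highest summed similarity to all others,
    treating the other candidates as pseudo-references."""
    if not candidates:
        raise ValueError("candidates must be non-empty")

    def total_similarity(i: int) -> float:
        return sum(
            similarity(candidates[i], other)
            for j, other in enumerate(candidates)
            if j != i
        )

    best_index = max(range(len(candidates)), key=total_similarity)
    return candidates[best_index]


# Hypothetical candidate summaries, e.g. one per ensemble in the hierarchy.
example_candidates = [
    "sepsis pneumonia hypertension anemia",
    "sepsis pneumonia hypertension",
    "sepsis pneumonia",
]
selected = mbr_select(example_candidates)
```

In this toy example the second candidate is selected because it overlaps most with the other two. A real system would score candidates with the official ROUGE implementation and operate on model outputs rather than hand-written strings.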
