Towards objectively evaluating the quality of generated medical summaries

04/09/2021
by Francesco Moramarco, et al.

We propose a method for evaluating the quality of generated text by asking evaluators to count facts, then computing precision, recall, F-score, and accuracy from the raw counts. We believe this approach leads to a more objective and more easily reproducible evaluation. We apply it to the task of medical report summarisation, where measuring objective quality and accuracy is of paramount importance.
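The abstract does not spell out how the raw fact counts map onto the four metrics, but a common formulation treats correctly reported facts, incorrectly reported facts, and omitted facts analogously to true positives, false positives, and false negatives. A minimal sketch under that assumption (the count names and exact definitions are hypothetical, not taken from the paper):

```python
def fact_metrics(correct, incorrect, omitted):
    """Compute precision, recall, F-score, and accuracy from raw fact counts.

    correct:   facts in the generated summary supported by the source
    incorrect: facts in the generated summary that are wrong or unsupported
    omitted:   facts in the source that the summary fails to mention
    (Assumed definitions; the paper may count facts differently.)
    """
    precision = correct / (correct + incorrect) if (correct + incorrect) else 0.0
    recall = correct / (correct + omitted) if (correct + omitted) else 0.0
    f_score = (2 * precision * recall / (precision + recall)
               if (precision + recall) else 0.0)
    # One plausible accuracy: correct facts over all facts involved.
    total = correct + incorrect + omitted
    accuracy = correct / total if total else 0.0
    return precision, recall, f_score, accuracy
```

For example, a summary with 8 correct facts, 2 incorrect facts, and 2 omissions would score precision 0.8, recall 0.8, F-score 0.8, and accuracy 8/12 under these definitions.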

Related research

06/22/2020
Shared Task on Evaluating Accuracy in Natural Language Generation
We propose a shared task on methodologies and algorithms for evaluating ...

11/08/2020
A Gold Standard Methodology for Evaluating Accuracy in Data-To-Text Systems
Most Natural Language Generation systems need to produce accurate texts....

10/12/2022
Perplexity from PLM Is Unreliable for Evaluating Text Quality
Recently, amounts of works utilize perplexity (PPL) to evaluate the qual...

03/15/2023
FactReranker: Fact-guided Reranker for Faithful Radiology Report Summarization
Automatic radiology report summarization is a crucial clinical task, who...

05/23/2023
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
Evaluating the factuality of long-form text generated by large language ...

05/30/2019
Assessing The Factual Accuracy of Generated Text
We propose a model-based metric to estimate the factual accuracy of gene...
