Classification of cancer pathology reports: a large-scale comparative study

06/29/2020
by   Stefano Martina, et al.
0

We report about the application of state-of-the-art deep learning techniques to the automatic and interpretable assignment of ICD-O3 topography and morphology codes to free-text cancer reports. We present results on a large dataset (more than 80 000 labeled and 1 500 000 unlabeled anonymized reports written in Italian and collected from hospitals in Tuscany over more than a decade) and with a large number of classes (134 morphological classes and 61 topographical classes). We compare alternative architectures in terms of prediction accuracy and interpretability and show that our best model achieves a multiclass accuracy of 90.3 morphology type assignment. We found that in this context hierarchical models are not better than flat models and that an element-wise maximum aggregator is slightly better than attentive models on site classification. Moreover, the maximum aggregator offers a way to interpret the classification process.

READ FULL TEXT
research
08/28/2020

Hierarchical Deep Learning Classification of Unstructured Pathology Reports to Automate ICD-O Morphology Grading

Timely cancer reporting data are required in order to understand the imp...
research
08/28/2020

Hierarchical Deep Learning Ensemble to Automate the Classification of Breast Cancer Pathology Reports by ICD-O Topography

Like most global cancer registries, the National Cancer Registry in Sout...
research
03/05/2019

Automatic Classification of Pathology Reports using TF-IDF Features

A Pathology report is arguably one of the most important documents in me...
research
09/10/2020

Why I'm not Answering

Safe deployment of deep learning systems in critical real world applicat...
research
11/08/2021

A Comparison of Deep Learning Architectures for Optical Galaxy Morphology Classification

The classification of galaxy morphology plays a crucial role in understa...
research
07/12/2020

A Comparative Study on Polyp Classification using Convolutional Neural Networks

Colorectal cancer is the third most common cancer diagnosed in both men ...
research
06/06/2023

Using Screenshot Attachments in Issue Reports for Triaging

In previous work, we deployed IssueTAG, which uses the texts present in ...

Please sign up or login with your details

Forgot password? Click here to reset