O^2PF: Oversampling via Optimum-Path Forest for Breast Cancer Detection

by Leandro Aparecido Passos, et al.

Breast cancer is among the deadliest diseases, affecting mostly women worldwide. Although traditional detection methods are valid for the task, they still commonly yield low accuracy and demand considerable time and effort from professionals. A computer-aided diagnosis (CAD) system capable of providing early detection is therefore highly desirable. In the last decade, machine learning-based techniques have been of paramount importance in this context, since they can extract essential information from data and reason about it. However, such approaches still suffer from imbalanced data, particularly in medical applications, where the number of samples from healthy people is, in general, considerably higher than the number from patients. This paper therefore proposes O^2PF, a data oversampling method based on the unsupervised Optimum-Path Forest algorithm. Experiments conducted under the full oversampling scenario attest to the robustness of the model, which is compared against three well-established oversampling methods on three breast cancer datasets and three general-purpose medical datasets.
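To make the idea of cluster-based oversampling concrete, the following is a minimal sketch and not the authors' implementation. It assumes the minority-class samples have already been grouped by some unsupervised clustering step (O^2PF uses the unsupervised Optimum-Path Forest for this; here the cluster labels are simply passed in). The abstract does not say how synthetic samples are drawn, so the per-feature Gaussian used below is an assumption, and the function name `oversample_by_cluster` is hypothetical.

```python
import random
from collections import defaultdict
from statistics import mean, pstdev


def oversample_by_cluster(minority, cluster_ids, n_new, seed=0):
    """Draw n_new synthetic minority samples, one cluster at a time.

    minority    : list of feature vectors (lists of floats)
    cluster_ids : one cluster label per minority sample, produced by any
                  unsupervised clustering step (assumed to be given)
    n_new       : number of synthetic samples to generate
    """
    rng = random.Random(seed)

    # Group the minority samples by their cluster label.
    clusters = defaultdict(list)
    for sample, cid in zip(minority, cluster_ids):
        clusters[cid].append(sample)

    keys = sorted(clusters)
    weights = [len(clusters[k]) for k in keys]

    synthetic = []
    for _ in range(n_new):
        # Pick a cluster with probability proportional to its size.
        cid = rng.choices(keys, weights=weights, k=1)[0]
        columns = list(zip(*clusters[cid]))
        # Assumption: sample each feature from an independent Gaussian
        # fitted to that cluster (mean and population std. deviation).
        synthetic.append([rng.gauss(mean(col), pstdev(col)) for col in columns])
    return synthetic
```

In a full oversampling scenario, where the minority class is grown until it matches the majority class in size, `n_new` would be set to the difference between the two class counts, for example `oversample_by_cluster(minority, cluster_ids, len(majority) - len(minority))`.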







An Analysis of the Methods Employed for Breast Cancer Diagnosis

Breast cancer research over the last decade has been tremendous. The gro...

Comparing Methods for segmentation of Microcalcification Clusters in Digitized Mammograms

The appearance of microcalcifications in mammograms is one of the early ...

Deep Learning-based Mammogram Classification using Small Dataset

Breast Cancer is one of the most diagnosed cancer and the leading cause ...

HEROHE Challenge: assessing HER2 status in breast cancer without immunohistochemistry or in situ hybridization

Breast cancer is the most common malignancy in women, being responsible ...

Weighted multi-level deep learning analysis and framework for processing breast cancer WSIs

Prevention and early diagnosis of breast cancer (BC) is an essential pre...

Method and System for Image Analysis to Detect Cancer

Breast cancer is the most common cancer and is the leading cause of canc...

Application of Transfer Learning and Ensemble Learning in Image-level Classification for Breast Histopathology

Background: Breast cancer has the highest prevalence in women globally. ...