Performance Comparison of Balanced and Unbalanced Cancer Datasets using Pre-Trained Convolutional Neural Network

12/10/2020
by Ali Narin, et al.

Cancer is one of the leading causes of death worldwide, and breast cancer is especially common in women. Establishing a definitive diagnosis for this cancer type is a long process, and the most important tool for its early detection is histopathological images taken by biopsy. Pathologists examine these images and make the definitive diagnosis. Computer-aided detection is now commonly used to support this process, and the detection of benign or malignant tumors, especially using data at different magnification rates, has been studied in the literature. In this study, two study groups, one balanced and one unbalanced, have been formed using the histopathological data in the BreakHis data set. We have examined how performance in detecting tumor type changes between the balanced and unbalanced data sets. In conclusion, in the study performed using the InceptionV3 convolutional neural network model, 93.55 accuracy, 99.19 balanced data, while 89.75 values have been obtained for unbalanced data. According to the results obtained from the two groups, balancing the data increases the overall performance as well as the detection performance for both benign and malignant tumors. A model trained on balanced data sets can therefore be expected to give pathology specialists more accurate results.
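To make the balanced versus unbalanced comparison concrete, the following is a minimal sketch of one common way to form a balanced group (random undersampling of the larger class) and to report overall accuracy alongside per-class detection rates. The abstract does not state how the balanced group was built, so the undersampling strategy, the function names `undersample` and `per_class_rates`, and the toy labels are all assumptions made for illustration.

```python
import random
from collections import Counter, defaultdict


def undersample(labels, seed=0):
    """Return sorted indices of a class-balanced subset.

    Assumption: balance is reached by randomly dropping samples from the
    larger class(es) until every class matches the smallest one.
    """
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    n_min = min(len(idxs) for idxs in by_class.values())
    rng = random.Random(seed)
    keep = []
    for idxs in by_class.values():
        keep.extend(rng.sample(idxs, n_min))
    return sorted(keep)


def per_class_rates(y_true, y_pred):
    """Return overall accuracy and the fraction of each class detected correctly."""
    total = Counter(y_true)
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    accuracy = sum(correct.values()) / len(y_true)
    return accuracy, {c: correct[c] / total[c] for c in total}


# Toy labels (made up, not BreakHis counts): 3 benign vs 7 malignant.
labels = ["benign"] * 3 + ["malignant"] * 7
balanced_idx = undersample(labels)
balanced_labels = [labels[i] for i in balanced_idx]
print(Counter(balanced_labels))

# Toy predictions (made up) to show the per-class breakdown.
acc, rates = per_class_rates(
    ["benign", "benign", "malignant", "malignant"],
    ["benign", "malignant", "malignant", "malignant"],
)
print(acc, rates)
```

Reporting the per-class rates next to accuracy matters for the unbalanced case, where a high overall accuracy can hide weak detection of the smaller class.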


research
09/15/2023

Improved Breast Cancer Diagnosis through Transfer Learning on Hematoxylin and Eosin Stained Histology Images

Breast cancer is one of the leading causes of death for women worldwide....
research
12/21/2020

Natural vs Balanced Distribution in Deep Learning on Whole Slide Images for Cancer Detection

The class distribution of data is one of the factors that regulates the ...
research
12/10/2020

Effect of Different Batch Size Parameters on Predicting of COVID19 Cases

The new coronavirus 2019, also known as COVID19, is a very serious epide...
research
04/09/2009

Online prediction of ovarian cancer

In this paper we apply computer learning methods to diagnosing ovarian c...
research
10/17/2017

CancerLinker: Explorations of Cancer Study Network

Interactive visualization tools are highly desirable to biologist and ca...
research
07/11/2020

Decoupling Inherent Risk and Early Cancer Signs in Image-based Breast Cancer Risk Models

The ability to accurately estimate risk of developing breast cancer woul...
research
03/08/2022

An Efficient Polyp Segmentation Network

Cancer is a disease that occurs as a result of uncontrolled division and...
