Data Mining Approach to Analyze Covid19 Dataset of Brazilian Patients

08/26/2020
by   Josimar E. Chire Saire, et al.
0

The pandemic originated by coronavirus(covid-19), name coined by World Health Organization during the first month in 2020. Actually, almost all the countries presented covid19 positive cases and governments are choosing different health policies to stop the infection and many research groups are working on patients data to understand the virus, at the same time scientists are looking for a vacuum to enhance imnulogy system to tack covid19 virus. One of top countries with more infections is Brazil, until August 11 had a total of 3,112,393 cases. Research Foundation of Sao Paulo State(Fapesp) released a dataset, it was an innovative in collaboration with hospitals(Einstein, Sirio-Libanes), laboratory(Fleury) and Sao Paulo University to foster reseach on this trend topic. The present paper presents an exploratory analysis of the datasets, using a Data Mining Approach, and some inconsistencies are found, i.e. NaN values, null references values for analytes, outliers on results of analytes, encoding issues. The results were cleaned datasets for future studies, but at least a 20% of data were discarded because of non numerical, null values and numbers out of reference range.

READ FULL TEXT
research
05/19/2020

What country, university or research institute, performed the best on COVID-19? Bibliometric analysis of scientific literature

In this article, we conduct data mining to discover the countries, unive...
research
12/09/2021

Process Mining-Driven Analysis of the COVID19 Impact on the Vaccinations of Victorian Patients

Process mining is a discipline sitting between data mining and process s...
research
03/25/2020

What is the people posting about symptoms related to Coronavirus in Bogota, Colombia?

During the last months, there is an increasing alarm about a new mutatio...
research
08/01/2022

Data Collection and Analysis of French Dialects

This paper discusses creating and analysing a new dataset for data minin...
research
04/05/2020

Information Mining for COVID-19 Research From a Large Volume of Scientific Literature

The year 2020 has seen an unprecedented COVID-19 pandemic due to the out...
research
09/07/2020

Text Mining over Curriculum Vitae of Peruvian Professionals using Official Scientific Site DINA

During the last decade, Peruvian government started to invest and promot...

Please sign up or login with your details

Forgot password? Click here to reset