Preprocessing Methods and Pipelines of Data Mining: An Overview

06/20/2019
by   Canchen Li, et al.
0

Data mining is about obtaining new knowledge from existing datasets. However, the data in the existing datasets can be scattered, noisy, and even incomplete. Although lots of effort is spent on developing or fine-tuning data mining models to make them more robust to the noise of the input data, their qualities still strongly depend on the quality of it. The article starts with an overview of the data mining pipeline, where the procedures in a data mining task are briefly introduced. Then an overview of the data preprocessing techniques which are categorized as the data cleaning, data transformation and data preprocessing is given. Detailed preprocessing methods, as well as their influenced on the data mining models, are covered in this article.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/07/2012

Soil Data Analysis Using Classification Techniques and Soil Attribute Prediction

Agricultural research has been profited by technical advances such as au...
research
05/28/2017

Subject Specific Stream Classification Preprocessing Algorithm for Twitter Data Stream

Micro-blogging service Twitter is a lucrative source for data mining app...
research
05/12/2017

An Overview of Data Mining Applications in Oil and Gas Exploration: Structural Geology and Reservoir Property-Issues

Low oil prices have motivated energy executives to look into cost reduct...
research
11/08/2021

A Novel Data Pre-processing Technique: Making Data Mining Robust to Different Units and Scales of Measurement

Many existing data mining algorithms use feature values directly in thei...
research
09/16/2020

Similarity-based data mining for online domain adaptation of a sonar ATR system

Due to the expensive nature of field data gathering, the lack of trainin...
research
12/24/2009

Similarité en intension vs en extension : à la croisée de l'informatique et du théâtre

Traditional staging is based on a formal approach of similarity leaning ...
research
08/21/2019

Visualization in the preprocessing phase: an interview study with enterprise professionals

The current information age has increasingly required organizations to b...

Please sign up or login with your details

Forgot password? Click here to reset