SPPAM - Statistical PreProcessing AlgorithM

03/11/2011
by   Tiago Silva, et al.
0

Most machine learning tools work with a single table where each row is an instance and each column is an attribute. Each cell of the table contains an attribute value for an instance. This representation prevents one important form of learning, which is, classification based on groups of correlated records, such as multiple exams of a single patient, internet customer preferences, weather forecast or prediction of sea conditions for a given day. To some extent, relational learning methods, such as inductive logic programming, can capture this correlation through the use of intensional predicates added to the background knowledge. In this work, we propose SPPAM, an algorithm that aggregates past observations in one single record. We show that applying SPPAM to the original correlated data, before the learning task, can produce classifiers that are better than the ones trained using all records.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/18/2022

Table Enrichment System for Machine Learning

Data scientists are constantly facing the problem of how to improve pred...
research
09/05/2019

Table-to-Text Generation with Effective Hierarchical Encoder on Three Dimensions (Row, Column and Time)

Although Seq2Seq models for table-to-text generation have achieved remar...
research
09/22/2022

mini-ELSA: using Machine Learning to improve space efficiency in Edge Lightweight Searchable Attribute-based encryption for Industry 4.0

In previous work a novel Edge Lightweight Searchable Attribute-based enc...
research
08/11/2021

Retrieval Interaction Machine for Tabular Data Prediction

Prediction over tabular data is an essential task in many data science a...
research
06/14/2019

Comparing Machine Learning Approaches for Table Recognition in Historical Register Books

We present in this paper experiments on Table Recognition in hand-writte...
research
06/18/2020

Record fusion: A learning approach

Record fusion is the task of aggregating multiple records that correspon...
research
09/20/2023

Information Leakage from Data Updates in Machine Learning Models

In this paper we consider the setting where machine learning models are ...

Please sign up or login with your details

Forgot password? Click here to reset