Ignorance-Aware Approaches and Algorithms for Prototype Selection in Machine Learning

05/15/2019
by Vagan Terziyan, et al.

Operating with ignorance is an important concern in machine learning research, especially when the objective is to discover knowledge from imperfect data. Data mining (driven by appropriate knowledge discovery tools) processes available (observed, known and understood) samples of data in order to build a model (e.g., a classifier) that can handle data samples which are not yet observed, known or understood. These tools traditionally take samples of the available data (known facts) as input for learning. We challenge the indispensability of this approach and suggest considering things the other way around. What if the task were as follows: how can a model be learned from our ignorance, i.e., by processing the shape of the 'voids' within the available data space? Can we improve traditional classification by also modeling the ignorance? In this paper, we provide algorithms for the discovery and visualization of ignorance zones in two-dimensional data spaces and design two ignorance-aware smart prototype selection techniques (incremental and adversarial) to improve the performance of nearest neighbor classifiers. We present experiments with artificial and real datasets to test the usefulness of ignorance discovery in machine learning.
