Distributed Feature Extraction in a P2P Setting - A Case Study

10/26/2020
by   Katharina Morik, et al.
0

Finding the right data representation is essential for virtually every data mining application. In this work we describe an approach to collaborative feature extraction, selection and aggregation in distributed, loosely coupled domains. In contrast to other work in the field of distributed data mining, we focus on scenarios in which a large number of loosely coupled nodes apply data mining to different, usually very small and overlapping, subsets of the entire data space. The aim is not to find a global concept to cover all data, but to learn a set of local concepts. Our prototypical application is a distributed media organization platform, called Nemoz, that assists users in maintaining their media collections. We propose two models for collaborative feature extraction, selection and aggregation for supervised data mining. One is based on a centralized p2p architecture, and the other on a fully distributed p2p architecture. We compare both models on a real world data set and discuss their advantages and problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/26/2020

Nemoz - A Distributed Framework for Collaborative Media Organization

Multimedia applications have received quite some interest. Embedding the...
research
07/05/2021

A contextual analysis of multi-layer perceptron models in classifying hand-written digits and letters: limited resources

Classifying hand-written digits and letters has taken a big leap with th...
research
07/27/2020

Innovative Platform for Designing Hybrid Collaborative Context-Aware Data Mining Scenarios

The process of knowledge discovery involves nowadays a major number of t...
research
07/03/2012

Relational Data Mining Through Extraction of Representative Exemplars

With the growing interest on Network Analysis, Relational Data Mining is...
research
10/26/2020

Aspect-Based Tagging for Collaborative Media Organization

Organizing multimedia data is very challenging. One of the most importan...
research
06/02/2022

Approximate Network Motif Mining Via Graph Learning

Frequent and structurally related subgraphs, also known as network motif...
research
06/20/2023

Polytope: An Algorithm for Efficient Feature Extraction on Hypercubes

Data extraction algorithms on data hypercubes, or datacubes, are traditi...

Please sign up or login with your details

Forgot password? Click here to reset