Exactly mergeable summaries

03/25/2023
by   Vladimir Batagelj, et al.
0

In the analysis of large/big data sets, aggregation (replacing values of a variable over a group by a single value) is a standard way of reducing the size (complexity) of the data. Data analysis programs provide different aggregation functions. Recently some books dealing with the theoretical and algorithmic background of traditional aggregation functions were published. A problem with traditional aggregation is that often too much information is discarded thus reducing the precision of the obtained results. A much better, preserving more information, summarization of original data can be achieved by representing aggregated data using selected types of complex data. In complex data analysis the measured values over a selected group A are aggregated into a complex object Σ(A) and not into a single value. Most of the aggregation functions theory does not apply directly. In our contribution, we present an attempt to start building a theoretical background of complex aggregation. We introduce and discuss exactly mergeable summaries for which it holds for merging of disjoint sets of units Σ(A ∪ B) = F( Σ(A),Σ(B)), A∩ B = ∅ .

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/27/2019

Ordered Sets for Data Analysis

This book dwells on mathematical and algorithmic issues of data analysis...
research
10/09/2018

Description of sup- and inf-preserving aggregation functions via families of clusters in data tables

Connection between the theory of aggregation functions and formal concep...
research
09/26/2013

Stochastic Rank Aggregation

This paper addresses the problem of rank aggregation, which aims to find...
research
07/27/2020

VFL: A Verifiable Federated Learning with Privacy-Preserving for Big Data in Industrial IoT

Due to the strong analytical ability of big data, deep learning has been...
research
01/31/2021

Contextualized Rewriting for Text Summarization

Extractive summarization suffers from irrelevance, redundancy and incohe...
research
10/26/2019

PREMA: Principled Tensor Data Recovery from Multiple Aggregated Views

Multidimensional data have become ubiquitous and are frequently involved...
research
01/20/2018

Nonfractional Memory: Filtering, Antipersistence, and Forecasting

The fractional difference operator remains to be the most popular mechan...

Please sign up or login with your details

Forgot password? Click here to reset