Coresets for Relational Data and The Applications

10/09/2022
by   Jiaxiang Chen, et al.
0

A coreset is a small set that can approximately preserve the structure of the original input data set. Therefore we can run our algorithm on a coreset so as to reduce the total computational complexity. Conventional coreset techniques assume that the input data set is available to process explicitly. However, this assumption may not hold in real-world scenarios. In this paper, we consider the problem of coresets construction over relational data. Namely, the data is decoupled into several relational tables, and it could be very expensive to directly materialize the data matrix by joining the tables. We propose a novel approach called “aggregation tree with pseudo-cube” that can build a coreset from bottom to up. Moreover, our approach can neatly circumvent several troublesome issues of relational learning problems [Khamis et al., PODS 2019]. Under some mild assumptions, we show that our coreset approach can be applied for the machine learning tasks, such as clustering, logistic regression and SVM.

READ FULL TEXT

page 10

page 21

research
06/28/2016

A Learning Algorithm for Relational Logistic Regression: Preliminary Results

Relational logistic regression (RLR) is a representation of conditional ...
research
10/11/2019

Rk-means: Fast Clustering for Relational Data

Conventional machine learning algorithms cannot be applied until a data ...
research
12/27/2012

On-line relational SOM for dissimilarity data

In some applications and in order to address real world situations bette...
research
08/01/2020

Relational Algorithms for k-means Clustering

The majority of learning tasks faced by data scientists involve relation...
research
07/25/2017

Comparing Aggregators for Relational Probabilistic Models

Relational probabilistic models have the challenge of aggregation, where...
research
03/05/2020

Simple and sharp analysis of k-means||

We present a truly simple analysis of k-means|| (Bahmani et al., PVLDB 2...
research
11/10/2017

Automated Migration of Hierarchical Data to Relational Tables using Programming-by-Example

While many applications export data in hierarchical formats like XML and...

Please sign up or login with your details

Forgot password? Click here to reset