Triclustering in Big Data Setting

10/24/2020
by   Dmitry Egurnov, et al.
0

In this paper, we describe versions of triclustering algorithms adapted for efficient calculations in distributed environments with MapReduce model or parallelisation mechanism provided by modern programming languages. OAC-family of triclustering algorithms shows good parallelisation capabilities due to the independent processing of triples of a triadic formal context. We provide the time and space complexity of the algorithms and justify their relevance. We also compare performance gain from using a distributed system and scalability.

READ FULL TEXT
research
11/24/2021

Systematic Analysis of Programming Languages and Their Execution Environments for Spectre Attacks

In this paper, we analyze the security of programming languages and thei...
research
08/22/2020

Assurance of Distributed Algorithms and Systems: Runtime Checking of Safety and Liveness

This paper presents a general framework and methods for complete program...
research
07/07/2017

InferSpark: Statistical Inference at Scale

The Apache Spark stack has enabled fast large-scale data processing. Des...
research
07/24/2021

Imbalanced Big Data Oversampling: Taxonomy, Algorithms, Software, Guidelines and Future Directions

Learning from imbalanced data is among the most challenging areas in con...
research
03/22/2015

Modeling browser-based distributed evolutionary computation systems

From the era of big science we are back to the "do it yourself", where y...
research
09/19/2017

Rbox: an integrated R package for ATOM Editor

R is a programming language and environment that is a central tool in th...
research
03/03/2019

Development details and computational benchmarking of DEPAM

In the big data era of observational oceanography, passive acoustics dat...

Please sign up or login with your details

Forgot password? Click here to reset