From Base Data To Knowledge Discovery – A Life Cycle Approach – Using Multilayer Networks

05/24/2021
by Abhishek Santra, et al.

Any large, complex data analysis that aims to infer or discover meaningful information or knowledge involves the following steps (in addition to data collection, cleaning, and preparation for analysis, such as attribute elimination): i) modeling the data, that is, choosing a modeling approach and deriving a data representation for analysis using that approach; ii) translating analysis objectives into computations on the generated model, which can be as simple as a single computation (e.g., community detection) or may involve a sequence of operations (e.g., pair-wise community detection over multiple networks) using expressions based on the model; iii) computing the generated expressions, where efficiency and scalability come into the picture; and iv) drilling down into the results to interpret or understand them clearly. Beyond this, it is also useful to visualize results for easier understanding; the Covid-19 visualization dashboard presented in this paper is an example. This paper covers all of the above steps of the data analysis life cycle using a data representation that is gaining importance for multi-entity, multi-feature data sets: Multilayer Networks (MLNs). We use several data sets to establish the effectiveness of modeling using MLNs and analyze them using the proposed decoupling approach. For coverage, we use different types of MLNs for modeling, and community and centrality computations for analysis. The data sets used are US commercial airlines, IMDb, DBLP, and Covid-19. Our experimental analyses using the identified steps validate the modeling, the breadth of objectives that can be computed, and the overall versatility of the life cycle approach. Correctness of results is verified, where possible, using independently available ground truth. We demonstrate the drill-down afforded by this approach (due to structure and semantics preservation) for a better understanding and visualization of results.
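The four life cycle steps can be illustrated with a minimal Python sketch. It is not the paper's algorithm or data: the airline names, routes, the "above-average degree" hub rule, and the helper names `adjacency` and `layer_hubs` are all assumptions made for illustration. The objective chosen here is "cities that are hubs for every airline individually", which can be answered by analyzing each layer on its own and intersecting the partial results, in the spirit of a decoupling approach.

```python
from collections import defaultdict

# Step i (modeling): a toy homogeneous MLN. All layers share one node set
# (cities); each layer holds one airline's direct flights.
# Airline names and routes are hypothetical, not taken from the paper.
layers = {
    "airline_a": [("DFW", "ORD"), ("DFW", "LAX"), ("DFW", "JFK"), ("ORD", "JFK")],
    "airline_b": [("DFW", "LAX"), ("DFW", "ORD"), ("LAX", "SEA"), ("DFW", "SEA")],
}


def adjacency(edges):
    """Build an undirected adjacency map (node -> set of neighbors)."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj


def layer_hubs(adj):
    """Assumed hub rule: nodes whose degree exceeds the layer's average degree."""
    avg_degree = sum(len(nbrs) for nbrs in adj.values()) / len(adj)
    return {node for node, nbrs in adj.items() if len(nbrs) > avg_degree}


# Step iii (computation): analyze every layer independently.
adj_by_layer = {name: adjacency(edges) for name, edges in layers.items()}
hubs_by_layer = {name: layer_hubs(adj) for name, adj in adj_by_layer.items()}

# Step ii (objective as an expression): "hub in airline_a AND hub in airline_b"
# becomes an intersection of the per-layer results.
common_hubs = set.intersection(*hubs_by_layer.values())

# Step iv (drill-down): because each layer's structure is preserved, every
# result can be traced back to the routes that produced it.
drill_down = {
    hub: {name: sorted(adj.get(hub, set())) for name, adj in adj_by_layer.items()}
    for hub in sorted(common_hubs)
}

for hub, per_layer in drill_down.items():
    print(hub, per_layer)
```

Keeping the layers separate, rather than merging them into one graph, is what makes the drill-down step possible: each common hub can still be explained in terms of the individual airline's routes.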


