NEMA: Automatic Integration of Large Network Management Databases

06/01/2020
by   Fubao Wu, et al.
0

Network management, whether for malfunction analysis, failure prediction, performance monitoring and improvement, generally involves large amounts of data from different sources. To effectively integrate and manage these sources, automatically finding semantic matches among their schemas or ontologies is crucial. Existing approaches on database matching mainly fall into two categories. One focuses on the schema-level matching based on schema properties such as field names, data types, constraints and schema structures. Network management databases contain massive tables (e.g., network products, incidents, security alert and logs) from different departments and groups with nonuniform field names and schema characteristics. It is not reliable to match them by those schema properties. The other category is based on the instance-level matching using general string similarity techniques, which are not applicable for the matching of large network management databases. In this paper, we develop a matching technique for large NEtwork MAnagement databases (NEMA) deploying instance-level matching for effective data integration and connection. We design matching metrics and scores for both numerical and non-numerical fields and propose algorithms for matching these fields. The effectiveness and efficiency of NEMA are evaluated by conducting experiments based on ground truth field pairs in large network management databases. Our measurement on large databases with 1,458 fields, each of which contains over 10 million records, reveals that the accuracies of NEMA are up to 95 achieves 2

READ FULL TEXT

page 1

page 11

page 14

research
08/05/2018

Schema Integration on Massive Data Sources

As the fundamental phrase of collecting and analyzing data, data integra...
research
05/11/2023

A Semi-Automated Hybrid Schema Matching Framework for Vegetation Data Integration

Integrating disparate and distributed vegetation data is critical for co...
research
07/10/2014

XML Matchers: approaches and challenges

Schema Matching, i.e. the process of discovering semantic correspondence...
research
10/14/2020

Valentine: Evaluating Matching Techniques for Dataset Discovery

Data scientists today search large data lakes to discover and integrate ...
research
10/05/2021

Notarial timestamps savings in logs management via Merkle trees and Key Derivation Functions

Nowadays log files handling imposes to ISPs (intended in their widest sc...
research
09/15/2021

PoWareMatch: a Quality-aware Deep Learning Approach to Improve Human Schema Matching

Schema matching is a core task of any data integration process. Being in...
research
01/31/2022

Eris: Measuring discord among multidimensional data sources

Data integration is a classical problem in databases, typically decompos...

Please sign up or login with your details

Forgot password? Click here to reset