Automated Metadata Harmonization Using Entity Resolution Contextual Embedding

10/17/2020
by   Kunal Sawarkar, et al.
0

ML Data Curation process typically consist of heterogeneous federated source systems with varied schema structures; requiring curation process to standardize metadata from different schemas to an inter-operable schema. This manual process of Metadata Harmonization cataloging slows efficiency of ML-Ops lifecycle. We demonstrate automation of this step with the help of entity resolution methods also by using Cogntive Database's Db2Vec embedding approach to capture hidden inter-column intra-column relationships which detect similarity of metadata and then predict metadata columns from source schemas to any standardized schemas. Apart from matching schemas, we demonstrate that it can also infer the correct ontological structure of the target data model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/20/2019

Column2Vec: Structural Understanding via Distributed Representations of Database Schemas

We present Column2Vec, a distributed representation of database columns ...
research
09/20/2019

Metadata Systems for Data Lakes: Models and Features

Over the past decade, the data lake concept has emerged as an alternativ...
research
02/27/2020

Data-Driven Metadata Tagging for Building Automation Systems: A Unified Architecture

This article presents a Unified Architecture for automated point tagging...
research
07/06/2023

JSONoid: Monoid-based Enrichment for Configurable and Scalable Data-Driven Schema Discovery

Schema discovery is an important aspect to working with data in formats ...
research
09/08/2023

Matching Table Metadata with Business Glossaries Using Large Language Models

Enterprises often own large collections of structured data in the form o...
research
07/19/2022

Metadata Representations for Queryable ML Model Zoos

Machine learning (ML) practitioners and organizations are building model...
research
06/25/2022

SiMa: Effective and Efficient Data Silo Federation Using Graph Neural Networks

Virtually every sizable organization nowadays is building a form of a da...

Please sign up or login with your details

Forgot password? Click here to reset