Learning the Semantics of Structured Data Sources

01/16/2016
by   Mohsen Taheriyan, et al.

Information sources such as relational databases, spreadsheets, XML, JSON, and Web APIs contain a tremendous amount of structured data that can be leveraged to build and augment knowledge graphs. However, they rarely provide a semantic model to describe their contents. Semantic models of data sources represent the implicit meaning of the data by specifying the concepts and the relationships within the data. Such models are the key ingredients for automatically publishing the data into knowledge graphs. Manually modeling the semantics of data sources requires significant effort and expertise, and although automation is desirable, building these models automatically is a challenging problem. Most of the related work focuses on semantic annotation of the data fields (source attributes). However, it is critical to construct a semantic model that explicitly describes the relationships between the attributes in addition to their semantic types. We present a novel approach that exploits the knowledge from a domain ontology and the semantic models of previously modeled sources to automatically learn a rich semantic model for a new source. This model represents the semantics of the new source in terms of the concepts and relationships defined by the domain ontology. Given some sample data from the new source, we leverage the knowledge in the domain ontology and the known semantic models to construct a weighted graph that represents the space of plausible semantic models for the new source. Then, we compute the top-k candidate semantic models and present them to the user as a ranked list. The approach takes user corrections into account to learn more accurate semantic models for future data sources. Our evaluation shows that our method generates expressive semantic models for data sources and services with minimal user input. ...
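To make the graph-and-ranking idea in the abstract concrete, here is a minimal Python sketch. It is not the authors' algorithm: the attribute names, ontology classes, confidences, link weights, and the helper names `connect` and `top_k_models` are all hypothetical, and the scoring (one minus the semantic-type confidence, plus the cost of a cheapest tree of direct links between the chosen classes) is a simplifying assumption. Attributes mapped to the same class are also assumed to share a single class node.

```python
import heapq
from itertools import product

# Hypothetical candidate semantic types per source attribute:
# attribute -> list of (ontology class, data property, confidence).
CANDIDATES = {
    "name": [("Person", "name", 0.8), ("Organization", "name", 0.2)],
    "employer": [("Organization", "name", 0.7), ("Person", "name", 0.3)],
}

# Hypothetical weighted links between ontology classes (lower = more plausible).
LINKS = {("Person", "worksFor", "Organization"): 0.1}


def connect(classes, links):
    """Return (cost, edges) of a cheapest tree of direct links joining
    `classes`, or None if they cannot be connected (Prim-style growth)."""
    classes = set(classes)
    reached = {min(classes)}  # deterministic starting node
    edges, total = [], 0.0
    while reached != classes:
        # Links that join an already reached class to an unreached one.
        frontier = [
            (w, src, label, dst)
            for (src, label, dst), w in links.items()
            if src in classes and dst in classes
            and (src in reached) != (dst in reached)
        ]
        if not frontier:
            return None
        w, src, label, dst = min(frontier)
        reached.update((src, dst))
        edges.append((src, label, dst))
        total += w
    return total, edges


def top_k_models(candidates, links, k):
    """Enumerate one semantic type per attribute, score each combination,
    and return the k lowest-cost candidate models."""
    attrs = sorted(candidates)
    scored = []
    for combo in product(*(candidates[a] for a in attrs)):
        tree = connect({cls for cls, _, _ in combo}, links)
        if tree is None:
            continue  # chosen classes cannot be related; not a valid model
        link_cost, edges = tree
        type_cost = sum(1.0 - conf for _, _, conf in combo)
        mapping = {a: (cls, prop) for a, (cls, prop, _) in zip(attrs, combo)}
        scored.append((type_cost + link_cost, mapping, edges))
    return heapq.nsmallest(k, scored, key=lambda m: m[0])


ranked = top_k_models(CANDIDATES, LINKS, k=2)
for cost, mapping, edges in ranked:
    print(round(cost, 2), mapping, edges)
```

With these toy inputs, the lowest-cost model maps `name` to a `Person` and `employer` to an `Organization` joined by `worksFor`. Exhaustive enumeration is used only for brevity; it grows exponentially with the number of attributes and would not scale to real sources.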
