ConTrOn: Continuously Trained Ontology based on Technical Data Sheets and Wikidata

06/16/2019
by   Kobkaew Opasjumruskit, et al.
0

In engineering projects involving various parts from global suppliers, one common task is to determine which parts are best suited for the project requirements. Information about specific parts' characteristics is published in so called data sheets. However, these data sheets are oftentimes only published in textual form, e.g., as a PDF. Hence, they have to be transformed into a machine-interpretable format. This transformation process still requires a lot of manual intervention and is prone to errors. Automated approaches make use of ontologies to capture the given domain and thus improve automated information extraction from the data sheets. However, ontologies rely solely on experiences and perspectives of their creators at the time of creation and cannot accumulate knowledge over time on their own. This paper presents ConTrOn -- Continuously Trained Ontology -- a system that automatically augments ontologies. ConTrOn tackles terminology problems by combining the knowledge extracted from data sheets with an ontology created by domain experts and external knowledge bases such as WordNet and Wikidata. To demonstrate how the enriched ontology can improve the information extraction process, we selected data sheets from spacecraft development as a use case. The evaluation results show that the amount of information extracted from data sheets based on ontologies is significantly increased after the ontology enrichment.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2022

Ontology Development is Consensus Creation, Not (Merely) Representation

Ontology development methodologies emphasise knowledge gathering from do...
research
11/09/2017

Repairing Ontologies via Axiom Weakening

Ontology engineering is a hard and error-prone task, in which small chan...
research
05/23/2021

Automatic Product Ontology Extraction from Textual Reviews

Ontologies have proven beneficial in different settings that make use of...
research
10/12/2015

Data structuring for the ontological modelling of wind energy systems

Small wind projects encounter difficulties to be efficiently deployed, p...
research
12/18/2015

Ontology-driven Information Extraction

Homogeneous unstructured data (HUD) are collections of unstructured docu...
research
05/23/2018

RDF2Vec-based Classification of Ontology Alignment Changes

When ontologies cover overlapping topics, the overlap can be represented...

Please sign up or login with your details

Forgot password? Click here to reset