KGTK: A Toolkit for Large Knowledge Graph Manipulation and Analysis

05/29/2020
by   Filip Ilievski, et al.
21

Knowledge graphs (KGs) have become the preferred technology for representing, sharing and adding knowledge to modern AI applications. While KGs have become a mainstream technology, the RDF/SPARQL-centric toolset for operating with them at scale is heterogeneous, difficult to integrate and only covers a subset of the operations that are commonly needed in data science applications. In this paper, we present KGTK, a data science-centric toolkit to represent, create, transform, enhance and analyze KGs. KGTK represents graphs in tables and leverages popular libraries developed for data science applications, enabling a wide audience of developers to easily construct knowledge graph pipelines for their applications. We illustrate KGTK with real-world scenarios in which we have used KGTK to integrate and manipulate large KGs, such as Wikidata, DBpedia and ConceptNet, in our own work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/23/2018

Data Science with Vadalog: Bridging Machine Learning and Reasoning

Following the recent successful examples of large technology companies, ...
research
09/11/2021

Discovering Technology Gaps using the IntSight Knowledge Navigator

Knowledge analysis is an important application of knowledge graphs. In t...
research
03/03/2023

Linked Data Science Powered by Knowledge Graphs

In recent years, we have witnessed a growing interest in data science no...
research
08/20/2022

Comparing graph data science libraries for querying and analysing datasets: towards data science queries on graphs

This paper presents an experimental study to compare analysis tools with...
research
11/10/2022

Wikidata-lite for Knowledge Extraction and Exploration

Wikidata is the largest collaborative general knowledge graph supported ...
research
11/25/2021

Federated Data Science to Break Down Silos [Vision]

Similar to Open Data initiatives, data science as a community has launch...
research
11/19/2021

Types for Tables: A Language Design Benchmark

Context: Tables are ubiquitous formats for data. Therefore, techniques f...

Please sign up or login with your details

Forgot password? Click here to reset