Interactive Data Integration through Smart Copy & Paste

09/09/2009
by   Zachary Ives, et al.
0

In many scenarios, such as emergency response or ad hoc collaboration, it is critical to reduce the overhead in integrating data. Ideally, one could perform the entire process interactively under one unified interface: defining extractors and wrappers for sources, creating a mediated schema, and adding schema mappings ? while seeing how these impact the integrated view of the data, and refining the design accordingly. We propose a novel smart copy and paste (SCP) model and architecture for seamlessly combining the design-time and run-time aspects of data integration, and we describe an initial prototype, the CopyCat system. In CopyCat, the user does not need special tools for the different stages of integration: instead, the system watches as the user copies data from applications (including the Web browser) and pastes them into CopyCat?s spreadsheet-like workspace. CopyCat generalizes these actions and presents proposed auto-completions, each with an explanation in the form of provenance. The user provides feedback on these suggestions ? through either direct interactions or further copy-and-paste operations ? and the system learns from this feedback. This paper provides an overview of our prototype system, and identifies key research challenges in achieving SCP in its full generality.

READ FULL TEXT

page 3

page 4

research
07/20/2021

Information Integration using the Typed Graph Model

Schema and data integration have been a challenge for more than 40 years...
research
10/13/2018

Integration in terms of polylogarithm

This paper provides a Liouville principle for integration in terms of di...
research
10/15/2020

Survive the Schema Changes: Integration of Unmanaged Data Using Deep Learning

Data is the king in the age of AI. However data integration is often a l...
research
07/22/2011

Consistent Query Answering via ASP from Different Perspectives: Theory and Practice

A data integration system provides transparent access to different data ...
research
07/04/2023

A Prototype for a Controlled and Valid RDF Data Production Using SHACL

The paper introduces a tool prototype that combines SHACL's capabilities...
research
10/26/2022

Dragoman: Efficiently Evaluating Declarative Mapping Languages over Frameworks for Knowledge Graph Creation

In recent years, there have been valuable efforts and contributions to m...

Please sign up or login with your details

Forgot password? Click here to reset