The Hidden Web, XML and Semantic Web: A Scientific Data Management Perspective

05/10/2011
by Fabian Suchanek, et al.

The World Wide Web no longer consists just of HTML pages. Our work sheds light on a number of trends on the Internet that go beyond simple Web pages. The hidden Web provides a wealth of data in semi-structured form, accessible through Web forms and Web services. These services, as well as numerous other applications on the Web, commonly use XML, the eXtensible Markup Language. XML has become the lingua franca of the Internet, allowing customized markup to be defined for specific domains. On top of XML, the Semantic Web is growing as a common source of structured data. In this work, we first explain each of these developments in detail. Using real-world examples from scientific domains of great current interest, we then demonstrate how these developments can assist in managing, harvesting, and organizing data on the Web. Along the way, we also illustrate current research avenues in these domains. We believe that this effort will help bridge multiple database tracks, thereby attracting researchers with a view to extending database technology.
