RWebData: A High-Level Interface to the Programmable Web

03/01/2016
by   Ulrich Matter, et al.
0

The rise of the programmable web offers new opportunities for the empirically driven social sciences. The access, compilation and preparation of data from the programmable web for statistical analysis can, however, involve substantial up-front costs for the practical researcher. The R-package RWebData provides a high-level framework that allows data to be easily collected from the programmable web in a format that can directly be used for statistical analysis in R (R Core Team 2013) without bothering about the data's initial format and nesting structure. It was developed specifically for users who have no experience with web technologies and merely use R as a statistical software. The core idea and methodological contribution of the package are the disentangling of parsing web data and mapping them with a generic algorithm (independent of the initial data structure) to a flat table-like representation. This paper provides an overview of the high-level functions for R-users, explains the basic architecture of the package, and illustrates the implemented data mapping algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2022

floodlight – A high-level, data-driven sports analytics framework

The present work introduces floodlight, an open source Python package bu...
research
03/31/2020

The Case For Alternative Web Archival Formats To Expedite The Data-To-Insight Cycle

The WARC file format is widely used by web archives to preserve collecte...
research
07/17/2019

Spherical data handling and analysis with R package rcosmo

The R package rcosmo was developed for handling and analysing Hierarchic...
research
03/26/2019

OpenMX Viewer: A web-based crystalline and molecular graphical user interface program

The OpenMX Viewer (Open source package for Material eXplorer Viewer) is ...
research
10/28/2020

Essential Scattering Applications for Everyone. Overview

ESCAPE is a free python package and framework for creating applications ...
research
07/21/2022

Toward a Generic Mapping Language for Transformations between RDF and Data Interchange Formats

While there exist approaches to integrate heterogeneous data using seman...
research
07/03/2020

Regulation conform DLT-operable payment adapter based on trustless - justified trust combined generalized state channels

Open technologies, decentralized computation and intelligent application...

Please sign up or login with your details

Forgot password? Click here to reset