Advances in Big Data Bio Analytics

09/18/2019
by   Nicos Angelopoulos, et al.
0

Delivering effective data analytics is of crucial importance to the interpretation of the multitude of biological datasets currently generated by an ever increasing number of high throughput techniques. Logic programming has much to offer in this area. Here, we detail advances that highlight two of the strengths of logical formalisms in developing data analytic solutions in biological settings: access to large relational databases and building analytical pipelines collecting graph information from multiple sources. We present significant advances on the bio_db package which serves biological databases as Prolog facts that can be served either by in-memory loading or via database backends. These advances include modularising the underlying architecture and the incorporation of datasets from a second organism (mouse). In addition, we introduce a number of data analytics tools that operate on these datasets and are bundled in the analysis package: bio_analytics. Emphasis in both packages is on ease of installation and use. We highlight the general architecture of our components based approach. An experimental graphical user interface via SWISH for local installation is also available. Finally, we advocate that biological data analytics is a fertile area which can drive further innovation in applied logic programming.

READ FULL TEXT
research
01/19/2018

Big Data Analytics for Wireless and Wired Network Design: A Survey

Currently, the world is witnessing a mounting avalanche of data due to t...
research
12/07/2017

Columnar Database Techniques for Creating AI Features

Recent advances with in-memory columnar database techniques have increas...
research
11/30/2016

Contextualizing Geometric Data Analysis and Related Data Analytics: A Virtual Microscope for Big Data Analytics

The relevance and importance of contextualizing data analytics is descri...
research
01/29/2023

Data accounting and error counting

Can we infer sources of errors from outputs of the complex data analytic...
research
03/15/2021

SEMgraph: An R Package for Causal Network Analysis of High-Throughput Data with Structural Equation Models

With the advent of high-throughput sequencing (HTS) in molecular biology...
research
03/26/2019

Data4UrbanMobility: Towards Holistic Data Analytics for Mobility Applications in Urban Regions

With the increasing availability of mobility-related data, such as GPS-t...
research
12/05/2018

Graph based Question Answering System

In today's digital age in the dawning era of big data analytics it is no...

Please sign up or login with your details

Forgot password? Click here to reset