regulAS: A Bioinformatics Tool for the Integrative Analysis of Alternative Splicing Regulome using RNA-Seq data

07/17/2023
by Sofya Lipnitskaya, et al.

The regulAS software package is a bioinformatics tool that supports computational biology researchers in investigating the regulatory mechanisms of splicing alterations through integrative analysis of large-scale RNA-Seq data from cancer and healthy human donors, as profiled by the TCGA and GTEx projects. This technical report gives an overview of regulAS, covering its core functionality, basic modules, experiment configuration, extensibility, and customization. The core functionality automates computational experiments, stores and processes results efficiently, and streamlines workflow management. The integrated basic modules add RNA-Seq data retrieval from the public multi-omics UCSC Xena data repository, predictive modeling and feature ranking using the scikit-learn package, and flexible report generation for analyzing gene expression profiles and relevant modulations of alternative splicing aberrations across tissues and cancer types. Experiments are configured through YAML files using the Hydra and OmegaConf libraries. In addition, regulAS allows custom modules to be developed and integrated for specialized tasks. In conclusion, regulAS provides an automated solution for alternative splicing and cancer biology studies that improves the efficiency, reproducibility, and customization of experimental design, while the extensibility of the pipeline lets researchers tailor the software package to their specific needs. Source code is available under the MIT license at https://github.com/slipnitskaya/regulAS.
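To make the idea of feature ranking concrete, the following is a minimal, dependency-free sketch and not regulAS code. The abstract states that regulAS relies on scikit-learn models for this step; here a plain Pearson correlation is swapped in so the example runs with the Python standard library alone. The function names (`pearson`, `rank_features`), the regulator labels (`REG_A`, `REG_B`, `REG_C`), and all numeric values are hypothetical toy inputs chosen for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def rank_features(expression, target):
    """Rank candidate regulators by absolute correlation with the target.

    expression: dict mapping a regulator name to its expression across samples.
    target: splicing readout (for example, percent spliced-in) for the same samples.
    """
    scores = {name: pearson(values, target) for name, values in expression.items()}
    return sorted(scores.items(), key=lambda item: abs(item[1]), reverse=True)


# Toy values only: five samples, three hypothetical regulators.
expression = {
    "REG_A": [1.0, 2.0, 3.0, 4.0, 5.0],
    "REG_B": [5.0, 3.0, 4.0, 1.0, 2.0],
    "REG_C": [2.0, 2.0, 2.0, 2.0, 2.0],
}
psi = [0.10, 0.20, 0.30, 0.40, 0.50]

ranking = rank_features(expression, psi)
for name, score in ranking:
    print(f"{name}\t{score:+.3f}")
```

The sign of each score indicates whether higher regulator expression goes with more or less inclusion of the event in the toy data, while the ranking uses the absolute value. A model-based ranking, as described for regulAS, would instead derive importances from a fitted predictor.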


