Minerva and minepy: a C engine for the MINE suite and its R, Python and MATLAB wrappers

08/21/2012
by Davide Albanese, et al.

We introduce a novel ANSI C implementation of the MINE family of algorithms for computing maximal information-based measures of dependence between two variables in large datasets, designed for a low memory footprint and easy integration into bioinformatics pipelines. We provide two libraries: minerva, with the R interface, and minepy, for Python, MATLAB, Octave and C++. The C implementation reduces the large memory requirement of the original Java implementation, scales well, and offers native parallelization in the R interface. Low memory requirements are demonstrated on the MINE benchmarks as well as on large (n=1340) microarray and Illumina GAII RNA-seq transcriptomics datasets. Availability and Implementation: Source code and binaries are freely available for download under the GPL3 licence at http://minepy.sourceforge.net for minepy and through the CRAN repository http://cran.r-project.org for the R package minerva. All software is multiplatform (MS Windows, Linux and OSX).
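
To make the idea of a maximal information-based measure of dependence concrete, the following is a minimal, standard-library-only sketch. It is not the MINE algorithm and not the minerva or minepy API: it tries only equal-frequency grids rather than optimizing the partition, so it illustrates the principle (grid the scatterplot, compute mutual information, normalize, keep the maximum) and nothing more. The function names and the grid budget n^alpha with alpha = 0.6 are assumptions made for this illustration.

```python
import math
from collections import Counter


def equal_frequency_bins(values, n_bins):
    """Assign each value a bin index so that bins hold roughly equal counts."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, idx in enumerate(order):
        bins[idx] = min(rank * n_bins // len(values), n_bins - 1)
    return bins


def mutual_information(xb, yb):
    """Mutual information (in bits) between two sequences of bin labels."""
    n = len(xb)
    joint = Counter(zip(xb, yb))
    px, py = Counter(xb), Counter(yb)
    return sum(
        (c / n) * math.log2(c * n / (px[i] * py[j]))
        for (i, j), c in joint.items()
    )


def naive_mic(x, y, alpha=0.6):
    """Largest normalized mutual information over small equal-frequency grids.

    Hypothetical helper: grids are limited to nx * ny <= n ** alpha cells,
    and each score is divided by log2(min(nx, ny)) so it lies in [0, 1].
    """
    budget = len(x) ** alpha
    best = 0.0
    nx = 2
    while nx * 2 <= budget:
        xb = equal_frequency_bins(x, nx)
        ny = 2
        while nx * ny <= budget:
            yb = equal_frequency_bins(y, ny)
            score = mutual_information(xb, yb) / math.log2(min(nx, ny))
            best = max(best, score)
            ny += 1
        nx += 1
    return best


if __name__ == "__main__":
    xs = [float(i) for i in range(200)]
    print("linear relationship:", naive_mic(xs, xs))
```

A perfectly monotone relationship scores close to 1 under this sketch, while unrelated variables score close to 0. The actual libraries search over grid partitions far more carefully, so their values will generally differ from this approximation.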


research
08/13/2023

Py-Tetrad and RPy-Tetrad: A New Python Interface with R Support for Tetrad Causal Search

We give novel Python and R interfaces for the (Java) Tetrad project for ...
research
05/04/2018

Pytrec_eval: An Extremely Fast Python Interface to trec_eval

We introduce pytrec_eval, a Python interface to the trec_eval informatio...
research
03/27/2021

NMRPy: a novel NMR scripting system to implement artificial intelligence and advanced applications

Background: Software is an important window to offer a variety of compl...
research
08/03/2023

PyPartMC: A Pythonic interface to a particle-resolved, Monte Carlo aerosol simulation framework

PyPartMC is a Pythonic interface to PartMC, a stochastic, particle-resol...
research
08/11/2017

OpenML Benchmarking Suites and the OpenML100

We advocate the use of curated, comprehensive benchmark suites of machin...
research
03/23/2023

GMP-Featurizer: A parallelized Python package for efficiently computing the Gaussian Multipole features of atomic systems

GMP-Featurizer is a lightweight, accurate, efficient, and scalable softw...
research
03/12/2010

Release ZERO.0.1 of package RefereeToolbox

RefereeToolbox is a java package implementing combination operators for ...