DeepAI AI Chat
Log In Sign Up

Theory-plus-code documentation of the DEPAM workflow for soundscape description

by   D. Cazau, et al.

In the Big Data era, the community of PAM faces strong challenges, including the need for more standardized processing tools accross its different applications in oceanography, and for more scalable and high-performance computing systems to process more efficiently the everly growing datasets. In this work we address conjointly both issues by first proposing a detailed theory-plus-code document of a classical analysis workflow to describe the content of PAM data, which hopefully will be reviewed and adopted by a maximum of PAM experts to make it standardized. Second, we transposed this workflow into the Scala language within the Spark/Hadoop frameworks so it can be directly scaled out on several node cluster.


page 1

page 2

page 3

page 4


MaRe: Container-Based Parallel Computing with Data Locality

Application containers are emerging as key components in scientific proc...

Big enterprise registration data imputation: Supporting spatiotemporal analysis of industries in China

Big, fine-grained enterprise registration data that includes time and lo...

BioWorkbench: A High-Performance Framework for Managing and Analyzing Bioinformatics Experiments

Advances in sequencing techniques have led to exponential growth in biol...

WfBench: Automated Generation of Scientific Workflow Benchmarks

The prevalence of scientific workflows with high computational demands c...

Using Big Data Technologies for HEP Analysis

The HEP community is approaching an era were the excellent performances ...

Toward a System Building Agenda for Data Integration

In this paper we argue that the data management community should devote ...