A Data Colocation Grid Framework for Big Data Medical Image Processing - Backend Design

12/22/2017
by   Shunxing Bao, et al.
0

When processing large medical imaging studies, adopting high performance grid computing resources rapidly becomes important. We recently presented a "medical image processing-as-a-service" grid framework that offers promise in utilizing the Apache Hadoop ecosystem and HBase for data colocation by moving computation close to medical image storage. However, the framework has not yet proven to be easy to use in a heterogeneous hardware environment. Furthermore, the system has not yet validated when considering variety of multi-level analysis in medical imaging. Our target criteria are (1) improving the framework's performance in a heterogeneous cluster, (2) performing population based summary statistics on large datasets, and (3) introducing a table design scheme for rapid NoSQL query. In this paper, we present a backend interface application program interface design for Hadoop & HBase for Medical Image Processing. The API includes: Upload, Retrieve, Remove, Load balancer and MapReduce templates. A dataset summary statistic model is discussed and implemented by MapReduce paradigm. We introduce a HBase table scheme for fast data query to better utilize the MapReduce model. Briefly, 5153 T1 images were retrieved from a university secure database and used to empirically access an in-house grid with 224 heterogeneous CPU cores. Three empirical experiments results are presented and discussed: (1) load balancer wall-time improvement of 1.5-fold compared with a framework with built-in data allocation strategy, (2) a summary statistic model is empirically verified on grid framework and is compared with the cluster when deployed with a standard Sun Grid Engine, which reduces 8-fold of wall clock time and 14-fold of resource time, and (3) the proposed HBase table scheme improves MapReduce computation with 7 fold reduction of wall time compare with a naïve scheme when datasets are relative small.

READ FULL TEXT

page 2

page 9

research
10/02/2017

CHIPS: A Service for Collecting, Organizing, Processing, and Sharing Medical Image Data in the Cloud

Web browsers are increasingly used as middleware platforms offering a ce...
research
02/25/2019

A Survey of Crowdsourcing in Medical Image Analysis

Rapid advances in image processing capabilities have been seen across ma...
research
06/01/2020

An Online Platform for Automatic Skull Defect Restoration and Cranial Implant Design

We introduce a fully automatic system for cranial implant design, a comm...
research
05/08/2023

Few Shot Learning for Medical Imaging: A Comparative Analysis of Methodologies and Formal Mathematical Framework

Deep learning becomes an elevated context regarding disposing of many ma...
research
01/27/2014

Computing support for advanced medical data analysis and imaging

We discuss computing issues for data analysis and image reconstruction o...

Please sign up or login with your details

Forgot password? Click here to reset