Medical Image Deidentification, Cleaning and Compression Using Pylogik

04/20/2023
by   Adrienne Kline, et al.
0

Leveraging medical record information in the era of big data and machine learning comes with the caveat that data must be cleaned and deidentified. Facilitating data sharing and harmonization for multi-center collaborations are particularly difficult when protected health information (PHI) is contained or embedded in image meta-data. We propose a novel library in the Python framework, called PyLogik, to help alleviate this issue for ultrasound images, which are particularly challenging because of the frequent inclusion of PHI directly on the images. PyLogik processes the image volumes through a series of text detection/extraction, filtering, thresholding, morphological and contour comparisons. This methodology deidentifies the images, reduces file sizes, and prepares image volumes for applications in deep learning and data sharing. To evaluate its effectiveness in the identification of regions of interest (ROI), a random sample of 50 cardiac ultrasounds (echocardiograms) were processed through PyLogik, and the outputs were compared with the manual segmentations by an expert user. The Dice coefficient of the two approaches achieved an average value of 0.976. Next, an investigation was conducted to ascertain the degree of information compression achieved using the algorithm. Resultant data was found to be on average approximately 72 results suggest that PyLogik is a viable methodology for ultrasound data cleaning and deidentification, determining ROI, and file compression which will facilitate efficient storage, use, and dissemination of ultrasound data.

READ FULL TEXT

page 3

page 5

page 6

page 8

research
01/17/2019

UltraCompression: Framework for High Density Compression of Ultrasound Volumes using Physics Modeling Deep Neural Networks

Ultrasound image compression by preserving speckle-based key information...
research
05/09/2023

Echo from noise: synthetic ultrasound image generation using diffusion models for real image segmentation

We propose a novel pipeline for the generation of synthetic images via D...
research
04/28/2019

X-Ray Image Compression Using Convolutional Recurrent Neural Networks

In the advent of a digital health revolution, vast amounts of clinical d...
research
06/07/2023

SMRVIS: Point cloud extraction from 3-D ultrasound for non-destructive testing

We propose to formulate point cloud extraction from ultrasound volumes a...
research
03/24/2023

Removing confounding information from fetal ultrasound images

Confounding information in the form of text or markings embedded in medi...
research
09/05/2022

Domain Generalization for Prostate Segmentation in Transrectal Ultrasound Images: A Multi-center Study

Prostate biopsy and image-guided treatment procedures are often performe...
research
09/03/2020

Exploratory Analysis of File System Metadata for Rapid Investigation of Security Incidents

Investigating cybersecurity incidents requires in-depth knowledge from t...

Please sign up or login with your details

Forgot password? Click here to reset