Automatic size and pose homogenization with spatial transformer network to improve and accelerate pediatric segmentation

07/06/2021
by Giammarco La Barbera, et al.

Segmentation of pediatric images is challenging for deep learning methods because of the high heterogeneity in pose and size and the limited amount of available data. In this work, we propose a new CNN architecture that is pose and scale invariant thanks to a Spatial Transformer Network (STN). Our architecture is composed of three sequential modules that are estimated together during training: (i) a regression module that estimates a similarity matrix to normalize the input image to a reference one; (ii) a differentiable module that finds the region of interest to segment; (iii) a segmentation module, based on the popular UNet architecture, that delineates the object. Unlike the original UNet, which must learn a complex mapping, including pose and scale variations, from a finite training dataset, our segmentation module learns a simpler mapping that focuses on images with normalized pose and size. Furthermore, automatic bounding box detection through the STN saves time and especially memory while keeping similar performance. We evaluate the proposed method on kidney and renal tumor segmentation in pediatric abdominal CT scans. Results indicate that the estimated STN homogenization of size and pose accelerates the segmentation (25 h) compared to standard data augmentation (33 h), while obtaining a similar quality for the kidney (Dice score of 88.01%) and improving the renal tumor delineation (from 85.52% to 87.12%).
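
The normalization step in (i) can be illustrated with a minimal NumPy sketch of how a similarity matrix resamples an image in the style of a spatial transformer. This is not the authors' implementation: it assumes a 2D image, normalized coordinates in [-1, 1], and bilinear interpolation with zero padding, and the function names `similarity_matrix`, `make_grid`, and `bilinear_sample` are hypothetical. In the described architecture the parameters would come from the regression module; here they are passed in by hand.

```python
import numpy as np

def similarity_matrix(scale, angle, tx, ty):
    """2x3 similarity matrix: isotropic scale, rotation (radians), translation."""
    c, s = np.cos(angle), np.sin(angle)
    return np.array([[scale * c, -scale * s, tx],
                     [scale * s,  scale * c, ty]])

def make_grid(height, width):
    """Homogeneous output coordinates in [-1, 1], shape (3, height * width)."""
    ys, xs = np.meshgrid(np.linspace(-1, 1, height),
                         np.linspace(-1, 1, width), indexing="ij")
    return np.stack([xs.ravel(), ys.ravel(), np.ones(height * width)])

def bilinear_sample(image, theta, out_shape):
    """Resample `image` at the source locations theta @ grid (assumed 2D)."""
    h, w = image.shape
    out_h, out_w = out_shape
    src = theta @ make_grid(out_h, out_w)      # (2, out_h * out_w)
    # Map normalized source coordinates back to pixel coordinates.
    x = (src[0] + 1) * (w - 1) / 2
    y = (src[1] + 1) * (h - 1) / 2
    x0, y0 = np.floor(x).astype(int), np.floor(y).astype(int)
    x1, y1 = x0 + 1, y0 + 1
    wx, wy = x - x0, y - y0

    def at(yy, xx):
        # Zero padding for samples that fall outside the image.
        inside = (yy >= 0) & (yy < h) & (xx >= 0) & (xx < w)
        vals = image[np.clip(yy, 0, h - 1), np.clip(xx, 0, w - 1)]
        return np.where(inside, vals, 0.0)

    out = ((1 - wx) * (1 - wy) * at(y0, x0) + wx * (1 - wy) * at(y0, x1)
           + (1 - wx) * wy * at(y1, x0) + wx * wy * at(y1, x1))
    return out.reshape(out_h, out_w)

# Usage: zoom into the central half of a toy image (scale < 1 magnifies).
image = np.arange(25, dtype=float).reshape(5, 5)
theta = similarity_matrix(scale=0.5, angle=0.0, tx=0.0, ty=0.0)
zoomed = bilinear_sample(image, theta, out_shape=(5, 5))
```

Because every operation above is a differentiable function of the matrix entries, the same resampling written in a deep learning framework would let gradients from a downstream segmentation loss reach the regression parameters, which is what makes joint training of the modules possible.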

research
11/18/2022

Joint nnU-Net and Radiomics Approaches for Segmentation and Prognosis of Head and Neck Cancers with PET/CT images

Automatic segmentation of head and neck cancer (HNC) tumors and lymph no...
research
08/09/2019

Hyper Vision Net: Kidney Tumor Segmentation Using Coordinate Convolutional Layer and Attention Unit

KiTs19 challenge paves the way to haste the improvement of solid kidney ...
research
04/08/2021

CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds

In this work, we tackle the problem of category-level online pose tracki...
research
09/22/2021

T6D-Direct: Transformers for Multi-Object 6D Pose Direct Regression

6D pose estimation is the task of predicting the translation and orienta...
research
10/15/2021

Combining CNNs With Transformer for Multimodal 3D MRI Brain Tumor Segmentation With Self-Supervised Pretraining

We apply an ensemble of modified TransBTS, nnU-Net, and a combination of...
research
07/13/2020

Improving Pixel Embedding Learning through Intermediate Distance Regression Supervision for Instance Segmentation

As a proposal-free approach, instance segmentation through pixel embeddi...
research
07/04/2019

LumièreNet: Lecture Video Synthesis from Audio

We present LumièreNet, a simple, modular, and completely deep-learning b...
