Targeted Gradient Descent: A Novel Method for Convolutional Neural Networks Fine-tuning and Online-learning

09/29/2021
by   Junyu Chen, et al.
4

A convolutional neural network (ConvNet) is usually trained and then tested using images drawn from the same distribution. To generalize a ConvNet to various tasks often requires a complete training dataset that consists of images drawn from different tasks. In most scenarios, it is nearly impossible to collect every possible representative dataset as a priori. The new data may only become available after the ConvNet is deployed in clinical practice. ConvNet, however, may generate artifacts on out-of-distribution testing samples. In this study, we present Targeted Gradient Descent (TGD), a novel fine-tuning method that can extend a pre-trained network to a new task without revisiting data from the previous task while preserving the knowledge acquired from previous training. To a further extent, the proposed method also enables online learning of patient-specific data. The method is built on the idea of reusing a pre-trained ConvNet's redundant kernels to learn new knowledge. We compare the performance of TGD to several commonly used training approaches on the task of Positron emission tomography (PET) image denoising. Results from clinical images show that TGD generated results on par with training-from-scratch while significantly reducing data preparation and network training time. More importantly, it enables online learning on the testing study to enhance the network's generalization capability in real-world applications.

READ FULL TEXT

page 5

page 7

page 8

page 9

research
12/31/2019

Side-Tuning: Network Adaptation via Additive Side Networks

When training a neural network for a desired task, one may prefer to ada...
research
10/11/2022

A Kernel-Based View of Language Model Fine-Tuning

It has become standard to solve NLP tasks by fine-tuning pre-trained lan...
research
09/01/2018

Data Dropout: Optimizing Training Data for Convolutional Neural Networks

Deep learning models learn to fit training data while they are highly ex...
research
11/19/2021

An Analysis of the Influence of Transfer Learning When Measuring the Tortuosity of Blood Vessels

Characterizing blood vessels in digital images is important for the diag...
research
05/19/2023

Conditional Online Learning for Keyword Spotting

Modern approaches for keyword spotting rely on training deep neural netw...
research
09/17/2017

Neural Affine Grayscale Image Denoising

We propose a new grayscale image denoiser, dubbed as Neural Affine Image...
research
03/10/2020

FOAL: Fast Online Adaptive Learning for Cardiac Motion Estimation

Motion estimation of cardiac MRI videos is crucial for the evaluation of...

Please sign up or login with your details

Forgot password? Click here to reset