Self-training of Machine Learning Models for Liver Histopathology: Generalization under Clinical Shifts

11/14/2022
by   Jin Li, et al.
0

Histopathology images are gigapixel-sized and include features and information at different resolutions. Collecting annotations in histopathology requires highly specialized pathologists, making it expensive and time-consuming. Self-training can alleviate annotation constraints by learning from both labeled and unlabeled data, reducing the amount of annotations required from pathologists. We study the design of teacher-student self-training systems for Non-alcoholic Steatohepatitis (NASH) using clinical histopathology datasets with limited annotations. We evaluate the models on in-distribution and out-of-distribution test data under clinical data shifts. We demonstrate that through self-training, the best student model statistically outperforms the teacher with a 3% absolute difference on the macro F1 score. The best student model also approaches the performance of a fully supervised model trained with twice as many annotations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2021

Noisy student-teacher training for robust keyword spotting

We propose self-training with noisy student-teacher approach for streami...
research
06/08/2023

Teaching AI to Teach: Leveraging Limited Human Salience Data Into Unlimited Saliency-Based Training

Machine learning models have shown increased accuracy in classification ...
research
02/10/2023

Q-Match: Self-supervised Learning by Matching Distributions Induced by a Queue

In semi-supervised learning, student-teacher distribution matching has b...
research
08/25/2021

Multi-Task Self-Training for Learning General Representations

Despite the fast progress in training specialized models for various tas...
research
12/21/2021

Teacher-Student Architecture for Mixed Supervised Lung Tumor Segmentation

Purpose: Automating tasks such as lung tumor localization and segmentati...
research
07/13/2022

Wakeword Detection under Distribution Shifts

We propose a novel approach for semi-supervised learning (SSL) designed ...
research
02/14/2022

MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts

Understanding the performance of machine learning models across diverse ...

Please sign up or login with your details

Forgot password? Click here to reset