LowDINO – A Low Parameter Self Supervised Learning Model

05/28/2023
by   Sai Krishna Prathapaneni, et al.
0

This research aims to explore the possibility of designing a neural network architecture that allows for small networks to adopt the properties of huge networks, which have shown success in self-supervised learning (SSL), for all the downstream tasks like image classification, segmentation, etc. Previous studies have shown that using convolutional neural networks (ConvNets) can provide inherent inductive bias, which is crucial for learning representations in deep learning models. To reduce the number of parameters, attention mechanisms are utilized through the usage of MobileViT blocks, resulting in a model with less than 5 million parameters. The model is trained using self-distillation with momentum encoder and a student-teacher architecture is also employed, where the teacher weights use vision transformers (ViTs) from recent SOTA SSL models. The model is trained on the ImageNet1k dataset. This research provides an approach for designing smaller, more efficient neural network architectures that can perform SSL tasks comparable to heavy models

READ FULL TEXT

page 2

page 5

research
01/12/2021

SEED: Self-supervised Distillation For Visual Representation

This paper is concerned with self-supervised learning for small models. ...
research
10/05/2022

Exploring The Role of Mean Teachers in Self-supervised Masked Auto-Encoders

Masked image modeling (MIM) has become a popular strategy for self-super...
research
04/04/2023

Deep learning for diffusion in porous media

We adopt convolutional neural networks (CNN) to predict the basic proper...
research
06/09/2023

Two Independent Teachers are Better Role Model

Recent deep learning models have attracted substantial attention in infa...
research
11/17/2022

Self-Supervised Visual Representation Learning via Residual Momentum

Self-supervised learning (SSL) approaches have shown promising capabilit...
research
09/26/2022

Learning to Learn with Generative Models of Neural Network Checkpoints

We explore a data-driven approach for learning to optimize neural networ...
research
04/23/2021

Inductive biases and Self Supervised Learning in modelling a physical heating system

Model Predictive Controllers (MPC) require a good model for the controll...

Please sign up or login with your details

Forgot password? Click here to reset