Factors for the Generalisation of Identity Relations by Neural Networks

06/13/2019
by   Radha Kopparti, et al.
0

Many researchers implicitly assume that neural networks learn relations and generalise them to new unseen data. It has been shown recently, however, that the generalisation of feed-forward networks fails for identity relations.The proposed solution for this problem is to create an inductive bias with Differential Rectifier (DR) units. In this work we explore various factors in the neural network architecture and learning process whether they make a difference to the generalisation on equality detection of Neural Networks without and and with DR units in early and mid fusion architectures. We find in experiments with synthetic data effects of the number of hidden layers, the activation function and the data representation. The training set size in relation to the total possible set of vectors also makes a difference. However, the accuracy never exceeds 61 DR units improve generalisation in all tasks and lead to almost perfect test accuracy in the Mid Fusion setting. Thus, DR units seem to be a promising approach for creating generalisation abilities that standard networks lack.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/04/2018

Feed-Forward Neural Networks Need Inductive Bias to Learn Equality Relations

Basic binary relations such as equality and inequality are fundamental t...
research
03/06/2020

Weight Priors for Learning Identity Relations

Learning abstract and systematic relations has been an open issue in neu...
research
12/06/2018

Modelling Identity Rules with Neural Networks

In this paper, we show that standard feed-forward and recurrent neural n...
research
10/28/2022

Automated analysis of diabetic retinopathy using vessel segmentation maps as inductive bias

Recent studies suggest that early stages of diabetic retinopathy (DR) ca...
research
03/10/2021

Relational Weight Priors in Neural Networks for Abstract Pattern Learning and Language Modelling

Deep neural networks have become the dominant approach in natural langua...
research
05/24/2019

Doctor of Crosswise: Reducing Over-parametrization in Neural Networks

Dr. of Crosswise proposes a new architecture to reduce over-parametrizat...
research
06/03/2021

Advances in Classifying the Stages of Diabetic Retinopathy Using Convolutional Neural Networks in Low Memory Edge Devices

Diabetic Retinopathy (DR) is a severe complication that may lead to reti...

Please sign up or login with your details

Forgot password? Click here to reset