Neural Dependencies Emerging from Learning Massive Categories

11/21/2022
by   Ruili Feng, et al.
0

This work presents two astonishing findings on neural networks learned for large-scale image classification. 1) Given a well-trained model, the logits predicted for some category can be directly obtained by linearly combining the predictions of a few other categories, which we call neural dependency. 2) Neural dependencies exist not only within a single model, but even between two independently learned models, regardless of their architectures. Towards a theoretical analysis of such phenomena, we demonstrate that identifying neural dependencies is equivalent to solving the Covariance Lasso (CovLasso) regression problem proposed in this paper. Through investigating the properties of the problem solution, we confirm that neural dependency is guaranteed by a redundant logit covariance matrix, which condition is easily met given massive categories, and that neural dependency is highly sparse, implying that one category correlates to only a few others. We further empirically show the potential of neural dependencies in understanding internal data correlations, generalizing models to unseen categories, and improving model robustness with a dependency-derived regularizer. Code for this work will be made publicly available.

READ FULL TEXT

page 17

page 19

page 20

page 21

page 26

page 27

page 29

page 30

research
11/15/2022

Identifying Spurious Correlations and Correcting them with an Explanation-based Learning

Identifying spurious correlations learned by a trained model is at the c...
research
02/10/2023

A Mathematical Model of Package Management Systems – from General Event Structures to Antimatroids

This paper brings mathematical tools to bear on the study of package dep...
research
07/18/2023

Exploiting Field Dependencies for Learning on Categorical Data

Traditional approaches for learning on categorical data underexploit the...
research
05/07/2022

Comparison Knowledge Translation for Generalizable Image Classification

Deep learning has recently achieved remarkable performance in image clas...
research
02/01/2020

The Sylvester Graphical Lasso (SyGlasso)

This paper introduces the Sylvester graphical lasso (SyGlasso) that capt...
research
02/24/2022

Multiple multi-sample testing under arbitrary covariance dependency

Modern high-throughput biomedical devices routinely produce data on a la...
research
01/20/2019

Visualizing Semantic Structures of Sequential Data by Learning Temporal Dependencies

While conventional methods for sequential learning focus on interaction ...

Please sign up or login with your details

Forgot password? Click here to reset