Towards Improved and Interpretable Deep Metric Learning via Attentive Grouping

by Zhengyang Wang, et al.

Grouping is commonly used in deep metric learning to compute diverse features. However, current grouping methods are prone to overfitting and lack interpretability. In this work, we propose an improved and interpretable grouping method that can be integrated flexibly with any metric learning framework. Our method is based on the attention mechanism, with a learnable query for each group. The query is fully trainable and can capture group-specific information when combined with the diversity loss. An appealing property of our method is that it naturally lends itself to interpretation: the attention scores between the learnable query and each spatial position can be read as the importance of that position. We formally show that our proposed grouping method is invariant to spatial permutations of features. When used as a module in convolutional neural networks, our method leads to translational invariance. We conduct comprehensive experiments to evaluate our method. Our quantitative results indicate that the proposed method outperforms prior methods consistently and significantly across different datasets, evaluation metrics, base models, and loss functions. To the best of our knowledge, our interpretation results are the first to clearly demonstrate that the proposed method enables the learning of distinct and diverse features across groups.
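The permutation-invariance claim can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes plain scaled dot-product attention with one query vector per group, omits any key/value projections and the diversity loss, and uses randomly initialized queries in place of trained ones. The names `attentive_group`, `features`, and `queries` are hypothetical.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attentive_group(features, query):
    """Pool N position vectors into one group feature with a single query.

    features: list of N vectors, each of length C (one per spatial position).
    query:    vector of length C (stands in for a learnable group query).
    Returns (pooled, attn), where attn[i] is the weight of position i.
    Assumption: scaled dot-product scores, no key/value projections.
    """
    c = len(query)
    scores = [sum(q * x for q, x in zip(query, f)) / math.sqrt(c)
              for f in features]
    attn = softmax(scores)
    pooled = [sum(a * f[j] for a, f in zip(attn, features)) for j in range(c)]
    return pooled, attn


rng = random.Random(0)
C, N, G = 4, 6, 2  # channels, spatial positions, groups (illustrative sizes)
features = [[rng.gauss(0, 1) for _ in range(C)] for _ in range(N)]
queries = [[rng.gauss(0, 1) for _ in range(C)] for _ in range(G)]

# Shuffle the spatial positions; each group's pooled feature should not change.
perm = list(range(N))
rng.shuffle(perm)
shuffled = [features[i] for i in perm]

results = []
for q in queries:
    pooled, attn = attentive_group(features, q)
    pooled_shuffled, _ = attentive_group(shuffled, q)
    same = all(math.isclose(a, b, abs_tol=1e-9)
               for a, b in zip(pooled, pooled_shuffled))
    results.append((pooled, attn, same))
```

In this sketch, `attn` plays the role of the per-position importance map described above, and the `same` flag checks that reordering the positions leaves each group's output unchanged up to floating-point rounding.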




Towards Interpretable Deep Metric Learning with Structural Matching

How do the neural networks distinguish two images? It is of critical imp...

Attention-based Ensemble for Deep Metric Learning

Recently, ensemble has been applied to deep metric learning to yield sta...

DIABLO: Dictionary-based Attention Block for Deep Metric Learning

Recent breakthroughs in representation learning of unseen classes and ex...

Spatial-Spectral Hyperspectral Classification based on Learnable 3D Group Convolution

Deep neural networks have faced many problems in hyperspectral image cla...

The Group Loss for Deep Metric Learning

Deep metric learning has yielded impressive results in tasks such as clu...

Spatial Mixture Models with Learnable Deep Priors for Perceptual Grouping

Humans perceive the seemingly chaotic world in a structured and composit...

Generalized Sum Pooling for Metric Learning

A common architectural choice for deep metric learning is a convolutiona...
