Hierarchical Symbolic Reasoning in Hyperbolic Space for Deep Discriminative Models
Explanations for black-box models help us understand model decisions and expose model biases and inconsistencies. Most current explainability techniques provide a single level of explanation, often in terms of feature importance scores or feature attention maps in input space. Our focus is on explaining deep discriminative models at multiple levels of abstraction, from fine-grained to fully abstract explanations. We achieve this by using the natural properties of hyperbolic geometry to model a hierarchy of symbolic features more efficiently and to generate hierarchical symbolic rules as part of our explanations. Specifically, for any given deep discriminative model, we distill the underlying knowledge by discretising the continuous latent space with vector quantisation to form symbols, followed by a hyperbolic reasoning block that induces an abstraction tree. We traverse the tree to extract explanations in terms of symbolic rules and their corresponding visual semantics. We demonstrate the effectiveness of our method on the MNIST and AFHQ (high-resolution animal faces) datasets. Our framework is available at <https://github.com/koriavinash1/SymbolicInterpretability>.
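The two building blocks named in the abstract, vector quantisation of latents into symbols and distances in hyperbolic space, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `quantise` and `poincare_distance`, the toy codebook, and the choice of the Poincaré ball model are assumptions made for illustration only. The abstract does not say which hyperbolic model or quantisation scheme the framework uses.

```python
import numpy as np


def quantise(latents, codebook):
    """Assign each latent vector the index of its nearest codebook entry.

    Hypothetical helper: plain nearest-neighbour vector quantisation.
    """
    # Squared Euclidean distance between every latent and every code: shape (n, k)
    d2 = ((latents[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    return d2.argmin(axis=1)


def poincare_distance(u, v, eps=1e-9):
    """Geodesic distance between two points inside the unit Poincare ball.

    Assumes the Poincare ball model; the source does not name a model.
    """
    sq_u = float(np.dot(u, u))
    sq_v = float(np.dot(v, v))
    sq_diff = float(np.dot(u - v, u - v))
    # eps guards against division by zero for points numerically on the boundary
    denom = max((1.0 - sq_u) * (1.0 - sq_v), eps)
    return float(np.arccosh(1.0 + 2.0 * sq_diff / denom))


# Toy codebook of three "symbols" and three continuous latent vectors (made-up values)
codebook = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
latents = np.array([[0.9, 0.1], [0.1, 0.8], [0.05, -0.02]])
symbols = quantise(latents, codebook)

# Distance from the origin to a point at Euclidean radius 0.5 equals ln(3)
d = poincare_distance(np.array([0.0, 0.0]), np.array([0.5, 0.0]))
```

In the Poincaré ball, distances grow rapidly towards the boundary, which is why tree-like hierarchies can be embedded with low distortion: abstract nodes can sit near the origin and fine-grained ones near the boundary.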