Continuous-time models such as Neural ODEs and Neural Flows have shown
p...
State-of-the-art AI models largely lack an understanding of the cause-ef...
Adding interpretability to word embeddings represents an area of active
...
In recent years, several metrics have been developed for evaluating grou...
We present evidence for the existence and effectiveness of adversarial
a...
We present GetFair, a novel framework for tuning fairness of classificat...
In this paper, we introduce Integrated Directional Gradients (IDG), a me...
This work quantifies the effects of signaling and performing gender on t...
We introduce POLAR - a framework that adds interpretability to pre-train...
Wikipedia can easily be justified as a behemoth, considering the sheer v...
A `peer-review system' in the context of judging research contributions,...