Neural networks are known to exploit spurious artifacts (or shortcuts) t...
Large transformer-based pre-trained language models have achieved impres...
Pre-trained Language Models (PTLMs) have been shown to perform well on
n...
It is widely accepted in the mode connectivity literature that when two
...
Despite the recent advancements of attention-based deep learning
archite...
As the COVID-19 pandemic sweeps across the world, it has been accompanie...