Recent studies on transformer-based language models show that they can a...
What is the computational model behind a Transformer? Where recurrent ne...
We present an algorithm for extracting a subclass of the context free
gr...
We develop a formal hierarchy of the expressive capacity of RNN
architec...
We present an algorithm for extraction of a probabilistic deterministic
...
While Recurrent Neural Networks (RNNs) are famously known to be Turing
c...