Publication

Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention