Publication

Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention

Related publications (58)

About
Privacy
Disclaimer

Graph Chatbot

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

Publications related to Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention | EPFL Graph Search

Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention

Graph Chatbot

Chat with Graph Search

Random matrix methods for high-dimensional machine learning models

Generalization of Scaled Deep ResNets in the Mean-Field Regime

Randomized low-rank approximation and its applications

Sparse autoregressive neural networks for classical spin systems

Transformer Models for Vision

Fundamental Limits in Statistical Learning Problems: Block Models and Neural Networks

Deep Learning Generalization with Limited and Noisy Labels

DARE-GRAM : Unsupervised Domain Adaptation Regression by Aligning Inverse Gram Matrices

Learned Compressive Representations for Single-Photon 3D Imaging

Linear Complexity Self-Attention With 3rd Order Polynomials