Skip to main content
Publication

Attention with Markov: A Curious Case of Single-layer Transformers