Publication

Accelerator-driven Data Arrangement to Minimize Transformers Run-time on Multi-core Architectures