Revisiting Offline Compression: Going Beyond Factorization-based Methods for Transformer Language Models