David Atienza Alonso, Giovanni Ansaloni, Miguel Peon Quiros, Flavio Ponzina
Codebook-based optimizations are a class of algorithmic-level transformations that can effectively reduce the computing and memory requirements of Convolutional Neural Networks (CNNs). This approach tightly limits the number of unique weights in each layer, ...
2022
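
The codebook idea in the abstract, limiting each layer to a small set of unique weight values, can be illustrated with a minimal sketch. It assumes plain 1-D k-means weight sharing, a generic technique and not necessarily the method of the paper. The function name `build_codebook`, the layer shape, and the codebook size `k=16` are hypothetical choices made for illustration.

```python
import numpy as np

def build_codebook(weights, k, iters=20):
    """Cluster one layer's weights into k shared values (1-D k-means).

    Assumption: generic weight sharing, not the paper's exact procedure.
    """
    assert k <= 256, "indices are stored as uint8 in this sketch"
    flat = weights.ravel().astype(np.float64)
    # Start with centroids spread evenly over the weight range.
    centroids = np.linspace(flat.min(), flat.max(), k)
    for _ in range(iters):
        # Assign every weight to its nearest centroid.
        idx = np.abs(flat[:, None] - centroids[None, :]).argmin(axis=1)
        # Move each non-empty centroid to the mean of its members.
        for j in range(k):
            members = flat[idx == j]
            if members.size:
                centroids[j] = members.mean()
    # Final assignment against the updated centroids.
    idx = np.abs(flat[:, None] - centroids[None, :]).argmin(axis=1)
    return centroids, idx.reshape(weights.shape).astype(np.uint8)

# Hypothetical convolutional layer: 16 filters, 3 input channels, 3x3 kernels.
rng = np.random.default_rng(0)
layer = rng.normal(size=(16, 3, 3, 3))

codebook, indices = build_codebook(layer, k=16)
# Rebuild the layer from the codebook: at most k unique weight values remain.
quantized = codebook[indices]
```

In this sketch the layer is stored as one small index per weight plus a `k`-entry table of values, which is where the memory saving of a codebook representation comes from.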