Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition (2109.14710v2)

Published 29 Sep 2021 in cs.CV and cs.LG

Abstract: Modern Convolutional Neural Network (CNN) architectures, despite their superiority in solving various problems, are generally too large to be deployed on resource constrained edge devices. In this paper, we reduce memory usage and floating-point operations required by convolutional layers in CNNs. We compress these layers by generalizing the Kronecker Product Decomposition to apply to multidimensional tensors, leading to the Generalized Kronecker Product Decomposition (GKPD). Our approach yields a plug-and-play module that can be used as a drop-in replacement for any convolutional layer. Experimental results for image classification on CIFAR-10 and ImageNet datasets using ResNet, MobileNetv2 and SeNet architectures substantiate the effectiveness of our proposed approach. We find that GKPD outperforms state-of-the-art decomposition methods including Tensor-Train and Tensor-Ring as well as other relevant compression methods such as pruning and knowledge distillation.

Authors (4)

Marawan Gamal Abdel Hameed (4 papers)
Marzieh S. Tahaei (6 papers)
Ali Mosleh (10 papers)
Vahid Partovi Nia (40 papers)

Citations (19)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition (2109.14710v2)

Summary

Related Papers