Diversity-aware Channel Pruning for StyleGAN Compression (2403.13548v1)
Abstract: StyleGAN has shown remarkable performance in unconditional image generation. However, its high computational cost poses a significant challenge for practical applications. Although recent efforts have been made to compress StyleGAN while preserving its performance, existing compressed models still lag behind the original model, particularly in terms of sample diversity. To overcome this, we propose a novel channel pruning method that leverages the varying sensitivities of channels to latent vectors, a key factor in sample diversity. Specifically, by assessing the importance of each channel based on its sensitivity to latent vector perturbations, our method enhances the diversity of samples in the compressed model. Since our method focuses solely on the channel pruning stage, it complements prior training schemes at no additional training cost. Extensive experiments demonstrate that our method significantly enhances sample diversity across various datasets. Moreover, in terms of FID scores, our method not only surpasses the state of the art by a large margin but also achieves comparable scores with only half the training iterations.
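The idea of scoring channels by their sensitivity to latent perturbations can be illustrated with a small NumPy sketch. This is not the paper's scoring rule, which the abstract does not spell out. It is a simplified proxy that assumes sensitivity can be approximated by the mean absolute change in a channel's activation when the latent vector is perturbed with Gaussian noise. The names `make_toy_layer`, `channel_sensitivity`, and `keep_mask`, the toy layer itself, and the `noise_scale` and `keep_ratio` parameters are all hypothetical and chosen for illustration only.

```python
import numpy as np


def make_toy_layer(latent_dim=8, channels=6, size=4, seed=0):
    """Hypothetical stand-in for one generator layer: latents (N, D) -> maps (N, C, H, W)."""
    rng = np.random.default_rng(seed)
    weights = rng.standard_normal((latent_dim, channels))
    weights[:, :3] = 0.0  # channels 0-2 ignore the latent entirely
    bias = rng.standard_normal(channels)

    def layer_fn(z):
        act = np.tanh(z @ weights + bias)  # (N, C)
        # Repeat each channel value over a size x size spatial grid.
        return np.broadcast_to(act[:, :, None, None],
                               (z.shape[0], channels, size, size))

    return layer_fn


def channel_sensitivity(layer_fn, latents, noise_scale=0.1, seed=0):
    """Assumed proxy: mean absolute activation change per channel under latent noise."""
    rng = np.random.default_rng(seed)
    perturbed = latents + noise_scale * rng.standard_normal(latents.shape)
    diff = np.abs(layer_fn(perturbed) - layer_fn(latents))
    return diff.mean(axis=(0, 2, 3))  # average over batch and spatial dims -> (C,)


def keep_mask(scores, keep_ratio):
    """Boolean mask that keeps the most latent-sensitive channels."""
    n_keep = max(1, int(round(keep_ratio * scores.size)))
    keep = np.argsort(scores)[::-1][:n_keep]  # indices of the highest scores
    mask = np.zeros(scores.size, dtype=bool)
    mask[keep] = True
    return mask


layer = make_toy_layer()
z = np.random.default_rng(1).standard_normal((64, 8))
scores = channel_sensitivity(layer, z)
mask = keep_mask(scores, keep_ratio=0.5)
print(scores)
print(mask)
```

In this toy setup, the three channels that ignore the latent get a score of zero and are the ones pruned, while the channels that respond to the latent are kept. A real pipeline would compute scores on an actual StyleGAN generator and then fine-tune the pruned model.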