MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition (2208.05768v1)

Published 11 Aug 2022 in cs.CV

Abstract: Unlike conventional Knowledge Distillation (KD), Self-KD allows a network to learn knowledge from itself without guidance from any extra network. This paper proposes to perform Self-KD from image Mixture (MixSKD), which integrates these two techniques into a unified framework. MixSKD mutually distills feature maps and probability distributions between random pairs of original images and their mixup images in a meaningful way, guiding the network to learn cross-image knowledge by modelling supervisory signals from mixup images. Moreover, we construct a self-teacher network by aggregating multi-stage feature maps to provide soft labels that supervise the backbone classifier, further improving the efficacy of self-boosting. Experiments on image classification, and on transfer learning to object detection and semantic segmentation, demonstrate that MixSKD outperforms other state-of-the-art Self-KD and data augmentation methods. The code is available at https://github.com/winycg/Self-KD-Lib.
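
The mixup-based distillation idea admits a short sketch. Below is a minimal, hypothetical PyTorch rendition of one direction of the mutual distillation described in the abstract: the prediction on a mixup image is pulled toward the matching convex combination of the network's own predictions on the two source images. The function name `mixskd_style_loss`, the temperature, the equal loss weighting, and the omission of the feature-map and self-teacher branches are all simplifications of this sketch, not the authors' implementation; see https://github.com/winycg/Self-KD-Lib for the real code.

```python
import torch
import torch.nn.functional as F

def mixskd_style_loss(model, x1, x2, y1, y2, alpha=0.2, temperature=4.0):
    """Hypothetical sketch of a Mixup-based self-distillation objective.

    `model` maps a batch of images to classification logits. The exact
    losses and branch structure in MixSKD differ; this only illustrates
    the probability-distribution part of the idea.
    """
    # Sample a mixup coefficient and build the mixed batch.
    lam = torch.distributions.Beta(alpha, alpha).sample().item()
    x_mix = lam * x1 + (1.0 - lam) * x2

    # Three forward passes through the same network: Self-KD uses no
    # separate teacher model.
    logits1, logits2, logits_mix = model(x1), model(x2), model(x_mix)

    # Supervised cross-entropy on the original and mixed samples
    # (the standard mixup training loss).
    ce = F.cross_entropy(logits1, y1) + F.cross_entropy(logits2, y2)
    ce_mix = (lam * F.cross_entropy(logits_mix, y1)
              + (1.0 - lam) * F.cross_entropy(logits_mix, y2))

    # Distillation term: the mixed prediction should match the convex
    # combination of the two original predictions. Gradients are stopped
    # through the target, a common (assumed) choice for soft labels.
    p_target = (lam * F.softmax(logits1 / temperature, dim=1)
                + (1.0 - lam) * F.softmax(logits2 / temperature, dim=1)).detach()
    log_p_mix = F.log_softmax(logits_mix / temperature, dim=1)
    kd = F.kl_div(log_p_mix, p_target, reduction="batchmean") * temperature ** 2

    return ce + ce_mix + kd
```

The paper's full objective additionally aligns intermediate feature maps between the original and mixup branches and distills soft labels from an aggregated multi-stage self-teacher into the backbone classifier; both are omitted above for brevity.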

Authors (8)
  1. Chuanguang Yang (36 papers)
  2. Zhulin An (43 papers)
  3. Helong Zhou (9 papers)
  4. Linhang Cai (6 papers)
  5. Xiang Zhi (2 papers)
  6. Jiwen Wu (1 paper)
  7. Yongjun Xu (81 papers)
  8. Qian Zhang (308 papers)
Citations (40)