Insights on Maintaining Discrimination and Fairness in Class Incremental Learning
The paper "Maintaining Discrimination and Fairness in Class Incremental Learning" addresses the challenging issue of catastrophic forgetting in the context of class incremental learning with Deep Neural Networks (DNNs). The authors, Zhao et al., propose an innovative approach called Weight Aligning (WA) to maintain both discrimination and fairness in model predictions, effectively mitigating the problem of bias in the classification of new and old classes.
Key Contributions
This research identifies and tackles a critical limitation of traditional class incremental learning frameworks. While Knowledge Distillation (KD) is acknowledged for preserving discrimination within old classes, the paper shows it is insufficient to prevent the skew toward newly introduced classes, a phenomenon induced by biased weights in the final classifier layer of the model. The primary contributions outlined by the authors are:
- Investigation of KD Effects: The paper critically evaluates the dual roles of KD, showing that while it is effective at maintaining discrimination within previously learned classes, it does not address the weight bias that pulls the model toward classifying inputs from old classes as new ones (a distillation term of the kind sketched after this list).
- Introduction of Weight Aligning (WA): WA is a computationally straightforward method that recalibrates the weights of the model's fully connected layer after each training phase. By rescaling the new-class weight vectors so their norms match those of the old classes, WA restores the balance between old and new classes in the model's outputs, leading to fairer predictions (see the rescaling sketch following this list).
- Simplified Approach Without Additional Resource Requirements: WA introduces no extra parameters or hyperparameters and needs no held-out validation set, making it a resource-efficient method that integrates readily into existing class incremental learning systems.
- Comprehensive Experimentation and Validation: The method was rigorously tested on diverse datasets including ImageNet and CIFAR-100. On ImageNet-1000, the proposed method delivered a substantial performance gain over previous approaches, underscoring the empirical soundness of WA.
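To ground the KD discussion, here is a minimal PyTorch sketch of a distillation term of the kind used in class incremental learning: the current model is penalized for drifting, on the old classes, from the frozen previous-step model. This is an illustration rather than the authors' code; the function name and the temperature value `T` are assumptions of the sketch.

```python
import torch
import torch.nn.functional as F

def distillation_loss(new_logits, old_logits, T=2.0):
    """Knowledge distillation term computed over the OLD classes only.

    new_logits: logits of the current model restricted to old classes,
                shape (batch, n_old).
    old_logits: logits from the frozen previous-step (teacher) model,
                shape (batch, n_old).
    T:          softening temperature (illustrative value, not from the paper).
    """
    # Soften both distributions with temperature T.
    log_p_new = F.log_softmax(new_logits / T, dim=1)
    p_old = F.softmax(old_logits / T, dim=1)
    # Cross-entropy between teacher and student distributions.
    return -(p_old * log_p_new).sum(dim=1).mean()
```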
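The Weight Aligning correction itself amounts to a single rescaling of the final layer after training on a new task: following the paper's description, the new-class weight vectors are multiplied by the ratio of the mean old-class weight norm to the mean new-class weight norm. A minimal sketch, with the function name, the class layout (old classes first), and the untouched bias terms all being assumptions of this illustration:

```python
import torch

@torch.no_grad()
def weight_align(fc_weight, n_old):
    """Rescale new-class weight vectors so their average norm matches
    that of the old-class weight vectors.

    fc_weight: weight matrix of the final fully connected layer,
               shape (n_old + n_new, feature_dim), old classes first.
    n_old:     number of previously learned classes.
    """
    norms = fc_weight.norm(dim=1)                       # per-class weight norms
    gamma = norms[:n_old].mean() / norms[n_old:].mean() # alignment factor
    fc_weight[n_old:] *= gamma                          # shrink new-class weights
    return fc_weight
```

In practice one would call something like `weight_align(model.fc.weight.data, n_old)` once after each incremental training phase; the attribute path `model.fc` is an assumption about the architecture. Because the correction is a one-time rescaling, it adds no trainable parameters and no hyperparameters, consistent with the resource claims above.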
Theoretical and Practical Implications
The theoretical contribution of the paper extends beyond its empirical results by reflecting critically on the internal dynamics of model weights in the class incremental learning paradigm. Analyzing the norms of the classifier weights and proposing post-training corrections introduces a new lens for evaluating fairness and bias in learning, one that could influence future work on model fine-tuning; the diagnostic sketch below illustrates this lens.
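As an illustration, a few lines suffice to check whether the new-class weights of a trained classifier dominate. This is again a sketch, assuming the same layout as before (old classes first, new classes last):

```python
import torch

@torch.no_grad()
def report_weight_bias(fc_weight, n_old):
    """Compare average weight norms of old vs. new classes.

    A new/old ratio well above 1 signals the prediction bias
    toward new classes that Weight Aligning corrects.
    """
    norms = fc_weight.norm(dim=1)
    old_mean = norms[:n_old].mean().item()
    new_mean = norms[n_old:].mean().item()
    print(f"old-class mean norm: {old_mean:.3f}")
    print(f"new-class mean norm: {new_mean:.3f}")
    print(f"new/old ratio:       {new_mean / old_mean:.3f}")
```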
Practically, the paper proposes an adaptable technique that can be absorbed efficiently into pre-existing learning pipelines. Its ease of integration and negligible computational overhead make WA a feasible solution for large-scale, dynamic real-world applications where class incremental learning is required.
Future Prospects
The implications of this work open several avenues for further research. First, a deeper investigation into the long-term impact of errors accumulated across many incremental learning steps could prove insightful. Additionally, combining WA with complementary strategies such as generative replay or model pre-training might yield robust hybrids that push the limits of continual learning. Finally, research could adapt WA-like corrections to architectures beyond typical DNN structures.
In summary, this paper presents a meticulous investigation of, and solution to, a fundamental problem in incremental learning, with a methodological clarity and practical viability that could underpin future advances in the field.