
Open Domain Generalization with a Single Network by Regularization Exploiting Pre-trained Features (2312.05141v1)

Published 8 Dec 2023 in cs.CV

Abstract: Open Domain Generalization (ODG) is a challenging task as it deals not only with distribution shifts but also with category shifts between the source and target datasets. To handle this task, the model has to learn a generalizable representation that can be applied to unseen domains while also identifying unknown classes that were not present during training. Previous work has used multiple source-specific networks, which incur a high computation cost. Therefore, this paper proposes a method that can handle ODG using only a single network. The proposed method utilizes a head that is pre-trained by linear-probing and employs two regularization terms, targeting the feature extractor and the classification head, respectively. The two regularization terms fully utilize the pre-trained features and collaborate to modify the head of the model without excessively altering the feature extractor. This ensures a smoother softmax output and prevents the model from being biased towards the source domains. The proposed method shows improved adaptability to unseen domains and increased capability to detect unseen classes as well. Extensive experiments show that our method achieves competitive performance on several benchmarks. We also justify our method with careful analysis of its effect on the logits, features, and the head.

References (44)
  1. 2008. Spearman Rank Correlation Coefficient, 502–505. New York, NY: Springer New York. ISBN 978-0-387-32833-1.
  2. Swad: Domain generalization by seeking flat minima. Advances in Neural Information Processing Systems, 34: 22405–22418.
  3. Domain Generalization by Mutual-Information Regularization with Pre-trained Models. arXiv preprint arXiv:2203.10789.
  4. An analysis of single-layer networks in unsupervised feature learning. In International Conference on Artificial Intelligence and Statistics (AISTATS), 215–223.
  5. Imagenet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, 248–255. IEEE.
  6. Domain generalization via model-agnostic learning of semantic features. Advances in Neural Information Processing Systems, 32.
  7. Learning to Detect Open Classes for Universal Domain Adaptation. In European Conference on Computer Vision (ECCV).
  8. Scatter component analysis: A unified framework for domain adaptation and domain generalization. IEEE transactions on pattern analysis and machine intelligence, 39(7): 1414–1430.
  9. Domain generalization for object recognition with multi-task autoencoders. In Proceedings of the IEEE international conference on computer vision, 2551–2559.
  10. Deep residual learning for image recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778.
  11. Universal Language Model Fine-tuning for Text Classification. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 328–339.
  12. Self-Challenging Improves Cross-Domain Generalization. In European Conference on Computer Vision (ECCV).
  13. Averaging weights leads to wider optima and better generalization. arXiv preprint arXiv:1803.05407.
  14. Partial transfusion: on the expressive influence of trainable batch norm parameters for transfer learning. In Medical Imaging with Deep Learning, 338–353. PMLR.
  15. Selfreg: Self-supervised contrastive regularization for domain generalization. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 9619–9628.
  16. Fine-Tuning can Distort Pretrained Features and Underperform Out-of-Distribution. In International Conference on Learning Representations.
  17. End-to-end training of deep visuomotor policies. The Journal of Machine Learning Research, 17(1): 1334–1373.
  18. Learning to generalize: Meta-learning for domain generalization. In Proceedings of the AAAI conference on artificial intelligence, volume 32.
  19. Deeper, broader and artier domain generalization. In IEEE International Conference on Computer Vision (ICCV), 5543–5551. IEEE.
  20. Domain generalization with adversarial feature learning. In Proceedings of the IEEE conference on computer vision and pattern recognition, 5400–5409.
  21. Deep domain generalization via conditional invariant adversarial networks. In Proceedings of the European Conference on Computer Vision (ECCV), 624–639.
  22. Feature-Critic Networks for Heterogeneous Domain Generalization. In International Conference on Machine Learning (ICML), volume 97, 3915–3924.
  23. Enhancing the reliability of out-of-distribution image detection in neural networks. arXiv preprint arXiv:1706.02690.
  24. Towards Recognizing Unseen Categories in Unseen Domains. In European Conference on Computer Vision (ECCV).
  25. Domain generalization via invariant feature representation. In International Conference on Machine Learning, 10–18. PMLR.
  26. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Advances in Neural Information Processing Systems 32, 8024–8035. Curran Associates, Inc.
  27. Moment matching for multi-source domain adaptation. In IEEE International Conference on Computer Vision (ICCV), 1406–1415.
  28. VisDA: The Visual Domain Adaptation Challenge.
  29. Efficient domain generalization via common-specific low-rank decomposition. In International Conference on Machine Learning, 7728–7738. PMLR.
  30. Adapting Visual Category Models to New Domains. In European Conference on Computer Vision (ECCV).
  31. Semi-supervised domain adaptation via minimax entropy. In Proceedings of the IEEE/CVF international conference on computer vision, 8050–8058.
  32. Generalizing Across Domains via Cross-Gradient Training. In International Conference on Learning Representations.
  33. Open domain generalization with domain-augmented meta-learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 9624–9633.
  34. Open-Set Recognition: A Good Closed-Set Classifier is All You Need. In International Conference on Learning Representations.
  35. Deep Hashing Network for Unsupervised Domain Adaptation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
  36. Learning robust global representations by penalizing local predictive power. In Advances in Neural Information Processing Systems (NeurIPS), 10506–10518.
  37. Composed fine-tuning: Freezing pre-trained denoising autoencoders for improved generalization. In International Conference on Machine Learning, 11424–11435. PMLR.
  38. Universal domain adaptation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2720–2729.
  39. Lit: Zero-shot transfer with locked-image text tuning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 18123–18133.
  40. Domain generalization via entropy regularization. Advances in Neural Information Processing Systems, 33: 16096–16107.
  41. Learning placeholders for open-set recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4401–4410.
  42. Learning to generate novel domains for domain generalization. In European conference on computer vision, 561–578. Springer.
  43. Domain Generalization with MixStyle. In International Conference on Learning Representations.
  44. CrossMatch: Cross-Classifier Consistency Regularization for Open-Set Single Domain Generalization. In International Conference on Learning Representations.
Authors (3)
  1. Inseop Chung (13 papers)
  2. KiYoon Yoo (13 papers)
  3. Nojun Kwak (116 papers)

Summary

Open Domain Generalization with a Single Network by Regularization Exploiting Pre-trained Features

This paper presents a novel approach to Open Domain Generalization (ODG), tackling both distribution shifts and category shifts between source and target datasets with a single neural network. Unlike preceding methods that employ multiple source-specific networks at significant computational cost, the paper proposes a more resource-efficient architecture that integrates two regularization terms to leverage pre-trained features, enabling better generalization to unseen domains and improved identification of unknown classes.

Key Methodological Innovations

The paper introduces several key methodological contributions:

  1. Network Architecture:
    • Utilizes a single network comprising a feature extractor and a classifier head.
    • Leverages a classifier head pre-trained by linear probing, avoiding a complete overhaul of the feature-extractor weights and thereby preserving beneficial pre-trained features (see the sketch after this list).
  2. Regularization Terms:
    • Feature Regularization: Clusters source-domain features by minimizing their distance to class prototypes computed from the pre-trained features.
    • Head Regularization: Maximizes the entropy of the softmax output on pre-trained feature inputs, pushing the head toward smoother predictions that generalize beyond the source domains (a combined sketch of both terms follows the next paragraph).
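The linear-probing initialization referenced in item 1 can be pictured with a short PyTorch sketch: the pre-trained backbone is frozen and only a fresh linear head is trained on source data, after which that head initializes the classifier. The ResNet-50 backbone, class count, and optimizer settings below are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn
import torchvision.models as models

# Assumed setup: an ImageNet-pretrained ResNet-50 with its final fc layer
# removed serves as the feature extractor; only a new linear head is trained.
backbone = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)
feat_dim = backbone.fc.in_features
backbone.fc = nn.Identity()            # expose the pre-trained features
for p in backbone.parameters():
    p.requires_grad = False            # freeze the backbone during linear probing

num_known_classes = 20                 # illustrative number of source classes
head = nn.Linear(feat_dim, num_known_classes)
optimizer = torch.optim.SGD(head.parameters(), lr=1e-2, momentum=0.9)
criterion = nn.CrossEntropyLoss()

def linear_probe_step(images, labels):
    """One linear-probing step: only the head receives gradient updates."""
    with torch.no_grad():
        feats = backbone(images)       # pre-trained features, backbone untouched
    loss = criterion(head(feats), labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```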

Together, these regularization terms prevent the network from overfitting to the source domains, yielding a smoother softmax output that resists bias toward the source domains.
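Below is a minimal sketch of how the two regularization terms might enter the training objective, assuming the feature term is a squared distance to per-class prototypes computed from the frozen pre-trained extractor and the head term is a negative-entropy penalty on the softmax of the head applied to pre-trained features. The exact loss forms and weighting coefficients are assumptions; the paper's formulation may differ.

```python
import torch
import torch.nn.functional as F

def feature_regularization(feats, labels, prototypes):
    """Pull current source features toward class prototypes that were
    pre-computed from the frozen, pre-trained feature extractor.
    `prototypes` has shape (num_classes, feat_dim)."""
    target = prototypes[labels]            # prototype of each sample's class
    return F.mse_loss(feats, target)       # squared-distance penalty (assumed form)

def head_regularization(head, pretrained_feats):
    """Encourage a smoother softmax: maximize the entropy of the head's
    predictions on pre-trained features (so the loss is the negative entropy)."""
    probs = F.softmax(head(pretrained_feats), dim=1)
    entropy = -(probs * probs.clamp_min(1e-8).log()).sum(dim=1).mean()
    return -entropy                        # minimizing this maximizes entropy

def total_loss(logits, labels, feats, pretrained_feats, prototypes, head,
               lambda_feat=0.1, lambda_head=0.1):
    """Overall objective: cross-entropy plus the two regularizers.
    The lambda weights are illustrative, not the paper's values."""
    ce = F.cross_entropy(logits, labels)
    return (ce
            + lambda_feat * feature_regularization(feats, labels, prototypes)
            + lambda_head * head_regularization(head, pretrained_feats))
```

In a training loop, `feats` and `logits` would come from the current (fine-tuned) extractor and head, while `pretrained_feats` and `prototypes` would be computed once from the frozen pre-trained extractor.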

Empirical Results

The proposed method is evaluated extensively on benchmark datasets such as PACS and Office-Home, as well as a Multi-Dataset scenario. Key results include:

  • Performance: The method achieves competitive performance, often surpassing existing ensemble methods like DAML in terms of accuracy and harmonic mean scores.
  • Efficiency: With only one network, the approach reduces both inference cost and computational demand significantly compared to multi-network solutions.

Analysis and Theoretical Insights

  • The paper provides thorough quantitative analyses, including measurements of domain gap and intra-class variance, showing that the approach narrows the domain gap while maintaining smaller intra-class variance (a sketch of such metrics follows this list).
  • It also reports greater adaptability of the model under the proposed regularization, evidenced by smoother class probabilities and reduced bias toward source-domain classes.
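For intuition, the sketch below shows one plausible way to compute such metrics from extracted feature embeddings: the domain gap as the distance between per-domain feature means, and intra-class variance as the mean squared distance of features to their class centroid. These are illustrative definitions; the paper's exact measurement protocol may differ.

```python
import torch

def domain_gap(feats_a: torch.Tensor, feats_b: torch.Tensor) -> float:
    """Simple proxy for domain gap: Euclidean distance between the mean
    feature vectors of two domains."""
    return torch.norm(feats_a.mean(dim=0) - feats_b.mean(dim=0)).item()

def intra_class_variance(feats: torch.Tensor, labels: torch.Tensor) -> float:
    """Average squared distance of each feature to its class centroid."""
    sq_dists = []
    for c in labels.unique():
        class_feats = feats[labels == c]
        centroid = class_feats.mean(dim=0)
        sq_dists.append(((class_feats - centroid) ** 2).sum(dim=1))
    return torch.cat(sq_dists).mean().item()
```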

Implications and Future Directions

This research contributes to the ongoing discourse in domain generalization by demonstrating that careful use of pre-trained features can substantially enhance performance even under the constraint of a single network. The simplified architecture has practical implications for deploying domain generalization systems in resource-constrained environments such as mobile devices or IoT systems.

Theoretical implications point toward the enduring value of pre-trained models and the potential to utilize them beyond the current transfer learning paradigms. Future research could explore expanding this regularization framework to other architectures or tasks, potentially broadening the toolkit for tackling both seen and unseen classes more efficiently and effectively.

In conclusion, the paper provides a meaningful step forward in the domain generalization field, combining the efficiency of single-network solutions with the robustness required to handle open domain scenarios. This methodology sets a precedent for further reducing computational loads while enhancing system adaptability to uncharted domains and data distributions.