Benchmarking Segmentation Models with Mask-Preserved Attribute Editing (2403.01231v2)

Published 2 Mar 2024 in cs.CV

Abstract: When deploying segmentation models in practice, it is critical to evaluate their behavior in varied and complex scenes. Unlike previous evaluation paradigms, which consider only global attribute variations (e.g., adverse weather), we investigate both local and global attribute variations for robustness evaluation. To achieve this, we construct a mask-preserved attribute editing pipeline that edits the visual attributes of real images with precise control of structural information, so the original segmentation labels can be reused for the edited images. Using our pipeline, we construct a benchmark covering both object and image attributes (e.g., color, material, pattern, style). We evaluate a broad variety of semantic segmentation models, from conventional closed-set models to recent open-vocabulary large models, on their robustness to different types of variations. We find that both local and global attribute variations affect segmentation performance and that models' sensitivity diverges across variation types. We argue that local attributes are as important as global attributes and should be considered in the robustness evaluation of segmentation models. Code: https://github.com/PRIS-CV/Pascal-EA.

Authors (5)
  1. Zijin Yin (5 papers)
  2. Kongming Liang (29 papers)
  3. Bing Li (374 papers)
  4. Zhanyu Ma (103 papers)
  5. Jun Guo (130 papers)
Citations (1)

Summary

Overview

Benchmarking the robustness of segmentation models is pivotal, especially when evaluating their resilience to attribute variations. This paper introduces a mask-preserved attribute editing pipeline that assesses robustness under both local and global attribute variations. Using a pre-trained diffusion model guided by text instructions, the pipeline precisely edits visual attributes of real images while preserving their structural information. Because the structure is unchanged, the original segmentation labels can be reused for the edited images, avoiding costly re-annotation.
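
One way to realize such structure-preserving edits is to condition a diffusion model on the segmentation layout itself. The sketch below uses a publicly available segmentation-conditioned ControlNet for illustration; the checkpoints, prompt, and file names are assumptions, not the authors' released pipeline.

```python
# Illustrative sketch (not the authors' code): structure-preserving
# attribute editing with a segmentation-conditioned ControlNet.
import torch
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline
from diffusers.utils import load_image

# Publicly available checkpoints, assumed here for illustration.
controlnet = ControlNetModel.from_pretrained(
    "lllyasviel/sd-controlnet-seg", torch_dtype=torch.float16
)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",
    controlnet=controlnet,
    torch_dtype=torch.float16,
).to("cuda")

# The color-coded segmentation map fixes the scene layout, so the
# generated image stays spatially aligned with the original mask.
seg_map = load_image("seg_map.png")
edited = pipe(
    prompt="a photo of a wooden chair in a living room",  # a material edit
    image=seg_map,
    num_inference_steps=30,
).images[0]
edited.save("edited.png")
```

Because the layout is pinned by the conditioning map, the edited image can be scored against the unchanged ground-truth mask.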

Robustness Evaluation via Attribute Variations

The core challenge in evaluating segmentation model robustness has been the scarcity of high-quality, varied test data reflecting both local and global attribute changes. Traditional datasets and benchmarks address this need only partially: they focus mainly on global variations such as weather conditions and do not cover local attribute variations such as changes to an object's color, material, or pattern within the scene.

The proposed pipeline addresses these limitations by generating test images that span a wide range of attribute variations. The paper's experiments demonstrate that both local and global attribute changes significantly impact segmentation performance. Notably, models show varying sensitivities to different types of attribute changes, with object material variations causing the most pronounced performance declines. These findings underscore the role of object attributes in segmentation robustness and challenge the prevailing emphasis on global attributes alone. A minimal sketch of the evaluation protocol follows.
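
Because the edits preserve object masks, the same ground-truth labels score both the original and edited versions of each image. In the sketch below, `model`, the data lists, and the class count are placeholders, not the authors' released evaluation code.

```python
import numpy as np

def miou(pred: np.ndarray, gt: np.ndarray, num_classes: int) -> float:
    """Mean IoU over the classes that actually occur in pred or gt."""
    ious = []
    for c in range(num_classes):
        inter = np.logical_and(pred == c, gt == c).sum()
        union = np.logical_or(pred == c, gt == c).sum()
        if union > 0:
            ious.append(inter / union)
    return float(np.mean(ious))

def robustness_drop(model, originals, edited, labels, num_classes=21):
    """Performance gap between original and attribute-edited images,
    reusing the same ground-truth labels for both."""
    base = np.mean([miou(model(x), y, num_classes)
                    for x, y in zip(originals, labels)])
    shifted = np.mean([miou(model(x), y, num_classes)
                       for x, y in zip(edited, labels)])
    return base - shifted  # larger drop = less robust to this variation
```

Computing this drop separately per variation type (color, material, pattern, style) exposes which attributes a model is most sensitive to.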

Methodological Contributions

This research makes several notable contributions. Firstly, it introduces a mask-preserved attribute editing pipeline that enables the generation of images with varied attributes without the need for re-annotating segmentation labels. Secondly, it explores segmentation model robustness across a breadth of object and image attribute variations, providing a comprehensive assessment framework. Lastly, extensive experiments reveal that segmentation models exhibit varying degrees of sensitivity to different attribute variations, offering insights into model robustness and potential areas for improvement.

Future Directions

This work opens numerous avenues for further research. One promising direction is the refinement of the mask-preserved attribute editing technique to minimize spurious attribute changes during the editing process. Additionally, expanding the attribute set and conducting more granular analyses of model responses to specific attribute variations could yield deeper insights. Future work could also explore the integration of the proposed pipeline into the training process as a novel data augmentation strategy to enhance model robustness.
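
The augmentation idea floated above could look like the hypothetical sketch below: edited variants are randomly swapped in during training, each paired with the single shared segmentation mask. The directory layout and file naming are assumptions for illustration.

```python
import random
from pathlib import Path

import numpy as np
from PIL import Image
from torch.utils.data import Dataset

def load_pair(img_path, mask_path):
    """Load an image and its (unedited) segmentation mask as arrays."""
    img = np.asarray(Image.open(img_path).convert("RGB"))
    mask = np.asarray(Image.open(mask_path))
    return img, mask

class AttributeEditedSegDataset(Dataset):
    """Randomly serves an attribute-edited variant of each training image,
    always paired with the original image's segmentation mask."""

    def __init__(self, root: str, p_edit: float = 0.5):
        self.root = Path(root)
        self.items = sorted((self.root / "images").glob("*.jpg"))
        self.p_edit = p_edit

    def __len__(self):
        return len(self.items)

    def __getitem__(self, i):
        img_path = self.items[i]
        # Edited variants (e.g. color/material/pattern) are assumed to live
        # in a sibling folder keyed by the original image's stem.
        variants = list((self.root / "edited" / img_path.stem).glob("*.jpg"))
        if variants and random.random() < self.p_edit:
            img_path = random.choice(variants)
        mask_path = self.root / "masks" / f"{self.items[i].stem}.png"
        return load_pair(img_path, mask_path)
```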

Conclusion

In conclusion, this paper's mask-preserved attribute editing pipeline represents a significant advancement in benchmarking segmentation model robustness. By facilitating the generation of test images with a wide range of attribute variations while preserving structural integrity, this approach addresses a critical gap in the evaluation of segmentation models. The findings highlight the importance of object attributes in segmentation robustness and suggest new pathways for enhancing model performance.
