A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains (1912.11186v3)

Published 24 Dec 2019 in cs.CV

Abstract: Recently proposed methods for weakly-supervised semantic segmentation have achieved impressive performance in predicting pixel classes despite being trained with only image labels which lack positional information. Because image annotations are cheaper and quicker to generate, weak supervision is more practical than full supervision for training segmentation algorithms. These methods have been predominantly developed to solve the background separation and partial segmentation problems presented by natural scene images and it is unclear whether they can be simply transferred to other domains with different characteristics, such as histopathology and satellite images, and still perform well. This paper evaluates state-of-the-art weakly-supervised semantic segmentation methods on natural scene, histopathology, and satellite image datasets and analyzes how to determine which method is most suitable for a given dataset. Our experiments indicate that histopathology and satellite images present a different set of problems for weakly-supervised semantic segmentation than natural scene images, such as ambiguous boundaries and class co-occurrence. Methods perform well for datasets they were developed on, but tend to perform poorly on other datasets. We present some practical techniques for these methods on unseen datasets and argue that more work is needed for a generalizable approach to weakly-supervised semantic segmentation. Our full code implementation is available on GitHub: https://github.com/lyndonchan/wsss-analysis.

Authors (3)

Lyndon Chan (3 papers)
Mahdi S. Hosseini (33 papers)
Konstantinos N. Plataniotis (109 papers)

Citations (92)

View on Semantic Scholar

Summary

Analyzing Weakly-Supervised Semantic Segmentation Across Image Domains

The paper conducted by Chan et al. presents a thorough evaluation of weakly-supervised semantic segmentation (WSSS) methodologies across various image domains, namely natural scene, histopathology, and satellite images. The primary motivation stems from the practical benefits of weak supervision, where image-level annotations, significantly less expensive than pixel-level labeling required for full supervision, still enable the training of segmentation algorithms. This research questions the generalizability of WSSS methods, predominantly developed for natural scene images, to other domains characterized by distinct challenges, such as ambiguous boundaries and high class co-occurrence.

Key Contributions and Findings

The paper illuminates several critical insights through its comprehensive experimental setup:

Dataset and Method Selection: The authors selected three representative datasets—Atlas of Digital Pathology (ADP), PASCAL VOC2012 for natural scenes, and DeepGlobe for satellite images. They evaluated state-of-the-art WSSS methods: SEC, DSRG, and HistoSegNet. SEC and DSRG, created originally for natural scenes, utilize a self-supervised learning approach where initial coarse seeds from a classification network are refined through a segmentation network. HistoSegNet, designed for histopathology, employs a simpler method, refining class activation maps with a dense CRF post-processing step.
Numerical Results: The empirical results suggest that methods perform optimally on their intended dataset domains, with SEC and DSRG showing superior performance on natural scenes while HistoSegNet excels on histopathological images. Notably, a method's success heavily depends on the sparsity of its initial class activation cues relative to the ground-truth segments.
Effectiveness of Self-Supervised Learning: While effective on datasets where initial cues have low recall (e.g., natural scenes), self-supervised strategies tend to degrade performance when the cues already cover a significant portion of the intended segments, as observed in histopathology datasets. The implication is that a non-self-supervised approach might best suit datasets exhibiting high initial recall.
Class Co-occurrence Challenge: High class co-occurrence was identified as a substantial obfuscating factor particularly in satellite imagery datasets. The authors demonstrated that by reducing image-label co-occurrence during training, the semantic segmentation performance improved for affected classes. This suggests a need for methodologies that can effectively discern and segment over-abundant co-occurring classes.

Broader Implications and Future Directions

This paper delivers actionable insights on the adaptation of WSSS methodologies across varying image types, establishing a foundational understanding that could promote the development of more domain-agnostic segmentation methods. Key implications include:

Cross-Domain Adaptation: There is a demand for novel segmentation approaches tailored to diverse image datasets, especially those that can address domain-specific challenges such as contour ambiguity and abundant class overlaps.
Methodological Innovation: Future work should focus on integrating sophisticated class co-occurrence management and leveraging domain-specific annotation structures — for instance, hierarchical annotations in histopathology.

In conclusion, while current WSSS methodologies have shown competence within their original domains, this paper by Chan et al. reveals the intricacies of cross-domain application, highlighting the necessity for broader approaches and innovative methodologies that transcend image domain boundaries.

PDF Markdown

Related Papers

Find Related Papers

GitHub

GitHub - lyndonchan/wsss-analysis: A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains (IJCV submission) (49 stars)