
Exploring Different Levels of Supervision for Detecting and Localizing Solar Panels on Remote Sensing Imagery

Published 19 Sep 2023 in cs.CV | (2309.10421v1)

Abstract: This study investigates object presence detection and localization in remote sensing imagery, focusing on solar panel recognition. We explore different levels of supervision, evaluating three models: a fully supervised object detector, a weakly supervised image classifier with CAM-based localization, and a minimally supervised anomaly detector. The classifier excels in binary presence detection (0.79 F1-score), while the object detector (0.72) offers precise localization. The anomaly detector requires more data for viable performance. Fusion of model results shows potential accuracy gains. CAM impacts localization modestly, with GradCAM, GradCAM++, and HiResCAM yielding superior results. Notably, the classifier remains robust with less data, in contrast to the object detector.


Summary

  • The paper shows that CAM-based classifiers achieve superior presence detection (F1=0.791) compared to fully supervised methods, offering scalable annotation efficiency.
  • It demonstrates that fully supervised object detection achieves the highest localization accuracy (DICE up to 0.810) with detailed box-level annotations using Faster R-CNN.
  • The study finds that anomaly detection underperforms due to high false positives, underscoring the need for improved normality modeling and potential fusion approaches.

Comparative Analysis of Supervision Levels for Solar Panel Detection and Localization in Remote Sensing Imagery

Introduction

The proliferation of remote sensing data enables scalable environmental assessments but necessitates efficient extraction of actionable information regarding objects of interest, such as solar panels. This work presents a rigorous comparative evaluation of three recognition frameworks—object detection, image classification utilizing CAM, and anomaly detection—each operating under distinct supervision regimes. The explicit focus is to quantify and analyze the trade-offs between label granularity, detection/localization accuracy, and annotation cost for solar panel recognition in high-resolution satellite imagery.

Figure 1: Examples of remote sensing imagery with prominent distractors complicating object detection and localization.

Dataset and Preprocessing

The evaluation leverages the "Distributed Solar Photovoltaic Array Location and Extent Data Set for Remote Sensing Object Identification," composed of 601 RGB satellite images (0.33 m GSD, 5000×5000 pixels) from four Californian cities. Solar panel instances are polygonally annotated, affording fine-grained ground truth for localization. Preprocessing includes non-overlapping cropping into 200×200 pixel tiles, aggressive class balancing, polygon-to-box conversion for detection, and binary image labeling for classification tasks.

Figure 3: Dataset illustration showing the progression from large high-res images to tile-level annotations with and without solar panels.
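To make the tiling step concrete, the following is a minimal sketch of non-overlapping cropping into 200×200 tiles; the NumPy interface and the handling of border remainders are assumptions, as the paper does not specify them.

```python
import numpy as np

def tile_image(image: np.ndarray, tile: int = 200) -> list[np.ndarray]:
    """Split a large scene (H, W, C) into non-overlapping tile x tile crops.

    Edge remainders that do not fill a whole tile are discarded here;
    how the paper handles borders is not stated, so this is an assumption.
    """
    h, w = image.shape[:2]
    crops = []
    for y in range(0, h - tile + 1, tile):
        for x in range(0, w - tile + 1, tile):
            crops.append(image[y:y + tile, x:x + tile])
    return crops

# Example: a 5000x5000 RGB scene yields 25 x 25 = 625 tiles of 200x200 pixels.
scene = np.zeros((5000, 5000, 3), dtype=np.uint8)
tiles = tile_image(scene)
assert len(tiles) == 625
```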

Methods

Fully-Supervised Object Detection

The Faster R-CNN architecture receives box-level supervision, optimized using the Adam optimizer with weighted cross-entropy and Smooth L1 localization losses. Post-processing converts predictions into binary segmentation masks for metric computation.
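A minimal fine-tuning sketch using torchvision's Faster R-CNN is shown below; the learning rate, the data loader interface, and the box-to-mask helper are assumptions, and the paper's weighted cross-entropy classification loss would require modifying the ROI head, which is omitted here.

```python
import torch
import torchvision
from torchvision.models.detection.faster_rcnn import FastRCNNPredictor

# Two classes: background and solar panel.
model = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights="DEFAULT")
in_features = model.roi_heads.box_predictor.cls_score.in_features
model.roi_heads.box_predictor = FastRCNNPredictor(in_features, num_classes=2)

optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)  # learning rate is an assumption

def train_one_epoch(model, data_loader, optimizer):
    """One epoch of standard Faster R-CNN training (losses summed and back-propagated)."""
    model.train()
    for images, targets in data_loader:  # assumed: lists of image tensors and {"boxes", "labels"} dicts
        loss_dict = model(images, targets)   # torchvision returns classification and box regression losses
        loss = sum(loss_dict.values())
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

def boxes_to_mask(boxes, scores, height=200, width=200, score_thr=0.5):
    """Rasterize predicted boxes into a binary mask so DICE/IoU can be computed."""
    mask = torch.zeros((height, width), dtype=torch.uint8)
    for box, score in zip(boxes, scores):
        if score >= score_thr:
            x1, y1, x2, y2 = box.int().tolist()
            mask[y1:y2, x1:x2] = 1
    return mask
```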

Weakly-Supervised Classification with CAM

A ResNet-50-based classifier operates with binary image-level presence labels. Localization is approximated via several CAM variants (GradCAM, GradCAM++, HiResCAM, FullGrad, EigenCAM, EigenGradCAM), converting activation heatmaps to segmentation maps through thresholding.
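A minimal sketch of this pipeline, assuming the pytorch-grad-cam library and a fixed 0.5 threshold for converting heatmaps into masks (the paper calibrates thresholds rather than fixing them):

```python
import numpy as np
import torch
from torchvision.models import resnet50
from pytorch_grad_cam import GradCAM
from pytorch_grad_cam.utils.model_targets import ClassifierOutputTarget

# Binary presence classifier: ResNet-50 with a 2-way head (no panels / panels).
model = resnet50(weights="IMAGENET1K_V2")
model.fc = torch.nn.Linear(model.fc.in_features, 2)

# GradCAM on the last convolutional block; GradCAMPlusPlus, HiResCAM, etc. are drop-in alternatives.
cam = GradCAM(model=model, target_layers=[model.layer4[-1]])

input_tensor = torch.randn(1, 3, 200, 200)  # placeholder for a normalized 200x200 tile
heatmap = cam(input_tensor=input_tensor,
              targets=[ClassifierOutputTarget(1)])[0]  # (H, W) activation map scaled to [0, 1]

# Threshold the heatmap into a rough localization mask; 0.5 is an assumed cut-off.
mask = (heatmap >= 0.5).astype(np.uint8)
```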

Minimally-Supervised Anomaly Detection

A VAE is parameterized with a ResNet-50 encoder and a symmetric deconvolutional decoder. Trained solely on negatives (images without solar panels), anomaly maps are derived from reconstruction errors, post-processed using CAM for spatial localization of potential panels.
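As a hedged illustration, the sketch below derives a pixel-wise anomaly map from reconstruction error; the `vae` interface (returning reconstruction, mean, and log-variance) is an assumption, and the paper's ResNet-50 encoder, deconvolutional decoder, and CAM post-processing are not reproduced.

```python
import torch
import torch.nn.functional as F

def anomaly_map(vae: torch.nn.Module, image: torch.Tensor) -> torch.Tensor:
    """Pixel-wise reconstruction error as an anomaly heatmap.

    `vae` is assumed to return (reconstruction, mu, logvar) with the
    reconstruction matching the input shape (B, C, H, W).
    """
    vae.eval()
    with torch.no_grad():
        reconstruction, _, _ = vae(image)
    # Squared error averaged over channels gives one score per pixel.
    error = F.mse_loss(reconstruction, image, reduction="none").mean(dim=1)
    # Min-max normalize to [0, 1] so a single threshold can flag panel-like regions.
    error = (error - error.min()) / (error.max() - error.min() + 1e-8)
    return error  # shape: (B, H, W)
```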

Evaluation Protocol

Presence detection uses F1-score, while localization utilizes DICE and IoU restricted to true positives. The analysis encompasses hyperparameter sensitivity, threshold calibrations, and systematic decrease in training data to probe data efficiency.
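A minimal sketch of the localization metrics on binary masks, assuming NumPy arrays; per the protocol above, it would be applied only to true-positive tiles.

```python
import numpy as np

def dice_and_iou(pred_mask: np.ndarray, gt_mask: np.ndarray, eps: float = 1e-8):
    """DICE and IoU between a predicted and a ground-truth binary mask."""
    pred = pred_mask.astype(bool)
    gt = gt_mask.astype(bool)
    intersection = np.logical_and(pred, gt).sum()
    dice = 2.0 * intersection / (pred.sum() + gt.sum() + eps)
    iou = intersection / (np.logical_or(pred, gt).sum() + eps)
    return float(dice), float(iou)
```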

Results

Detection and Localization Performance

Classification with CAM yields superior presence detection (F1 = 0.791), even outperforming fully supervised detection (F1 = 0.720). Conversely, Faster R-CNN achieves the most precise localization (DICE = 0.722 with polygons, 0.810 with boxes), benefiting from spatially explicit supervision.

Figure 5: Localization results from the object detector, with both correct (a-c) and incorrect (d-f) predictions.

CAM-based classifiers, particularly with GradCAM++ and HiResCAM, localize solar panels with moderate accuracy (DICE ≈ 0.39). Qualitative analysis reveals that they reliably activate on solar panel regions but may focus on only a subset of panels when multiple instances exist per image, a limitation rooted in the global image-level supervision.

Figure 7: Detection explanations via GradCAM for correct (a-c) and incorrect (d-f) classifications.

Figure 2: False positive classifier activations where strong visual similarities to solar panels result in errors likely to confound both models and humans.

Figure 4: CAM-based heatmap visualizations with diverse explanatory methods for imagery containing numerous solar panels.

The anomaly detector exhibits poor detection and localization performance (F1 = 0.168, DICE = 0.174) and produces considerable false positives, routinely flagging swimming pools and other rare objects as anomalies, reflecting the intrinsic ambiguity of one-class learning without positive exemplars.

Figure 6: Anomaly detection heatmaps highlight both solar panels (a-c) and frequent spurious anomalies such as swimming pools (d-f).

Error Asymmetry and Model Complementarity

Error analysis reveals that the classifier and the detector often err on different samples, while the anomaly detector's failures are largely independent of both. This asymmetry suggests that complementary decision fusion could modestly enhance performance.
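A minimal sketch of such a fusion rule is given below; the OR combination and the 0.5 thresholds are illustrative assumptions, not the fusion scheme evaluated in the paper.

```python
import numpy as np

def fused_presence(cls_scores, det_scores, cls_thr=0.5, det_thr=0.5):
    """OR-style decision fusion: flag a tile if either model is confident enough."""
    cls_pred = np.asarray(cls_scores) >= cls_thr
    det_pred = np.asarray(det_scores) >= det_thr
    return np.logical_or(cls_pred, det_pred)
```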

Data Efficiency

The classification model demonstrates remarkable data efficiency, retaining moderate accuracy with significantly reduced training data, whereas the detection and anomaly models degrade rapidly below 40% of the data. At 40% of the data, classification attains F1 = 0.643, compared to 0.314 for detection and 0.140 for anomaly detection.

Computational Efficiency

Classification models train and evaluate substantially faster than detection or anomaly systems. The choice of CAM method directly impacts test-time cost—HiResCAM, EigenCAM, and FullGrad increase evaluation time significantly compared to GradCAM-derived explanations.

Implications and Future Directions

The empirical findings underscore the practical value of weakly supervised classification with CAM-based explainability for large-scale solar panel mapping: this approach substantially lowers annotation cost and remains effective with less labeled data, though at the cost of less precise localization. The modest influence of CAM variability indicates that the choice among the top methods (GradCAM, GradCAM++, HiResCAM) is not critical for remote sensing imagery.

Fully supervised detection remains preferable when fine localization is paramount, with the caveat of increased annotation and computational demands. Anomaly detection, without access to positive exemplars or fine-grained negative class constraints, underperforms and demands methodological advances (e.g., robust normality modeling, better out-of-distribution discrimination) to be viable for practical remote sensing.

Future exploration should consider semi-supervised or active learning to further reduce annotation burden, improved fusion strategies for leveraging complementary model strengths, and domain-adaptive explainability tailored for diverse object morphologies inherent in remote sensing.

Conclusion

This comparative study rigorously delineates the operational regimes, strengths, and limitations of detection, classification-with-CAM, and anomaly detection for solar panel identification in satellite imagery. Classification with CAMs offers a compelling balance of accuracy and annotation cost for detection-oriented tasks, while fully supervised detection provides the highest localization fidelity. The experiments reinforce the notion that supervision granularity should be dictated by application-driven localization needs and available labeling resources; multi-method approaches, potentially incorporating model fusion or semi-supervision, represent productive avenues for future research in scalable remote object recognition.

