Papers

Topics

Authors

Recent

View all

Assistant

AI Research Assistant

Well-researched responses based on relevant abstracts and paper content.

Custom Instructions Pro

Preferences or requirements that you'd like Emergent Mind to consider when generating responses.

Gemini 2.5 Flash

Gemini 2.5 Flash 80 tok/s

Gemini 2.5 Pro 60 tok/s Pro

GPT-5 Medium 23 tok/s Pro

GPT-5 High 26 tok/s Pro

GPT-4o 87 tok/s Pro

Kimi K2 173 tok/s Pro

GPT OSS 120B 433 tok/s Pro

Claude Sonnet 4 36 tok/s Pro

2000 character limit reached

Fine-Grained Visual Classification of Aircraft (1306.5151v1)

Published 21 Jun 2013 in cs.CV

Abstract: This paper introduces FGVC-Aircraft, a new dataset containing 10,000 images of aircraft spanning 100 aircraft models, organised in a three-level hierarchy. At the finer level, differences between models are often subtle but always visually measurable, making visual recognition challenging but possible. A benchmark is obtained by defining corresponding classification tasks and evaluation protocols, and baseline results are presented. The construction of this dataset was made possible by the work of aircraft enthusiasts, a strategy that can extend to the study of number of other object classes. Compared to the domains usually considered in fine-grained visual classification (FGVC), for example animals, aircraft are rigid and hence less deformable. They, however, present other interesting modes of variation, including purpose, size, designation, structure, historical style, and branding.

Citations (1,975)

View on Semantic Scholar

Summary

The paper introduces the FGVC-Aircraft dataset with 10,000 images covering 100 model variants, 70 families, and 30 manufacturers.
It details a robust methodology using diversity maximization and crowdsourced bounding box annotations to ensure high-quality data.
A baseline nonlinear SVM with multi-scale dense SIFT achieved 48.69% accuracy on model variant recognition, setting a benchmark for FGVC research.

Fine-Grained Visual Classification of Aircraft: An Overview

The paper "Fine-Grained Visual Classification of Aircraft" presents a specialized dataset named FGVC-Aircraft which is designed to address the challenges in fine-grained visual classification (FGVC) within the domain of aircraft recognition. The necessity of such a dataset stems from the intricacies involved in distinguishing between similar yet distinct visual classes. The dataset encompasses 10,000 images across 100 aircraft models, organized into a hierarchical taxonomy ranging from manufacturers, families, and down to model variants.

Dataset Composition and Structure

FGVC-Aircraft is meticulously organized into a three-tier hierarchy:

Model Variants: The most granular level, consisting of 100 distinct variants.
Families: Consists of 70 groupings, where each family encompasses several closely related model variants.
Manufacturers: The broadest category, including 30 manufacturers who produce these aircraft.

Each image is annotated with its model and bounding box, facilitating precise identification tasks. The dataset is equally partitioned into training, validation, and test sets, ensuring a balanced and comprehensive evaluation framework.

Data Collection and Curation Methodology

The dataset leverages images from aircraft spotters and enthusiasts, capitalizing on the robust annotations these communities provide. The images were sourced primarily from Airliners.net, leading to a diverse collection spanning several years and various regions. This approach, however, necessitated explicit consent from photographers, thus ensuring the ethical use of the data for research purposes.

A critical step in curating the dataset was maximizing diversity. Given the potential regional and temporal biases inherent in using data from a limited number of photographers, the authors employed a diversity maximization algorithm. This method ensured minimal correlation between the images by considering the time, location, airliner, and photographer information. The bounding box annotations were crowdsourced via Amazon Mechanical Turk, ensuring a high-quality annotation process through redundancy and selection based on overlap criteria.

Evaluation and Benchmarking

The paper defines three primary classification tasks within the dataset:

Model Variant Recognition
Family Recognition
Manufacturer Recognition

The performance metric employed is the class-normalized average accuracy, which provides a balanced view of classifier performance across categories. For a robust benchmarking foundation, a strong baseline classifier is evaluated. This classifier, based on a nonlinear SVM with a $\chi^2$ kernel using multi-scale dense SIFT features, achieves an average accuracy of 48.69% on model variant recognition.

Implications and Future Prospects

Noteworthy is the potential broader applicability of the authors' approach to dataset creation. The method of leveraging enthusiast data sources and ensuring internal diversity maximization can be extrapolated to other FGVC domains such as automotive or marine vessels. FGVC-Aircraft also provides a standardized testing ground to further fine-grained classification research, offering a challenging dataset where visual distinctions are intricate and require sophisticated models to resolve.

Further development goals include expanding the dataset by incorporating more models as photographer contributions increase. Additionally, the construction methodologies outlined could form a template for similar datasets in other fine-grained domains.

Conclusion

"Fine-Grained Visual Classification of Aircraft" introduces a novel and meticulously curated dataset that addresses the complexities inherent in fine-grained visual classification within the field of aircraft recognition. By leveraging the efforts of aircraft enthusiasts and integrating a rigorous annotation process, this dataset sets a high standard for FGVC research, opening avenues for exploring subtle visual distinctions across expansive visual categories.