Face Mask Detection using Transfer Learning of InceptionV3 (2009.08369v2)

Published 17 Sep 2020 in cs.CV and eess.IV

Abstract: The world is facing a huge health crisis due to the rapid transmission of coronavirus (COVID-19). Several guidelines were issued by the World Health Organization (WHO) for protection against the spread of coronavirus. According to WHO, the most effective preventive measure against COVID-19 is wearing a mask in public places and crowded areas. It is very difficult to monitor people manually in these areas. In this paper, a transfer learning model is proposed to automate the process of identifying the people who are not wearing mask. The proposed model is built by fine-tuning the pre-trained state-of-the-art deep learning model, InceptionV3. The proposed model is trained and tested on the Simulated Masked Face Dataset (SMFD). Image augmentation technique is adopted to address the limited availability of data for better training and testing of the model. The model outperformed the other recently proposed approaches by achieving an accuracy of 99.9% during training and 100% during testing.

Citations (184)

View on Semantic Scholar

Summary

The paper presents a robust methodology for automating face mask detection using InceptionV3 transfer learning fine-tuned on an augmented Simulated Masked Face Dataset.
The model achieved exceptional performance metrics, including 99.92% training and 100% testing accuracies, outperforming various benchmark methods.
This system holds significant potential for real-world public health applications and can be further enhanced by expanding the dataset and integrating facial recognition capabilities.

Face Mask Detection using Transfer Learning of InceptionV3

This paper presents a robust methodology for automating the detection of individuals not wearing face masks in public spaces, leveraging the state-of-the-art deep learning model, InceptionV3, via transfer learning. The work highlights the critical need for efficient technological solutions in enforcing health guidelines established to mitigate the spread of COVID-19, particularly in crowded environments.

Methodological Overview and Dataset

The authors employ the Simulated Masked Face Dataset (SMFD), comprising 1570 images, equally divided between masked and unmasked faces. To counteract the limited dataset size driven by concerns over privacy and data security, image augmentation techniques were applied. This augmentation includes variations such as shearing, contrasting, flipping, and zooming, which substantially boosts the training dataset's diversity and aids the model in generalizing effectively.

The paper innovatively fine-tunes InceptionV3—a convolutional neural network pre-trained on ImageNet—by restructuring its final layers to incorporate average pooling, flattening, densely connected layers with ReLU activation and dropout, culminating in a softmax-activated dense layer for binary classification.

Experimental Results

The model demonstrates exceptional performance metrics, achieving 99.92% and 100% accuracies for training and testing phases, respectively. This demonstrates a significant improvement compared to several benchmarks within the domain, including Decision Trees, SVM, MobileNet variants, VGG16, and others. In terms of other performance measures such as Precision, Specificity, Intersection over Union (IoU), and Matthews Correlation Coefficient (MCC), the model also scores highly across the board, underscoring the efficacy of transfer learning in this application.

Implications and Future Directions

The deployment of this face mask detection system holds considerable potential for real-world applications, specifically enhancing government efforts to ensure compliance with public health mandates in combating COVID-19. It exemplifies how AI can play a pivotal role in public health and safety by providing automated, scalable surveillance solutions.

In future research, the authors suggest expanding the dataset and integrating the system with facial recognition capabilities to improve the accuracy and applicability further. Such extensions could facilitate refined identification processes, including the classification of mask types, which presents additional value in both workplace settings and broader community contexts.

Conclusion

By incorporating transfer learning with InceptionV3, the authors have effectively addressed the challenges posed by limited data availability and developed a high-performing model for real-time mask detection. This work not only contributes a valuable AI solution to ongoing public health efforts but also lays the groundwork for subsequent innovations in surveillance technology and public safety.