Impact of Fully Connected Layers on Performance of Convolutional Neural Networks for Image Classification
The research paper titled "Impact of Fully Connected Layers on Performance of Convolutional Neural Networks for Image Classification" analyzes the influence of fully connected (FC) layers on the performance of convolutional neural networks (CNNs) in image classification tasks. The work is motivated by the observation that designing dataset-specific CNN architectures typically relies on experience and expertise, a process that is both time-consuming and error-prone. To address this challenge, the authors focus on the relationship between FC layers and dataset characteristics, examining factors such as network depth and dataset width.
The paper systematically assesses the impact of deeper versus shallower architectures on CNN performance, particularly in relation to the FC layers. It also evaluates how deeper or wider datasets influence CNN performance with respect to these layers. The authors use three CNN architectures, varying in depth, across four popular datasets: CIFAR-10, CIFAR-100, Tiny ImageNet, and CRCHistoPhenotypes.
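The depth-versus-width distinction among these datasets can be made concrete with a small sketch. The figures below are the well-known public training-split sizes for the first three datasets, not necessarily the exact splits used in the paper; CRCHistoPhenotypes is omitted because its split depends on preprocessing choices.

```python
# Illustrative only: dataset "width" (number of classes) vs. "depth"
# (training images per class), using the standard public training splits.
datasets = {
    "CIFAR-10":      {"train_images": 50_000,  "classes": 10},
    "CIFAR-100":     {"train_images": 50_000,  "classes": 100},
    "Tiny ImageNet": {"train_images": 100_000, "classes": 200},
}

for name, d in datasets.items():
    depth = d["train_images"] // d["classes"]  # images per class
    print(f"{name}: width={d['classes']} classes, depth={depth} images/class")
```

By this measure CIFAR-10 is a deep dataset (5,000 images per class), while CIFAR-100 and Tiny ImageNet are wide ones (500 images per class spread over many more classes), which is the contrast the paper's depth/width analysis exploits.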
Key Findings
- Impact of Network Depth:
  - Deeper CNNs generally require fewer and smaller FC layers to achieve optimal performance than shallower networks. Deeper networks learn more abstract features, which reduces the need for compensatory complexity in the FC layers.
- Dataset Characteristics and Architecture Suitability:
  - Deeper datasets (those with more images per class) are better suited to deeper architectures. Conversely, wider datasets (those with more classes and fewer images per class) align better with shallower networks.
- Performance Across Datasets:
  - Shallow architectures often demand more complex FC layers when applied to wider datasets, requiring additional neurons or layers to capture patterns dispersed across a broader class set.
- Empirical Validation:
  - Empirical results on the four datasets corroborate these findings. On CIFAR-10, for instance, the deeper CNN-3 paired with a suitably sized FC configuration achieves more robust classification accuracy than its shallower counterparts.
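The depth/FC-size trade-off in the findings above can be sketched numerically. The sketch assumes a hypothetical 32×32 input where each conv block halves spatial resolution via pooling; the layer shapes are illustrative and are not the paper's actual CNN-1/CNN-2/CNN-3 architectures.

```python
# Sketch: why deeper conv stacks can get away with smaller FC layers.
# Assumption: each conv "block" halves spatial resolution (pooling) and
# ends with the given channel count. Shapes are hypothetical, not the
# exact architectures from the paper.
def flattened_size(input_hw, block_channels):
    """Size of the flattened feature vector after the conv blocks."""
    hw = input_hw
    for _ in block_channels:
        hw //= 2  # pooling halves height and width
    return hw * hw * block_channels[-1]

def fc_params(flat, hidden, num_classes):
    """Weights in one hidden FC layer plus the final classifier."""
    return flat * hidden + hidden * num_classes

shallow_flat = flattened_size(32, [64, 128])            # 2 conv blocks
deep_flat    = flattened_size(32, [64, 128, 256, 512])  # 4 conv blocks

print(shallow_flat)  # 8*8*128 = 8192
print(deep_flat)     # 2*2*512 = 2048

# The shallower network needs a larger FC stage to compensate:
print(fc_params(shallow_flat, 1024, 10))  # 8,398,848 weights (~8.4M)
print(fc_params(deep_flat, 256, 10))      # 526,848 weights (~0.53M)
```

The deeper stack hands the FC stage a much smaller, more abstract feature vector, so a modest FC layer suffices; the shallow stack's large flattened output is what drives up FC complexity on wide datasets.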
Implications and Future Directions
The findings provide valuable insights into architectural decisions in CNN design, suggesting criteria for choosing between deep and shallow networks given a dataset's specific attributes. The results imply that a more informed selection of FC layers can not only enhance model performance but also streamline the initial architecture-selection process, saving valuable time and computational resources.
Theoretically, this work contributes to our understanding of how convolutional layers, depth, and the characteristics of the FC layers interact in determining CNN performance. Practically, the guidelines emerging from this paper can significantly impact image classification tasks across different disciplines by optimizing model architecture for specific datasets.
Future directions could involve extending this analysis beyond the static characteristics of the data, perhaps incorporating dynamic dataset features or exploring the effect of emerging CNN architectures, such as those incorporating attention mechanisms. Additionally, similar studies could be carried out in other neural network domains like natural language processing, where comparable architecture design issues exist.
In summary, this research elucidates crucial considerations in the intersection of FC layers, CNN architecture depth, and dataset characteristics—offering a structured approach to architectural design in CNNs for image classification.