Deep Learning vs. Traditional Computer Vision (1910.13796v1)

Published 30 Oct 2019 in cs.CV and cs.LG

Abstract: Deep Learning has pushed the limits of what was possible in the domain of Digital Image Processing. However, that is not to say that the traditional computer vision techniques which had been undergoing progressive development in years prior to the rise of DL have become obsolete. This paper will analyse the benefits and drawbacks of each approach. The aim of this paper is to promote a discussion on whether knowledge of classical computer vision techniques should be maintained. The paper will also explore how the two sides of computer vision can be combined. Several recent hybrid methodologies are reviewed which have demonstrated the ability to improve computer vision performance and to tackle problems not suited to Deep Learning. For example, combining traditional computer vision techniques with Deep Learning has been popular in emerging domains such as Panoramic Vision and 3D vision for which Deep Learning models have not yet been fully optimised

Authors (8)

Niall O' Mahony (3 papers)
Sean Campbell (4 papers)
Anderson Carvalho (2 papers)
Suman Harapanahalli (1 paper)
Gustavo Velasco-Hernandez (2 papers)
Lenka Krpalkova (3 papers)
Daniel Riordan (3 papers)
Joseph Walsh (4 papers)

Citations (828)

View on Semantic Scholar

Summary

The paper reveals that deep learning’s end-to-end training, particularly via CNNs, significantly improves image processing accuracy over manual feature extraction.
The paper highlights that traditional computer vision methods offer efficiency, transparency, and lower computational demands in resource-constrained scenarios.
The paper identifies that hybrid approaches, combining deep learning and traditional techniques, can overcome individual limitations and optimize performance in complex tasks.

Deep Learning vs. Traditional Computer Vision

The paper "Deep Learning vs. Traditional Computer Vision" by O’Mahony et al. provides a comprehensive analysis comparing the advances brought by Deep Learning (DL) with traditional computer vision (CV) techniques within the domain of digital image processing. The authors examine the strengths and weaknesses of each approach while also exploring how hybrid methodologies can be leveraged to enhance performance in specific scenarios where traditional or deep learning techniques alone are insufficient.

Comparison of Deep Learning and Traditional Computer Vision

Deep Learning Overview

Deep Learning has revolutionized digital image processing tasks such as image colorization, classification, segmentation, and detection. Convolutional Neural Networks (CNNs) have enabled significant improvements in prediction accuracy, primarily by utilizing large datasets and high computational power. Unlike traditional CV methods, DL involves end-to-end training, where neural networks automatically find the most descriptive features without explicit feature extraction, therefore requiring less human fine-tuning.

Advantages of Deep Learning

DL excels in various image processing tasks due to its ability to learn and generalize based on large datasets. Notable benefits include:

Superior Accuracy: DL models, especially CNNs, deliver high performance in image classification, segmentation, and object detection tasks.
Flexibility: These models can be re-trained for specific tasks using custom datasets.
End-to-End Learning: Reduces reliance on handcrafted feature extraction, cutting down on expert analysis time.

Traditional Computer Vision Techniques

Traditional CV techniques such as Scale-Invariant Feature Transform (SIFT), Speeded-Up Robust Features (SURF), and Hough Transforms rely on handcrafted features and global feature descriptors for various tasks. Often, these methods are combined with machine learning classifiers like Support Vector Machines or K-Nearest Neighbors.

Advantages of Traditional Computer Vision

Efficiency and Simplicity: Traditional methods can sometimes solve problems more efficiently and with fewer computational resources compared to DL.
Transparency and Control: Unlike the black-box nature of DL, traditional methods offer clear insights and greater control over the process, which can be adjusted through fine-tuning of parameters.
Non-class Specific: Techniques such as SIFT can be applied equally well to different image classes and are not dependent on large datasets.

Hybrid Approaches

Hybrid methodologies combining DL and traditional CV techniques offer several advantages in specific applications:

Performance Optimization: Traditional methods can pre-process and filter input data for DL models, optimizing resource usage.
Enhanced Capability: Hybrid approaches can handle tasks where DL alone might fail due to model limitations or lack of sufficient data, e.g., SLAM, 3D vision, and panoramic stitching.
Edge Computing Synergies: Efficient deployment in edge devices benefits from the low latency of traditional methods coupled with the high accuracy of DL.

Challenges and Solutions

Deep Learning Challenges

DL entails substantial computational and data resource requirements:

High Computational Demand: Requires significant hardware power for training (GPUs, TPUs).
Data Dependency: Needs large quantities of labeled data for effective training; limited datasets can lead to overfitting.
Long Training Times: Training complex models can be time-intensive.

The paper discusses strategies to mitigate these challenges, such as leveraging transfer learning to reduce training duration and employing pre-processing techniques to improve model training and inference efficiency.

Traditional Computer Vision Limitations

While traditional methods are powerful, they have limitations in scenarios demanding complex feature extraction or where end-to-end learning provides greater efficacy:

Feature Selection: Manually determining and extracting relevant features can be cumbersome and less effective as class variety increases.
Performance with Unseen Data: Traditional methods may struggle with generalization beyond the training data compared to DL.

Emerging Fields and Future Directions

Areas such as 3D vision, SLAM, and panoramic imaging continue to pose challenges where traditional techniques and DL can be effectively combined. Developments like Geometric Deep Learning offer potential pathways for incorporating 3D data within DL frameworks. Similarly, in applications like facial recognition or autonomous navigation, hybrid models can enhance performance by leveraging the strengths of both paradigms.

Conclusion

The exploration by O’Mahony et al. underscores the continued relevance of traditional CV techniques even as DL transforms the field of computer vision. The hybridization of both approaches not only compensates for the weaknesses inherent in each method but also paves the way for more robust, versatile, and efficient solutions across various image processing applications.