Recent Advances in Scene Image Representation and Classification (2206.07326v2)
Abstract: With the rise of deep learning algorithms nowadays, scene image representation methods have achieved a significant performance boost in classification. However, the performance is still limited because the scene images are mostly complex having higher intra-class dissimilarity and inter-class similarity problems. To deal with such problems, there have been several methods proposed in the literature with their advantages and limitations. A detailed study of previous works is necessary to understand their advantages and disadvantages in image representation and classification problems. In this paper, we review the existing scene image representation methods that are being widely used for image classification. For this, we, first, devise the taxonomy using the seminal existing methods proposed in the literature to this date {using deep learning (DL)-based, computer vision (CV)-based, and search engine (SE)-based methods}. Next, we compare their performance both qualitatively (e.g., quality of outputs, pros/cons, etc.) and quantitatively (e.g., accuracy). Last, we speculate on the prominent research directions in scene image representation tasks using {keyword growth and timeline analysis.} Overall, this survey provides in-depth insights and applications of recent scene image representation methods under three different methods.
- McCulloch WS, Pitts W (1943) A logical calculus of the ideas immanent in nervous activity. The bulletin of mathematical biophysics 5(4):115–133
- Anu E, Anu K (2016) A survey on scene recognition. Int J Sci Eng Technol Res(IJSETR) 5:64–68
- Shahi TB, Sitaula C (2021) Natural language processing for nepali text: a review. Artificial Intelligence Review pp 1–29
- Ringnér M (2008) What is principal component analysis? Nature biotechnology 26(3):303–304
- Oliva A, Torralba A (2001) Modeling the shape of the scene: a holistic representation of the spatial envelope. Int J Comput Vis 42(3):145–175
- Wu J, Rehg JM (2011) Centrist: a visual descriptor for scene categorization. IEEE Trans Pattern Anal Mach Intell 33(8):1489–1501
- Sitaula C, Shahi TB (2022) Monkeypox virus detection using pre-trained deep learning-based approaches. Journal of Medical Systems 46(11):1–9
- Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:14091556 1409.1556
- Guo Y, Lew MS (2016) Bag of Surrogate Parts: one inherent feature of deep cnns. In: Proc. of the British Machine Vision Conference (BMVC)
- Wang D, Mao K (2019) Task-generic semantic convolutional neural network for web text-aided image classification. Neurocomputing 329:103–115
- Kim Y (2014) Convolutional neural networks for sentence classification. arXiv preprint arXiv:14085882
- Wang D, Mao K (2019) Learning semantic text features for web text-aided image classification. IEEE Trans Multimedia 21(12):2985–2996
- Liu S, Tian G (2019) An indoor scene classification method for service robot based on cnn feature. Journal of Robotics 2019
- Bai S (2017) Growing random forest on deep convolutional neural networks for scene categorization. Expert systems with applications 71:279–287
- Aria M, Cuccurullo C (2017) bibliometrix: An r-tool for comprehensive science mapping analysis. Journal of informetrics 11(4):959–975
- Chiranjibi Sitaula (18 papers)
- Tej Bahadur Shahi (3 papers)
- Faezeh Marzbanrad (11 papers)
- Jagannath Aryal (10 papers)