Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Recent Advances in Scene Image Representation and Classification (2206.07326v2)

Published 15 Jun 2022 in cs.CV

Abstract: With the rise of deep learning algorithms nowadays, scene image representation methods have achieved a significant performance boost in classification. However, the performance is still limited because the scene images are mostly complex having higher intra-class dissimilarity and inter-class similarity problems. To deal with such problems, there have been several methods proposed in the literature with their advantages and limitations. A detailed study of previous works is necessary to understand their advantages and disadvantages in image representation and classification problems. In this paper, we review the existing scene image representation methods that are being widely used for image classification. For this, we, first, devise the taxonomy using the seminal existing methods proposed in the literature to this date {using deep learning (DL)-based, computer vision (CV)-based, and search engine (SE)-based methods}. Next, we compare their performance both qualitatively (e.g., quality of outputs, pros/cons, etc.) and quantitatively (e.g., accuracy). Last, we speculate on the prominent research directions in scene image representation tasks using {keyword growth and timeline analysis.} Overall, this survey provides in-depth insights and applications of recent scene image representation methods under three different methods.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (15)
  1. McCulloch WS, Pitts W (1943) A logical calculus of the ideas immanent in nervous activity. The bulletin of mathematical biophysics 5(4):115–133
  2. Anu E, Anu K (2016) A survey on scene recognition. Int J Sci Eng Technol Res(IJSETR) 5:64–68
  3. Shahi TB, Sitaula C (2021) Natural language processing for nepali text: a review. Artificial Intelligence Review pp 1–29
  4. Ringnér M (2008) What is principal component analysis? Nature biotechnology 26(3):303–304
  5. Oliva A, Torralba A (2001) Modeling the shape of the scene: a holistic representation of the spatial envelope. Int J Comput Vis 42(3):145–175
  6. Wu J, Rehg JM (2011) Centrist: a visual descriptor for scene categorization. IEEE Trans Pattern Anal Mach Intell 33(8):1489–1501
  7. Sitaula C, Shahi TB (2022) Monkeypox virus detection using pre-trained deep learning-based approaches. Journal of Medical Systems 46(11):1–9
  8. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:14091556 1409.1556
  9. Guo Y, Lew MS (2016) Bag of Surrogate Parts: one inherent feature of deep cnns. In: Proc. of the British Machine Vision Conference (BMVC)
  10. Wang D, Mao K (2019) Task-generic semantic convolutional neural network for web text-aided image classification. Neurocomputing 329:103–115
  11. Kim Y (2014) Convolutional neural networks for sentence classification. arXiv preprint arXiv:14085882
  12. Wang D, Mao K (2019) Learning semantic text features for web text-aided image classification. IEEE Trans Multimedia 21(12):2985–2996
  13. Liu S, Tian G (2019) An indoor scene classification method for service robot based on cnn feature. Journal of Robotics 2019
  14. Bai S (2017) Growing random forest on deep convolutional neural networks for scene categorization. Expert systems with applications 71:279–287
  15. Aria M, Cuccurullo C (2017) bibliometrix: An r-tool for comprehensive science mapping analysis. Journal of informetrics 11(4):959–975
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Chiranjibi Sitaula (18 papers)
  2. Tej Bahadur Shahi (3 papers)
  3. Faezeh Marzbanrad (10 papers)
  4. Jagannath Aryal (10 papers)
Citations (8)