Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
139 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Model Evaluation for Domain Identification of Unknown Classes in Open-World Recognition: A Proposal (2312.05454v1)

Published 9 Dec 2023 in cs.CV, cs.AI, and cs.LG

Abstract: Open-World Recognition (OWR) is an emerging field that makes a machine learning model competent in rejecting the unknowns, managing them, and incrementally adding novel samples to the base knowledge. However, this broad objective is not practical for an agent that works on a specific task. Not all rejected samples will be used for learning continually in the future. Some novel images in the open environment may not belong to the domain of interest. Hence, identifying the unknown in the domain of interest is essential for a machine learning model to learn merely the important samples. In this study, we propose an evaluation protocol for estimating a model's capability in separating unknown in-domain (ID) and unknown out-of-domain (OOD). We evaluated using three approaches with an unknown domain and demonstrated the possibility of identifying the domain of interest using the pre-trained parameters through traditional transfer learning, Automated Machine Learning (AutoML), and Nearest Class Mean (NCM) classifier with First Integer Neighbor Clustering Hierarchy (FINCH). We experimented with five different domains: garbage, food, dogs, plants, and birds. The results show that all approaches can be used as an initial baseline yielding a good accuracy. In addition, a Balanced Accuracy (BACCU) score from a pre-trained model indicates a tendency to excel in one or more domains of interest. We observed that MobileNetV3 yielded the highest BACCU score for the garbage domain and surpassed complex models such as the transformer network. Meanwhile, our results also suggest that a strong representation in the pre-trained model is important for identifying unknown classes in the same domain. This study could open the bridge toward open-world recognition in domain-specific tasks where the relevancy of the unknown classes is vital.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (43)
  1. A deep learning approach for medical waste classification. Scientific Reports, 12, 2022.
  2. Dog breed classification based on deep learning. In 2020 13th International Symposium on Computational Intelligence and Design (ISCID), pages 209–212, 2020.
  3. Plant species classification using deep convolutional neural network. Biosystems Engineering, 151:72–80, November 2016.
  4. Learning and the unknown: Surveying steps toward open world recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01):9801–9807, Jul. 2019.
  5. Evaluation of various open-set medical imaging tasks with deep neural networks. ArXiv, abs/2110.10888, 2021.
  6. Evidential deep learning for open set action recognition. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 13329–13338, 2021.
  7. Towards open set recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35:1757–1772, 2013.
  8. Zhiyuan Chen and B. Liu. Lifelong machine learning, second edition. Synthesis Lectures on Artificial Intelligence and Machine Learning, 2018.
  9. Recent advances in open set recognition: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(10):3614–3631, 2021.
  10. Continual learning for robotics: Definition, framework, learning strategies, opportunities and challenges. Information Fusion, 58:52–68, 2020.
  11. Imagenet: A large-scale hierarchical image database. In CVPR, 2009.
  12. Towards Open Domain-Specific Recognition using Quad-Channel Self-Attention Reciprocal Point Learning and Autoencoder. 6 2023.
  13. Uc-owod: Unknown-classified open world object detection. ArXiv, abs/2207.11455, 2022.
  14. Detecting the unknown in object detection. ArXiv, abs/2208.11641, 2022.
  15. Adversarially learned one-class classifier for novelty detection. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3379–3388, 2018.
  16. Csi: Novelty detection via contrastive learning on distributionally shifted instances. ArXiv, abs/2007.08176, 2020.
  17. Deep anomaly detection using geometric transformations. In S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 31. Curran Associates, Inc., 2018.
  18. A unified survey on anomaly, novelty, open-set, and out-of-distribution detection: Solutions and future challenges. ArXiv, abs/2110.14051, 2021.
  19. Towards open world recognition. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1893–1902, 2015.
  20. Boosting deep open world recognition by clustering. IEEE Robotics and Automation Letters, 5(4):5985–5992, 2020.
  21. An open-world time-series sensing framework for embedded edge devices. In 2022 IEEE 28th International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA), pages 61–70, 2022.
  22. A lightweight open-world pest image classifier using ResNet8-based matching network and NT-Xent loss function. Expert Systems with Applications, 237:121395, March 2024.
  23. Knowledge is never enough: Towards web aided deep open world recognition. In 2019 International Conference on Robotics and Automation (ICRA), pages 9537–9543, 2019.
  24. Multi-stage deep classifier cascades for open world recognition. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management, CIKM 19, page 179–188, New York, NY, USA, 2019. Association for Computing Machinery.
  25. Open-world learning and application to product classification. In The World Wide Web Conference, WWW 19, page 3413–3419, New York, NY, USA, 2019. Association for Computing Machinery.
  26. Open world long-tailed data classification through active distribution optimization. Expert Systems with Applications, 213:119054, 2023.
  27. Deep open set recognition using dynamic intra-class splitting. SN Comput. Sci., 1(2), mar 2020.
  28. Quad-channel contrastive prototype networks for open-set recognition in domain-specific tasks. IEEE Access, 11:48578–48592, 2023.
  29. The balanced accuracy and its posterior distribution. 2010 20th International Conference on Pattern Recognition, pages 3121–3124, 2010.
  30. Efficient parameter-free clustering using first neighbor relations. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2019.
  31. Classification of Trash for Recyclability Status. https://cs229.stanford.edu/proj2016/report/ThungYang-ClassificationOfTrashForRecyclabilityStatus-report.pdf, 2016.
  32. Mostafa Mohamed. Garbage Classification (12 classes). https://www.kaggle.com/datasets/mostafaabla/garbage-classification, 2022.
  33. Recognition of multiple-food images by detecting candidate regions. In Proc. of IEEE International Conference on Multimedia and Expo (ICME), 2012.
  34. Novel dataset for fine-grained image categorization: Stanford dogs. In First Workshop on Fine-Grained Visual Categorization, IEEE Conference on Computer Vision and Pattern Recognition, Colorado Springs, CO, June 2011.
  35. VNPlant-200 – A Public and Large-Scale of Vietnamese Medicinal Plant Images Dataset. In Integrated Science in Digital Age 2020, pages 406–411. Springer International Publishing, May 2020.
  36. Gerry. BIRDS 525 SPECIES - IMAGE CLASSIFICATION. https://www.kaggle.com/datasets/gpiosenka/100-bird-species, 2022.
  37. Searching for MobileNetv3. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE, October 2019.
  38. Efficientnetv2: Smaller models and faster training. In Marina Meila and Tong Zhang, editors, Proceedings of the 38th International Conference on Machine Learning, volume 139 of Proceedings of Machine Learning Research, pages 10096–10106. PMLR, 18–24 Jul 2021.
  39. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016.
  40. Wide residual networks. ArXiv, abs/1605.07146, 2016.
  41. An image is worth 16x16 words: Transformers for image recognition at scale. ArXiv, abs/2010.11929, 2020.
  42. Swin transformer v2: Scaling up capacity and resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 12009–12019, June 2022.
  43. Swish: a self-gated activation function. arXiv: Neural and Evolutionary Computing, 2017.

Summary

We haven't generated a summary for this paper yet.