Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Heterogeneous Semantic Transfer for Multi-label Recognition with Partial Labels (2205.11131v4)

Published 23 May 2022 in cs.CV

Abstract: Multi-label image recognition with partial labels (MLR-PL), in which some labels are known while others are unknown for each image, may greatly reduce the cost of annotation and thus facilitate large-scale MLR. We find that strong semantic correlations exist within each image and across different images, and these correlations can help transfer the knowledge possessed by the known labels to retrieve the unknown labels and thus improve the performance of the MLR-PL task (see Figure 1). In this work, we propose a novel heterogeneous semantic transfer (HST) framework that consists of two complementary transfer modules that explore both within-image and cross-image semantic correlations to transfer the knowledge possessed by known labels to generate pseudo labels for the unknown labels. Specifically, an intra-image semantic transfer (IST) module learns an image-specific label co-occurrence matrix for each image and maps the known labels to complement the unknown labels based on these matrices. Additionally, a cross-image transfer (CST) module learns category-specific feature-prototype similarities and then helps complement the unknown labels that have high degrees of similarity with the corresponding prototypes. Finally, both the known and generated pseudo labels are used to train MLR models. Extensive experiments conducted on the Microsoft COCO, Visual Genome, and Pascal VOC 2007 datasets show that the proposed HST framework achieves superior performance to that of current state-of-the-art algorithms. Specifically, it obtains mean average precision (mAP) improvements of 1.4%, 3.3%, and 0.4% on the three datasets over the results of the best-performing previously developed algorithm.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (54)
  1. arXiv preprint arXiv:1412.7755 (2014)
  2. In: CVPR 2011, pp. 2801–2808. IEEE (2011)
  3. In: Advances in neural information processing systems, pp. 190–198 (2011)
  4. Trends in Practical Applications of Agents and Multiagent Systems pp. 181–188 (2013)
  5. IEEE Transactions on Pattern Analysis and Machine Intelligence 44(3), 1371–1384 (2022). DOI 10.1109/TPAMI.2020.3025814
  6. In: Proceedings of the AAAI conference on artificial intelligence (2022)
  7. IEEE Transactions on Pattern Analysis and Machine Intelligence 44(12), 9887–9903 (2022)
  8. In: Proc. of AAAI Conference on Artificial Intelligence, pp. 6730–6737 (2018)
  9. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 522–531 (2019)
  10. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021)
  11. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5177–5186 (2019)
  12. In: Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, pp. 248–255. IEEE (2009)
  13. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 647–657 (2019)
  14. International journal of computer vision 88(2), 303–338 (2010)
  15. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778 (2016)
  16. Neural computation 9(8), 1735–1780 (1997)
  17. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9423–9432 (2020)
  18. In: European Conference on Computer Vision, pp. 67–84. Springer (2016)
  19. In: P.L. Bartlett, F.C.N. Pereira, C.J.C. Burges, L. Bottou, K.Q. Weinberger (eds.) Proceedings of Advances in Neural Information Processing Systems, pp. 2654–2662 (2012)
  20. In: Y. Bengio, Y. LeCun (eds.) 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings (2015)
  21. International Journal of Computer Vision 123(1), 32–73 (2017)
  22. International Journal of Computer Vision 128(7), 1937–1955 (2020)
  23. In: 2010 Second International Conference on Multimedia and Information Technology, vol. 2, pp. 10–13. IEEE (2010)
  24. International Journal of Computer Vision 127(6-7), 884–906 (2019)
  25. In: International Conference on Learning Representations (2016)
  26. In: Proceedings of the IEEE international conference on computer vision, pp. 2980–2988 (2017)
  27. In: European Conference on Computer Vision, pp. 740–755. Springer (2014)
  28. IEEE Transactions on Intelligent Transportation Systems 20(10), 3875–3887 (2019)
  29. Multimedia Tools and Applications 81(6), 7917–7940 (2022)
  30. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP), pp. 1532–1543 (2014)
  31. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, pp. 2091–2098 (2022)
  32. IEEE Transactions on Image Processing (2024)
  33. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 82–91 (2021)
  34. International Journal of Computer Vision 130(2), 455–477 (2022)
  35. In: Y. Bengio, Y. LeCun (eds.) International Conference on Learning Representations (2015)
  36. In: Proceedings of the IEEE international conference on computer vision, pp. 843–852 (2017)
  37. In: Proceedings of the twenty-fourth AAAI conference on artificial intelligence, pp. 593–598 (2010)
  38. International Journal of Data Warehousing and Mining (IJDWM) 3(3), 1–13 (2007)
  39. International Journal of Computer Vision 130, 2924–2939 (2022)
  40. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2285–2294 (2016)
  41. In: European Conference on Computer Vision, pp. 425–439. Springer (2014)
  42. In: Computer Vision (ICCV), 2017 IEEE International Conference on, pp. 464–472. IEEE (2017)
  43. IEEE transactions on pattern analysis and machine intelligence 38(9), 1901–1907 (2016)
  44. In: Proceedings of the IEEE international conference on computer vision, pp. 4157–4165 (2015)
  45. In: Proceedings of IEEE International Conference on Computer Vision, pp. 4157–4165 (2015)
  46. In: Advances in neural information processing systems, pp. 2301–2309 (2013)
  47. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 280–288 (2016)
  48. In: European Conference on Computer Vision, pp. 649–665. Springer (2020)
  49. In: International conference on machine learning, pp. 593–601 (2014)
  50. arXiv preprint arXiv:2108.05274 (2021)
  51. In: 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), vol. 2, pp. 288–295. IEEE (2014)
  52. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2921–2929 (2016)
  53. International Journal of Computer Vision 130(9), 2337–2348 (2022)
  54. International journal of computer vision 129, 1580–1595 (2021)
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Tianshui Chen (51 papers)
  2. Tao Pu (13 papers)
  3. Lingbo Liu (40 papers)
  4. Yukai Shi (44 papers)
  5. Zhijing Yang (35 papers)
  6. Liang Lin (318 papers)
Citations (12)