Deep Learning Techniques for Future Intelligent Cross-Media Retrieval (2008.01191v1)

Published 21 Jul 2020 in cs.IR, cs.CV, and cs.LG

Abstract: With the advancement in technology and the expansion of broadcasting, cross-media retrieval has gained much attention. It plays a significant role in big data applications and consists in searching and finding data from different types of media. In this paper, we provide a novel taxonomy according to the challenges faced by multi-modal deep learning approaches in solving cross-media retrieval, namely: representation, alignment, and translation. These challenges are evaluated on deep learning (DL) based methods, which are categorized into four main groups: 1) unsupervised methods, 2) supervised methods, 3) pairwise based methods, and 4) rank based methods. Then, we present some well-known cross-media datasets used for retrieval, considering the importance of these datasets in the context in of deep learning based cross-media retrieval approaches. Moreover, we also present an extensive review of the state-of-the-art problems and its corresponding solutions for encouraging deep learning in cross-media retrieval. The fundamental objective of this work is to exploit Deep Neural Networks (DNNs) for bridging the "media gap", and provide researchers and developers with a better understanding of the underlying problems and the potential solutions of deep learning assisted cross-media retrieval. To the best of our knowledge, this is the first comprehensive survey to address cross-media retrieval under deep learning methods.

Authors (8)

Sadaqat ur Rehman (5 papers)
Muhammad Waqas (41 papers)
Shanshan Tu (3 papers)
Anis Koubaa (49 papers)
Obaid ur Rehman (10 papers)
Jawad Ahmad (34 papers)
Muhammad Hanif (9 papers)
Zhu Han (431 papers)

Citations (6)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Deep Learning Techniques for Future Intelligent Cross-Media Retrieval (2008.01191v1)

Summary

Related Papers