Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Correlated and Individual Multi-Modal Deep Learning for RGB-D Object Recognition (1604.01655v3)

Published 6 Apr 2016 in cs.CV

Abstract: In this paper, we propose a new correlated and individual multi-modal deep learning (CIMDL) method for RGB-D object recognition. Unlike most conventional RGB-D object recognition methods which extract features from the RGB and depth channels individually, our CIMDL jointly learns feature representations from raw RGB-D data with a pair of deep neural networks, so that the sharable and modal-specific information can be simultaneously exploited. Specifically, we construct a pair of deep convolutional neural networks (CNNs) for the RGB and depth data, and concatenate them at the top layer of the network with a loss function which learns a new feature space where both correlated part and the individual part of the RGB-D information are well modelled. The parameters of the whole networks are updated by using the back-propagation criterion. Experimental results on two widely used RGB-D object image benchmark datasets clearly show that our method outperforms state-of-the-arts.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Ziyan Wang (42 papers)
  2. Jiwen Lu (192 papers)
  3. Ruogu Lin (3 papers)
  4. Jianjiang Feng (37 papers)
  5. Jie Zhou (687 papers)
Citations (29)

Summary

We haven't generated a summary for this paper yet.