Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Hadamard Product for Low-rank Bilinear Pooling (1610.04325v4)

Published 14 Oct 2016 in cs.CV, cs.AI, and cs.NE

Abstract: Bilinear models provide rich representations compared with linear models. They have been applied in various visual tasks, such as object recognition, segmentation, and visual question-answering, to get state-of-the-art performances taking advantage of the expanded representations. However, bilinear representations tend to be high-dimensional, limiting the applicability to computationally complex tasks. We propose low-rank bilinear pooling using Hadamard product for an efficient attention mechanism of multimodal learning. We show that our model outperforms compact bilinear pooling in visual question-answering tasks with the state-of-the-art results on the VQA dataset, having a better parsimonious property.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Jin-Hwa Kim (42 papers)
  2. Kyoung-Woon On (19 papers)
  3. Woosang Lim (6 papers)
  4. Jeonghee Kim (4 papers)
  5. Jung-Woo Ha (67 papers)
  6. Byoung-Tak Zhang (83 papers)