Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Fair Comparison between Efficient Attentions (2206.00244v1)

Published 1 Jun 2022 in cs.CV and cs.LG

Abstract: Transformers have been successfully used in various fields and are becoming the standard tools in computer vision. However, self-attention, a core component of transformers, has a quadratic complexity problem, which limits the use of transformers in various vision tasks that require dense prediction. Many studies aiming at solving this problem have been reported proposed. However, no comparative study of these methods using the same scale has been reported due to different model configurations, training schemes, and new methods. In our paper, we validate these efficient attention models on the ImageNet1K classification task by changing only the attention operation and examining which efficient attention is better.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Jiuk Hong (2 papers)
  2. Chaehyeon Lee (4 papers)
  3. Soyoun Bang (1 paper)
  4. Heechul Jung (17 papers)
Citations (1)