Speeding Up Entmax (2111.06832v3)

Published 12 Nov 2021 in cs.CL and cs.LG

Abstract: Softmax is the de facto standard in modern neural networks for language processing when it comes to normalizing logits. However, because it produces a dense probability distribution, each token in the vocabulary has a nonzero chance of being selected at each generation step, leading to a variety of reported problems in text generation. The $\alpha$-entmax of Peters et al. (2019, arXiv:1905.05702) solves this problem, but is considerably slower than softmax. In this paper, we propose an alternative to $\alpha$-entmax which keeps its virtuous characteristics but is as fast as optimized softmax and achieves on-par or better performance on the machine translation task.
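
For readers unfamiliar with the sparsity the abstract refers to: $\alpha$-entmax maps logits to a probability vector in which low-scoring entries are exactly zero, unlike softmax, which is dense. The minimal NumPy sketch below computes generic $\alpha$-entmax by bisecting on the threshold $\tau$; it is an illustration under our own assumptions (the name `entmax_bisect` and the bisection scheme are ours), not the optimized algorithm of Peters et al. and not the speed-up this paper proposes.

```python
import numpy as np

def entmax_bisect(logits, alpha=1.5, n_iter=50):
    """Generic alpha-entmax (alpha > 1) via bisection on the threshold tau.

    Solves for tau such that sum_i [(alpha-1)*z_i - tau]_+^(1/(alpha-1)) = 1
    and returns that (typically sparse) probability vector. Reference sketch
    for illustration only; the paper proposes a faster alternative.
    """
    z = (alpha - 1.0) * np.asarray(logits, dtype=np.float64)
    # The map tau -> sum(p) is continuous and decreasing:
    # at tau = max(z) - 1 the sum is >= 1, at tau = max(z) it is 0.
    lo, hi = z.max() - 1.0, z.max()
    for _ in range(n_iter):
        tau = 0.5 * (lo + hi)
        p = np.clip(z - tau, 0.0, None) ** (1.0 / (alpha - 1.0))
        if p.sum() >= 1.0:
            lo = tau  # threshold too small: too much probability mass
        else:
            hi = tau  # threshold too large: too little mass
    p = np.clip(z - lo, 0.0, None) ** (1.0 / (alpha - 1.0))
    return p / p.sum()  # renormalize away the residual bisection error

logits = np.array([3.0, 1.5, 1.0, -2.0])
sm = np.exp(logits - logits.max())
sm /= sm.sum()
print(np.round(sm, 3))                     # dense: every entry is > 0
print(np.round(entmax_bisect(logits), 3))  # sparse: trailing entries exactly 0
```

On these example logits, softmax assigns nonzero mass to every token (roughly [0.733, 0.163, 0.099, 0.005]), while 1.5-entmax zeroes out the two low-scoring ones; that hard truncation of unlikely tokens is the property the paper aims to keep while matching softmax's speed.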

Authors (4)
  1. Maxat Tezekbayev (7 papers)
  2. Vassilina Nikoulina (28 papers)
  3. Matthias Gallé (31 papers)
  4. Zhenisbek Assylbekov (16 papers)
Citations (2)
