Determinantal Beam Search (2106.07400v4)
Abstract: Beam search is a go-to strategy for decoding neural sequence models. The algorithm can naturally be viewed as a subset optimization problem, albeit one where the corresponding set function does not reflect interactions between candidates. Empirically, this leads to sets often exhibiting high overlap, e.g., strings may differ by only a single word. Yet in use-cases that call for multiple solutions, a diverse or representative set is often desired. To address this issue, we propose a reformulation of beam search, which we call determinantal beam search. Determinantal beam search has a natural relationship to determinantal point processes (DPPs), models over sets that inherently encode intra-set interactions. By posing iterations in beam search as a series of subdeterminant maximization problems, we can turn the algorithm into a diverse subset selection process. In a case study, we use the string subsequence kernel to explicitly encourage n-gram coverage in text generated from a sequence model. We observe that our algorithm offers competitive performance against other diverse set generation strategies in the context of language generation, while providing a more general approach to optimizing for diversity.
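The abstract describes the core idea only at a high level: at each decoding step, instead of keeping the k highest-scoring hypotheses, keep the size-k subset that maximizes a log-determinant combining model scores with a pairwise similarity kernel. The sketch below is a minimal, illustrative greedy version of one such subdeterminant-maximization step; the function name, the `diversity_weight` parameter, and the exact construction of the matrix `L` are assumptions made for illustration, not the paper's precise formulation.

```python
import numpy as np

def greedy_subdeterminant_beam_step(log_probs, kernel, k, diversity_weight=1.0):
    """Greedily pick k of n candidate hypotheses to approximately maximize
    the log-determinant of the selected principal submatrix of

        L = diag(exp(log_probs)) + diversity_weight * K,

    where K[i, j] is a similarity kernel between candidates i and j
    (e.g., a string subsequence kernel). Model probability raises the
    diagonal; pairwise similarity shrinks the determinant, so the greedy
    choice trades quality against diversity.
    """
    n = len(log_probs)
    # Quality on the diagonal, similarity off the diagonal.
    L = diversity_weight * np.asarray(kernel, dtype=float)
    L[np.diag_indices(n)] += np.exp(np.asarray(log_probs, dtype=float))

    selected = []
    for _ in range(min(k, n)):
        best, best_score = None, -np.inf
        for i in range(n):
            if i in selected:
                continue
            idx = selected + [i]
            # Log-det of the subset with candidate i added; the argmax over i
            # coincides with the argmax of the marginal gain.
            sign, logdet = np.linalg.slogdet(L[np.ix_(idx, idx)])
            score = logdet if sign > 0 else -np.inf
            if score > best_score:
                best, best_score = i, score
        if best is None:
            break
        selected.append(best)
    return selected
```

With `diversity_weight = 0` the matrix is diagonal, the log-determinant is additive in the candidates' log-probabilities, and the greedy step reduces to ordinary top-k beam search; increasing the weight penalizes selecting highly similar strings.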
Authors: Clara Meister, Martina Forster, Ryan Cotterell