
Are LLMs Good Cryptic Crossword Solvers? (2403.12094v1)

Published 15 Mar 2024 in cs.AI, cs.CL, and cs.LG

Abstract: Cryptic crosswords are puzzles that rely not only on general knowledge but also on the solver's ability to manipulate language on different levels and deal with various types of wordplay. Previous research suggests that solving such puzzles is a challenge even for modern NLP models. However, the abilities of LLMs have not yet been tested on this task. In this paper, we establish the benchmark results for three popular LLMs -- LLaMA2, Mistral, and ChatGPT -- showing that their performance on this task is still far from that of humans.

Authors (3)
  1. Abdelrahman "Boda" Sadallah (2 papers)
  2. Daria Kotova (3 papers)
  3. Ekaterina Kochmar (33 papers)
Citations (2)