Disentangling comprehension from memorization on the Swahili proverbs task
Ascertain whether the accuracy that large language models achieve on the BIG-bench swahili_english_proverbs task reflects genuine understanding of Swahili or memorization of specific proverbs found in internet sources used during pretraining.
References
However, it is not clear whether this performance indicates general understanding of Swahili or instead memorization of proverbs listed on the internet.
— Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
(Srivastava et al., 2022, arXiv:2206.04615), Section “Performance on non-English languages,” subsection “Low-resource language tasks are particularly challenging”