Improving Automated Distractor Generation for Math Multiple-choice Questions with Overgenerate-and-rank (2405.05144v2)

Published 19 Apr 2024 in cs.CY and cs.LG

Abstract: Multiple-choice questions (MCQs) are commonly used across all levels of math education since they can be deployed and graded at a large scale. A critical component of MCQs is the distractors, i.e., incorrect answers crafted to reflect student errors or misconceptions. Automatically generating them in math MCQs, e.g., with LLMs, has been challenging. In this work, we propose a novel method to enhance the quality of generated distractors through overgenerate-and-rank, training a ranking model to predict how likely distractors are to be selected by real students. Experimental results on a real-world dataset and human evaluation with math teachers show that our ranking model increases alignment with human-authored distractors, although human-authored ones are still preferred over generated ones.
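The overgenerate-and-rank idea described in the abstract can be illustrated with a minimal sketch: produce more candidate distractors than needed, score each with a ranking model, and keep the top few. Everything named here is an assumption for illustration only: `overgenerate_and_rank`, the `generate` and `score` callables, and the defaults `n_candidates=10` and `k=3` are hypothetical and not taken from the paper. In particular, `score` stands in for a trained ranking model that estimates how likely students are to select a distractor; the toy version below is just a lookup table.

```python
from typing import Callable, List, Sequence


def overgenerate_and_rank(
    question: str,
    correct_answer: str,
    generate: Callable[[str, int], Sequence[str]],
    score: Callable[[str, str], float],
    n_candidates: int = 10,  # hypothetical default: how many candidates to overgenerate
    k: int = 3,              # hypothetical default: how many distractors to keep
) -> List[str]:
    """Overgenerate candidate distractors, then keep the k highest-scoring ones."""
    # Step 1: overgenerate. `generate` is a placeholder for any generator (e.g. an LLM).
    candidates = generate(question, n_candidates)

    # Step 2: drop empty strings, duplicates, and the correct answer itself.
    seen = set()
    unique: List[str] = []
    for cand in candidates:
        cand = cand.strip()
        if cand and cand != correct_answer and cand not in seen:
            seen.add(cand)
            unique.append(cand)

    # Step 3: rank. `score` is a placeholder for a ranking model that predicts
    # how likely students are to pick the distractor (higher means more likely).
    ranked = sorted(unique, key=lambda d: score(question, d), reverse=True)
    return ranked[:k]


# Toy stand-ins so the sketch runs end to end; the values are invented.
def toy_generate(question: str, n: int) -> Sequence[str]:
    return ["7", "12", "7", "1", "10", " 12 "][:n]


def toy_score(question: str, distractor: str) -> float:
    return {"7": 0.6, "10": 0.3, "1": 0.1}.get(distractor, 0.0)


top = overgenerate_and_rank("What is 3 * 4?", "12", toy_generate, toy_score, k=2)
print(top)
```

With the toy stand-ins, the duplicate `"7"` and both copies of the correct answer `"12"` are filtered out before ranking, so only genuinely incorrect options compete for the top `k` slots.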

Authors (5)
  1. Alexander Scarlatos (16 papers)
  2. Wanyong Feng (8 papers)
  3. Digory Smith (7 papers)
  4. Simon Woodhead (16 papers)
  5. Andrew Lan (48 papers)