Clue-Instruct: Text-Based Clue Generation for Educational Crossword Puzzles (2404.06186v1)
Abstract: Crossword puzzles are popular linguistic games often used as tools to engage students in learning. Educational crosswords are characterized by less cryptic and more factual clues that distinguish them from traditional crossword puzzles. Despite there exist several publicly available clue-answer pair databases for traditional crosswords, educational clue-answer pairs datasets are missing. In this article, we propose a methodology to build educational clue generation datasets that can be used to instruct LLMs. By gathering from Wikipedia pages informative content associated with relevant keywords, we use LLMs to automatically generate pedagogical clues related to the given input keyword and its context. With such an approach, we created clue-instruct, a dataset containing 44,075 unique examples with text-keyword pairs associated with three distinct crossword clues. We used clue-instruct to instruct different LLMs to generate educational clues from a given input content and keyword. Both human and automatic evaluations confirmed the quality of the generated clues, thus validating the effectiveness of our approach.
- Solving italian crosswords using the web. In AI* IA 2005: Advances in Artificial Intelligence: 9th Congress of the Italian Association for Artificial Intelligence, Milan, Italy, September 21-32, 2005. Proceedings 9, pages 393–405. Springer.
- Webcrow: A web-based crosswords solver. In Intelligent Technologies for Interactive Entertainment: First International Conference, INTETAIN 2005, Madonna di Campiglio, Italy, November 30–December 2, 2005. Proceedings 1, pages 295–298. Springer.
- The webcrow french crossword solver. arXiv preprint arXiv:2311.15626.
- Automatic keyword extraction and crossword generation tool for indian languages: Seekh. In 2019 IEEE Tenth International Conference on Technology for Education (T4E), pages 272–273. IEEE.
- Sacry: Syntax-based automatic crossword puzzle resolution system. In Proceedings of 53nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Beijing, China, July. Association for Computational Linguistics.
- Yolanda Dita Bella and Endang Mastuti Rahayu. 2023. The improving of the student’s vocabulary achievement through crossword game in the new normal era. Edunesia: Jurnal Ilmiah Pendidikan, 4(2):830–842.
- Language models are few-shot learners. Advances in neural information processing systems, 33:1877–1901.
- Sunita M Dol. 2017. Gpbl: An effective way to improve critical thinking and problem solving skills in engineering education. J Engin Educ Trans, 30(3):103–13.
- Dzulfikri Dzulfikri. 2016. Application-based crossword puzzles: Players’ perception and vocabulary retention. Studies in English Language and Education, 3(2):122–133.
- Webcrow: A web-based system for crossword solving. In AAAI, pages 1412–1417.
- A web-based agent challenges human experts on crosswords. AI Magazine, 29:77–90.
- Automatic definition extraction and crossword generation from spanish news text. CLEI Electronic Journal, 20(2).
- Matthew L Ginsberg. 2011. Dr. fill: Crosswords and an implemented solver for singly weighted csps. Journal of Artificial Intelligence Research, 42:851–886.
- The curious case of neural text degeneration. arXiv preprint arXiv:1904.09751.
- Lora: Low-rank adaptation of large language models. arXiv preprint arXiv:2106.09685.
- The effect of crossword puzzle activity used in distance education on nursing students’ problem-solving and clinical decision-making skills: A comparative study. Nurse Education in Practice, 69:103618.
- Solving crosswords with proverb. In AAAI/IAAI, pages 914–915.
- Shane T Mueller and Elizabeth S Veinott. 2018. Testing the effectiveness of crossword games on immediate and delayed memory for scientific vocabulary and concepts. In CogSci.
- RS Nickerson. 1977. Crossword puzzles and lexical memory. In Attention and performance VI, pages 699–718. Routledge.
- Wiwat Orawiwatnakul. 2013. Crossword puzzles as a learning tool for vocabulary development. Electronic Journal of Research in Education Psychology, 11(30):413–428.
- Automatic generation of fill-in clues and answers from raw texts for crosswords. In 2013 8th International Conference on Information Technology in Asia (CITA), pages 1–5. IEEE.
- Leonardo Rigutini. 2010. Automatic Text Processing: Machine Learning Techniques. LAP Lambert Academic Publishing.
- A fully automatic crossword generator. In 2008 Seventh International Conference on Machine Learning and Applications, pages 362–367. IEEE.
- Automatic generation of crossword puzzles. International Journal on Artificial Intelligence Tools, 21(03):1250014.
- Corina Sandiuc and Alina Balagiu. 2020. The use of crossword puzzles as a strategy to teach maritime english vocabulary. Scientific Bulletin" Mircea cel Batran" Naval Academy, 23(1):236A–242.
- MosaicML NLP Team. 2023. Introducing mpt-30b: Raising the bar for open-source foundation models. Accessed: 2023-06-22.
- Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288.
- Automated crossword solving. arXiv preprint arXiv:2205.09665.
- Self-instruct: Aligning language model with self generated instructions. arXiv preprint arXiv:2212.10560.
- Crossword puzzles for chemistry education: learning goals beyond vocabulary. Chemistry education research and practice, 17(3):532–554.
- The use of crossword puzzles as an educational tool. Journal of Advances in Medical Education & Professionalism, 9(2):102.
- Building bridges of knowledge: Innovating education with automated crossword generation. In 2023 International Conference on Machine Learning and Applications (ICMLA), pages 1228–1236.
- ArabIcros: AI-powered Arabic crossword puzzle generation for educational applications. In Proceedings of ArabicNLP 2023, pages 288–301, Singapore (Hybrid). Association for Computational Linguistics.
- Italian crossword generator: Enhancing education through interactive word puzzles. arXiv preprint arXiv:2311.15723.
- Lima: Less is more for alignment.
- Gaming in education: Using games as a support tool to teach history. Journal of Education and Practice, 8(15):55–64.
- Andrea Zugarini and Marco Ernandes. 2021. A multi-strategy approach to crossword clue answer retrieval and ranking. In Proceedings of the Eighth Italian Conference on Computational Linguistics, CLiC-it Milan, Italy.
- Die rätselrevolution: Automated german crossword solving. In Proceedings of the 9th Italian Conference on Computational Linguistics, CLiC-it, Venice, Italy.
- Andrea Zugarini (22 papers)
- Kamyar Zeinalipour (12 papers)
- Surya Sai Kadali (1 paper)
- Marco Maggini (36 papers)
- Marco Gori (82 papers)
- Leonardo Rigutini (16 papers)