Towards Diverse and Effective Question-Answer Pair Generation from Children Storybooks (2306.06605v1)
Abstract: Recent advances in QA pair generation (QAG) have raised interest in applying this technique to the educational field. However, the diversity of QA types remains a challenge despite its contributions to comprehensive learning and assessment of children. In this paper, we propose a QAG framework that enhances QA type diversity by producing different interrogative sentences and implicit/explicit answers. Our framework comprises a QFS-based answer generator, an iterative QA generator, and a relevancy-aware ranker. The two generators aim to expand the number of candidates while covering various types. The ranker trained on the in-context negative samples clarifies the top-N outputs based on the ranking score. Extensive evaluations and detailed analyses demonstrate that our approach outperforms previous state-of-the-art results by significant margins, achieving improved diversity and quality. Our task-oriented processes are consistent with real-world demand, which highlights our system's high applicability.
- Kathleen Cotton. 1988. Classroom questioning. School improvement research series, 5:1–22.
- James T Dillon. 2006. Effect of questions in education and other enterprises. In Rethinking schooling, pages 145–174. Routledge.
- Closed-book question generation via contrastive learning. arXiv preprint arXiv:2210.06781.
- Learning to ask: Neural question generation for reading comprehension. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1342–1352.
- A feasibility study of answer-agnostic question generation for education. In Findings of the Association for Computational Linguistics: ACL 2022, pages 1919–1926, Dublin, Ireland. Association for Computational Linguistics.
- Kathleen Ellis. 1993. Teacher questioning behavior and student learning: What research says to teachers.
- The role of questioning technique in developing thinking skills: The ongoing effect on writing skill. Procedia-Social and Behavioral Sciences, 70:1024–1031.
- Frank J Guszak. 1967. Teacher questioning and reading. The reading teacher, 21(3):227–234.
- Violeta Janusheva and Milena Pejchinovska. 2009. Questions posing importance and role in the teaching process.
- Automatic question generation and the smartstart application. In Proceedings of the Eighth ACM Conference on Learning@ Scale, pages 365–366.
- Let me know what to ask: Interrogative-word-aware question generation. In Proceedings of the 2nd Workshop on Machine Reading for Question Answering, pages 163–171.
- Young-Suk Grace Kim. 2017. Why the simple view of reading is not simplistic: Unpacking component skills of reading using a direct and indirect effect model of reading (dier). Scientific Studies of Reading, 21(4):310–333.
- The narrativeqa reading comprehension challenge. Transactions of the Association for Computational Linguistics, 6:317–328.
- Klaus Krippendorff. 2011. Computing krippendorff’s alpha-reliability.
- Deep questions without deep understanding. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 889–898.
- Automatic question generation for educational applications–the state of art. Advanced computational methods for knowledge engineering, pages 325–338.
- Generating diverse and consistent QA pairs from contexts with information-maximizing hierarchical conditional VAEs. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 208–224, Online. Association for Computational Linguistics.
- Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 7871–7880.
- Consecutive question generation via dynamic multitask learning. arXiv preprint arXiv:2211.08850.
- Generating natural language questions to support learning on-line. In Proceedings of the 14th European Workshop on Natural Language Generation, pages 105–114.
- Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692.
- Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res., 21(140):1–67.
- Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108.
- Self-attention architectures for answer-agnostic neural question generation. In Proceedings of the 57th annual meeting of the Association for Computational Linguistics, pages 6027–6032.
- End-to-end synthetic data generation for domain adaptation of question answering systems. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 5445–5460, Online. Association for Computational Linguistics.
- Questioning techniques and teachers’ role in the classroom. Shanlax International Journal of Education, 8(4):45–49.
- Exploring neural models for query-focused summarization. In Findings of the Association for Computational Linguistics: NAACL 2022, pages 1455–1468, Seattle, United States. Association for Computational Linguistics.
- Exploring neural models for query-focused summarization. arXiv preprint arXiv:2112.07637.
- Towards process-oriented, modular, and versatile question generation that meets educational needs. arXiv preprint arXiv:2205.00355.
- Fantastic questions and where to find them: FairytaleQA – an authentic dataset for narrative comprehension. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 447–460, Dublin, Ireland. Association for Computational Linguistics.
- It is AI’s turn to ask humans a question: Question-answer pair generation for children’s story books. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 731–744, Dublin, Ireland. Association for Computational Linguistics.
- A review on question generation from natural language text. ACM Transactions on Information Systems (TOIS), 40(1):1–43.
- Shiyue Zhang and Mohit Bansal. 2019. Addressing semantic drift in question generation for semi-supervised question answering. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 2495–2509.
- Bertscore: Evaluating text generation with bert. arXiv preprint arXiv:1904.09675.
- Educational question generation of children storybooks via question type distribution learning and event-centric summarization. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 5073–5085, Dublin, Ireland. Association for Computational Linguistics.
- Multi-task learning with language modeling for question generation. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 3394–3399, Hong Kong, China. Association for Computational Linguistics.
- Sugyeong Eo (11 papers)
- Hyeonseok Moon (20 papers)
- Jinsung Kim (10 papers)
- Yuna Hur (4 papers)
- Jeongwook Kim (2 papers)
- Songeun Lee (1 paper)
- Changwoo Chun (1 paper)
- Sungsoo Park (7 papers)
- Heuiseok Lim (49 papers)