Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Robust Training for Conversational Question Answering Models with Reinforced Reformulation Generation (2310.13505v3)

Published 20 Oct 2023 in cs.CL, cs.AI, and cs.IR

Abstract: Models for conversational question answering (ConvQA) over knowledge graphs (KGs) are usually trained and tested on benchmarks of gold QA pairs. This implies that training is limited to surface forms seen in the respective datasets, and evaluation is on a small set of held-out questions. Through our proposed framework REIGN, we take several steps to remedy this restricted learning setup. First, we systematically generate reformulations of training questions to increase robustness of models to surface form variations. This is a particularly challenging problem, given the incomplete nature of such questions. Second, we guide ConvQA models towards higher performance by feeding it only those reformulations that help improve their answering quality, using deep reinforcement learning. Third, we demonstrate the viability of training major model components on one benchmark and applying them zero-shot to another. Finally, for a rigorous evaluation of robustness for trained models, we use and release large numbers of diverse reformulations generated by prompting GPT for benchmark test sets (resulting in 20x increase in sizes). Our findings show that ConvQA models with robust training via reformulations, significantly outperform those with standard training from gold QA pairs only.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (87)
  1. Never-ending learning for open-domain question answering over knowledge bases. In WWW.
  2. ComQA: A Community-sourced Dataset for Complex Factoid Question Answering with Paraphrase Clusters. In NAACL-HLT ’19.
  3. Open-Domain Question Answering Goes Conversational via Question Rewriting. In arXiv.
  4. DBpedia: A nucleus for a Web of open data. The Semantic Web (2007).
  5. Fluent Response Generation for Conversational Question Answering. In ACL.
  6. Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation. In EMNLP.
  7. Hannah Bast and Elmar Haussmann. 2015. More accurate question answering on Freebase. In CIKM.
  8. Jonathan Berant and Percy Liang. 2014. Semantic parsing via paraphrasing. In ACL.
  9. Ask the right questions: Active question reformulation with reinforcement learning. In ICLR.
  10. Reinforced question rewriting for conversational question answering. arXiv (2022).
  11. QuAC: Question answering in context. In EMNLP.
  12. Look before you Hop: Conversational Question Answering over Knowledge Graphs Using Judicious Context Expansion. In CIKM.
  13. Beyond NED: Fast and Effective Search Space Reduction for Complex Question Answering over Knowledge Bases. In WSDM.
  14. Conversational Question Answering on Heterogeneous Sources. In SIGIR.
  15. Explainable Conversational Question Answering over Heterogeneous Sources via Iterative Graph Neural Networks. In SIGIR.
  16. Conversational Information Seeking: Theory and Application. In SIGIR.
  17. CAsT 2019: The conversational assistance track overview. In TREC.
  18. Multi-step retriever-reader interaction for scalable open-domain question answering. In ICLR.
  19. BERT: Pre-training of deep bidirectional transformers for language understanding. In NAACL-HLT.
  20. Learning to Paraphrase for Question Answering. In EMNLP.
  21. Paraphrase-driven learning for open question answering. In ACL.
  22. Open question answering over curated and extracted knowledge bases. In KDD.
  23. Paolo Ferragina and Ugo Scaiella. 2010. TAGME: On-the-fly annotation of short text fragments (by Wikipedia entities). In CIKM. 1625–1628.
  24. Wee Chung Gan and Hwee Tou Ng. 2019. Improving the robustness of question answering systems to question paraphrasing. In ACL.
  25. On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method. (2023).
  26. Dialog-to-action: Conversational question answering over a large-scale knowledge base. In NeurIPS.
  27. Somil Gupta and Neeraj Sharma. 2021. Role of Attentive History Selection in Conversational Information Seeking. In arXiv.
  28. Natural Language Based Reformulation Resource and Wide Exploitation for Question Answering.. In TREC.
  29. Lynette Hirschman and Robert Gaizauskas. 2001. Natural language question answering: The view from here. Natural Language Engineering 7, 4 (2001), 275–300.
  30. Integrating Question Rewrites in Conversational Question Answering: A Reinforcement Learning Approach. In ACL Student Research Workshop.
  31. Can Question Rewriting Help Conversational Question Answering?. In 3rd Workshop on Insights from Negative Results in NLP.
  32. Parag Jain and Mirella Lapata. 2023. Conversational Semantic Parsing using Dynamic Context Graphs. arXiv preprint arXiv:2305.06164 (2023).
  33. Conversational Question Answering over Knowledge Graphs with Transformer and Graph Attention Networks. In EACL.
  34. Contrastive Representation Learning for Conversational Question Answering over Knowledge Graphs. In CIKM.
  35. Conversational Question Answering over Passages by Leveraging Word Proximity Networks. In SIGIR.
  36. Reinforcement learning from reformulations in conversational question answering over knowledge graphs. In SIGIR.
  37. Knowledge-augmented Self-training of A Question Rewriter for Conversational Knowledge Base Question Answering. In EMNLP.
  38. Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational Question Answering. In ACL.
  39. Yunshi Lan and Jing Jiang. 2021. Modeling transitions of focal entities for conversational knowledge base question answering. In ACL.
  40. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. In ACL.
  41. A Cooperative Neural Information Retrieval Pipeline with Knowledge Enhanced Automatic Query Reformulation. In WSDM.
  42. MMCoQA: Conversational Question Answering over Text, Tables, and Images. In ACL.
  43. Multi-stage conversational passage retrieval: An approach to fusing term importance estimation and neural query rewriting. TOIS (2021).
  44. Trond Linjordet and Krisztian Balog. 2022. Would You Ask it that Way? Measuring and Improving Question Naturalness for Knowledge Graph Question Answering. In SIGIR.
  45. Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space. In EMNLP.
  46. Rainier: Reinforced knowledge introspector for commonsense question answering. arXiv (2022).
  47. Generative question refinement with deep reinforcement learning in retrieval-based QA system. In CIKM.
  48. Ying-Hsang Liu and Nicholas J Belkin. 2008. Query reformulation, search performance, and term suggestion devices in question-answering tasks. In IIiX.
  49. Structured Context and High-Coverage Grammar for Conversational Question Answering over Knowledge Graphs. In EMNLP.
  50. Human-level control through deep reinforcement learning. Nature 518, 7540 (2015).
  51. ConvGQR: Generative Query Reformulation for Conversational Search.
  52. DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering. arXiv preprint arXiv:2211.05655 (2022).
  53. Rodrigo Nogueira and Kyunghyun Cho. 2017. Task-oriented query reformulation with reinforcement learning. In EMNLP.
  54. Exploiting Simulated User Feedback for Conversational Search: Ranking, Rewriting, and Beyond. In SIGIR.
  55. Semantic Parsing for Conversational Question Answering over Knowledge Graphs. In EACL.
  56. Feedback-based self-learning in large-scale conversational AI agents. In IAAI (AAAI Workshop).
  57. Training Question Answering Models From Synthetic Data. In EMNLP.
  58. Reinforced History Backtracking for Conversational Question Answering. In AAAI.
  59. Open-Retrieval Conversational Question Answering. In SIGIR.
  60. BERT with history answer embedding for conversational question answering. In SIGIR.
  61. Attentive history selection for conversational question answering. In CIKM.
  62. Filip Radlinski and Nick Craswell. 2017. A theoretical framework for conversational search. In CHIIR.
  63. Question rewriting? Assessing its importance for conversational question answering. In ECIR 2022.
  64. CoQA: A conversational question answering challenge. TACL 7 (2019).
  65. Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In EMNLP-IJCNLP.
  66. Rishiraj Saha Roy and Avishek Anand. 2022. Question Answering for the Curated Web: Tasks and Methods in QA over Knowledge Bases and Text Collections. Synthesis Lectures on Information Concepts, Retrieval, and Services (2022).
  67. Mrinmaya Sachan and Eric Xing. 2018. Self-training for jointly learning to ask and answer questions. In NAACL.
  68. Complex sequential question answering: Towards learning to converse over linked question answer pairs with a knowledge graph. In AAAI.
  69. Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge Base. In EMNLP.
  70. Product Answer Generation from Heterogeneous Sources: A New Benchmark and Best Practices. In ECNLP.
  71. YAGO: A core of semantic knowledge. In WWW.
  72. History Semantic Graph Enhanced Conversational KBQA with Temporal Information Modeling. In ACL.
  73. Noriko Tomuro. 2003. Interrogative reformulation patterns and acquisition of question paraphrases. In Proceedings of the second international workshop on Paraphrasing.
  74. Noriko Tomuro and Steven L. Lytinen. 2001. Selecting features for paraphrasing question sentences. In NLPRS.
  75. 7th Open challenge on question answering over linked data (QALD-7). In Semantic Web Evaluation Challenge.
  76. A Wrong Answer or a Wrong Question? An Intricate Relationship between Question Reformulation and Answer Selection in Conversational Question Answering. In SCAI.
  77. Question rewriting for conversational question answering. In WSDM.
  78. Ellen M. Voorhees. 1999. The TREC-8 question answering track report. In TREC.
  79. Query resolution for conversational search with limited supervision. In SIGIR.
  80. Denny Vrandečić and Markus Krötzsch. 2014. Wikidata: A free collaborative knowledge base. CACM 57, 10 (2014).
  81. Ronald J Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning 8, 3-4 (1992).
  82. Automatically mining question reformulation patterns from search log data. In ACL.
  83. Data augmentation for bert fine-tuning in open-domain question answering. arXiv preprint arXiv:1904.06652 (2019).
  84. Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base. In ACL.
  85. Few-Shot Generative Conversational Query Rewriting. In SIGIR.
  86. Conversational information seeking. arXiv preprint arXiv:2201.08808 (2022).
  87. Analyzing and Simulating User Utterance Reformulation in Conversational Recommender Systems. In SIGIR.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Magdalena Kaiser (6 papers)
  2. Rishiraj Saha Roy (23 papers)
  3. Gerhard Weikum (75 papers)
Citations (1)