
Making Large Language Models Interactive: A Pioneer Study on Supporting Complex Information-Seeking Tasks with Implicit Constraints (2205.00584v2)

Published 2 May 2022 in cs.IR

Abstract: Current interactive systems with natural language interfaces cannot understand a complex information-seeking request that expresses several implicit constraints at once, with no prior information about user preferences, e.g., "find hiking trails around San Francisco which are accessible with toddlers and have beautiful scenery in summer", where the output is a list of possible suggestions for users to start their exploration. In such scenarios, user requests can be issued in one shot in the form of a complex and long query, unlike conversational and exploratory search models, where short utterances or queries are often presented to the system step by step. We have designed and deployed a platform to collect data for approaching such complex interactive systems. Moreover, despite the current advancement of generative LLMs, these models suffer from hallucination when providing factual knowledge. LLMs are trained in large part on web-scraped data from the past, which is usually not useful for users' immediate needs. In this article, we propose an IA that leverages large language models (LLMs) for complex request understanding and makes it interactive using reinforcement learning, which allows user requests to be intricately refined by making them complete, leading to better retrieval and reducing LLM hallucination problems for current user needs. To demonstrate the performance of the proposed modeling paradigm, we have adopted various pre-retrieval metrics that capture the extent to which guided interactions with our system yield better retrieval results. Through extensive experimentation, we demonstrated that our method significantly outperforms several robust baselines.
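
As an illustration of what a pre-retrieval metric can look like, the sketch below computes average inverse document frequency (avgIDF), a classic pre-retrieval specificity predictor, for a short request and a more complete one. The abstract does not say which metrics the authors adopted, so the choice of avgIDF, the toy corpus, and the function names `build_idf` and `avg_idf` are all assumptions made for this example.

```python
import math
from collections import Counter


def build_idf(documents):
    """Return (idf_by_term, n_docs) using a smoothed IDF over a toy corpus."""
    n_docs = len(documents)
    doc_freq = Counter()
    for doc in documents:
        # Count each term once per document.
        doc_freq.update(set(doc.lower().split()))
    idf = {
        term: math.log((n_docs + 1) / (df + 1)) + 1.0
        for term, df in doc_freq.items()
    }
    return idf, n_docs


def avg_idf(query, idf, n_docs):
    """Average IDF of the query terms; higher suggests a more specific query."""
    terms = query.lower().split()
    if not terms:
        return 0.0
    # Terms never seen in the corpus get the IDF of document frequency 0.
    unseen_idf = math.log(n_docs + 1) + 1.0
    return sum(idf.get(t, unseen_idf) for t in terms) / len(terms)


# Hypothetical toy corpus, not data from the paper.
CORPUS = [
    "hiking trails near san francisco",
    "hiking trails with beautiful scenery",
    "hiking trails accessible with toddlers",
    "restaurants near san francisco",
]
IDF, N_DOCS = build_idf(CORPUS)

vague_score = avg_idf("hiking trails", IDF, N_DOCS)
refined_score = avg_idf("hiking trails accessible with toddlers", IDF, N_DOCS)
```

On this toy corpus the refined request scores higher than the vague one because its added terms are rarer. That is not guaranteed in general: adding common terms can lower avgIDF, which is one reason several pre-retrieval metrics are usually considered together.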

Authors (10)
  1. Ali Ahmadvand (14 papers)
  2. Negar Arabzadeh (28 papers)
  3. Julia Kiseleva (33 papers)
  4. Patricio Figueroa Sanz (2 papers)
  5. Xin Deng (26 papers)
  6. Sujay Jauhar (2 papers)
  7. Michael Gamon (9 papers)
  8. Eugene Agichtein (33 papers)
  9. Ned Friend (1 paper)
  10. Aniruddha (1 paper)
Citations (1)