Papers
Topics
Authors
Recent
Search
2000 character limit reached

A Literature Survey of Recent Advances in Chatbots

Published 17 Jan 2022 in cs.CL | (2201.06657v1)

Abstract: Chatbots are intelligent conversational computer systems designed to mimic human conversation to enable automated online guidance and support. The increased benefits of chatbots led to their wide adoption by many industries in order to provide virtual assistance to customers. Chatbots utilise methods and algorithms from two Artificial Intelligence domains: Natural Language Processing and Machine Learning. However, there are many challenges and limitations in their application. In this survey we review recent advances on chatbots, where Artificial Intelligence and Natural Language processing are used. We highlight the main challenges and limitations of current work and make recommendations for future research investigation.

Citations (224)

Summary

  • The paper identifies the evolution from rule-based systems to advanced AI-driven models using IR and generative techniques.
  • It reviews methods like Sequence-to-Sequence, LSTM, and Transformer variants, noting improvements in context handling and model adaptability.
  • It highlights the need for tailored datasets and robust evaluation metrics to overcome challenges in dialogue continuity and domain adaptation.

A Comprehensive Survey of Advances in Chatbot Design and Implementation

The paper "A Literature Survey of Recent Advances in Chatbots" by Caldarini, Jaf, and McGarry offers an extensive review of the evolving landscape of chatbots, focusing on their implementation techniques, challenges, and applications. This essay provides an expert overview of the survey, synthesizing the insights drawn from the surveyed literature.

Chatbots, as conversational agents, have become integral to various industries due to their capability to automate dialogue and provide assistance. The paper categorizes chatbots into rule-based systems and AI-driven counterparts, with a particular focus on the latter due to recent advancements in NLP and Machine Learning (ML).

Implementation Approaches

The survey delineates the evolution of chatbot models from basic rule-based systems to more advanced AI models. AI chatbots are further classified into Information Retrieval (IR) and Generative-based models. While IR models depend on retrieving responses from predefined datasets, Generative models construct responses by employing neural network architectures, predominantly leveraging Encoder-Decoder frameworks with Long Short-Term Memory (LSTM) or more recent Transformer models. The authors highlight the widespread adoption of Sequence-to-Sequence models, which, despite their popularity, face limitations in context handling and maintain conversational coherence over extended dialogues.

The paper also examines the potential of Transformers and their variants like Transformer-XL and Reformer. These models exhibit improved context handling and efficiency, yet their computational demands and domain adaptation challenges limit their industry-wide applicability.

Datasets and Their Role

The paper identifies commonly used datasets like OpenSubtitles and Cornell Movie-dialogue Corpus, acknowledging their role in training open-domain models. However, the importance of closed-domain datasets tailored to specific industries is underscored, emphasizing the challenge posed by the unavailability of such domain-specific data for public use.

Evaluation Challenges

A significant observation in the survey is the absence of standardized evaluation metrics for chatbots. Existing metrics, such as BLEU and METEOR, adapted from machine translation, fail to capture dialogue-specific nuances like conversation continuity and engagement. The reliance on human evaluations poses scalability issues, indicating a pressing need for robust automated evaluation metrics.

Applications Across Domains

Chatbots find applications across diverse fields such as customer service, healthcare, education, and entertainment. The survey notes a preference for IR-based models in task-oriented domains due to their reliability in providing specific information. Meanwhile, social chatbots or conversational agents, demanding a higher degree of linguistic complexity, often leverage more sophisticated generative models.

Implications and Future Directions

The paper's recommendations underscore the necessity for developing flexible, domain-independent models that can efficiently handle both open and closed-domain conversations. Additionally, the exploration of more scalable and accurate evaluation models is crucial for advancing chatbot technology.

In conclusion, while significant progress has been made in chatbot research, the paper identifies key areas needing attention: the development of smaller, adaptable models and improvements in evaluation methodologies. Addressing these gaps could facilitate broader adoption across various sectors, expanding the functional scope of chatbots in real-world applications. As NLP and ML technologies continue to evolve, they hold the promise of making chatbots increasingly interactive, contextually aware, and seamlessly integrated into everyday digital interactions.

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.