Open Assistant Toolkit -- version 2 (2403.00586v1)
Abstract: We present the second version of the Open Assistant Toolkit (OAT-v2), an open-source task-oriented conversational system for composing generative neural models. OAT-v2 is a scalable and flexible assistant platform supporting multiple domains and modalities of user interaction. It splits the processing of a user utterance into modular system components, including submodules for action code generation, multimodal content retrieval, and knowledge-augmented response generation. Developed over multiple years of the Alexa TaskBot challenge, OAT-v2 is a proven system that enables scalable and robust experimentation in both experimental and real-world deployments. OAT-v2 provides open models and software for research and commercial applications to enable the future of multimodal virtual assistants across diverse applications and types of rich interaction.
Authors:
- Sophie Fischer
- Federico Rossetto
- Carlos Gemmell
- Andrew Ramsay
- Iain Mackie
- Philip Zubel
- Niklas Tecklenburg
- Jeffrey Dalton