Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Metric Learning and Adaptive Boundary for Out-of-Domain Detection (2204.10849v1)

Published 22 Apr 2022 in cs.CL

Abstract: Conversational agents are usually designed for closed-world environments. Unfortunately, users can behave unexpectedly. Based on the open-world environment, we often encounter the situation that the training and test data are sampled from different distributions. Then, data from different distributions are called out-of-domain (OOD). A robust conversational agent needs to react to these OOD utterances adequately. Thus, the importance of robust OOD detection is emphasized. Unfortunately, collecting OOD data is a challenging task. We have designed an OOD detection algorithm independent of OOD data that outperforms a wide range of current state-of-the-art algorithms on publicly available datasets. Our algorithm is based on a simple but efficient approach of combining metric learning with adaptive decision boundary. Furthermore, compared to other algorithms, we have found that our proposed algorithm has significantly improved OOD performance in a scenario with a lower number of classes while preserving the accuracy for in-domain (IND) classes.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Petr Lorenc (8 papers)
  2. Tommaso Gargiani (2 papers)
  3. Jan Pichl (9 papers)
  4. Jakub Konrád (8 papers)
  5. Petr Marek (59 papers)
  6. Ondřej Kobza (4 papers)
  7. Jan Šedivý (11 papers)