A Survey on Out-of-Distribution Detection in NLP (2305.03236v2)
Abstract: Out-of-distribution (OOD) detection is essential for the reliable and safe deployment of machine learning systems in the real world. Great progress has been made over the past years. This paper presents the first review of recent advances in OOD detection with a particular focus on natural language processing approaches. First, we provide a formal definition of OOD detection and discuss several related fields. We then categorize recent algorithms into three classes according to the data they used: (1) OOD data available, (2) OOD data unavailable + in-distribution (ID) label available, and (3) OOD data unavailable + ID label unavailable. Third, we introduce datasets, applications, and metrics. Finally, we summarize existing work and present potential future research topics.
- Survey on anomaly detection using data mining techniques. Procedia Computer Science, 2015.
- Concrete problems in ai safety. arXiv preprint arXiv:1606.06565, 2016.
- Types of out-of-distribution texts and how to detect them. In EMNLP, 2021.
- Feed two birds with one scone: Exploiting wild data for both out-of-distribution generalization and detection. In ICML, 2023.
- Towards open world recognition. In CVPR, 2015.
- Towards open set deep networks. In CVPR, 2016.
- Experience report: Log mining using natural language processing and application to anomaly detection. In ISSRE, 2017.
- Domain adaptation with structural correspondence learning. In EMNLP, 2006.
- On the opportunities and risks of foundation models. arXiv preprint arXiv:2108.07258, 2021.
- Lof: identifying density-based local outliers. In SIGMOD, 2000.
- Language models are few-shot learners. In NeurIPS, 2020.
- Efficient intent detection with dual sentence encoders. In Proceedings of the 2nd Workshop on Natural Language Processing for Conversational AI, 2020.
- Deep learning for anomaly detection: A survey. arXiv preprint arXiv:1901.03407, 2019.
- Gold: Improving out-of-scope detection in dialogues using data augmentation. In EMNLP, 2021.
- Enhanced lstm for natural language inference. In ACL, 2017.
- Holistic sentence embeddings for better out-of-distribution detection. In EMNLP, 2022.
- Enhancing out-of-distribution detection in natural language understanding via implicit layer ensemble. In EMNLP, 2022.
- Outflip: Generating examples for unknown intent detection with natural language attack. In ACL-IJCNLP, 2021.
- Chataug: Leveraging chatgpt for text data augmentation. arXiv preprint arXiv:2302.13007, 2023a.
- Exploring large language models for multi-modal out-of-distribution detection. In EMNLP, 2023b.
- On the effects of transformer size on in-and out-of-domain calibration. In EMNLP, 2021.
- The relationship between precision-recall and roc curves. In ICML, 2006.
- Calibration of pre-trained transformers. In EMNLP, 2020.
- Unknown-aware object detection: Learning what you don’t know from videos in the wild. In CVPR, 2022a.
- Vos: Learning what you don’t know by virtual outlier synthesis. In ICLR, 2022b.
- Dream the impossible: Outlier imagination with diffusion models. In NeurIPS, 2023.
- Barle: Background-aware representation learning for background shift out-of-distribution detection. In EMNLP, 2022.
- Is out-of-distribution detection learnable? In NeurIPS, 2022.
- Breaking the closed world assumption in text classification. In NAACL, 2016.
- Dropout as a bayesian approximation: Representing model uncertainty in deep learning. In ICML, 2016.
- Likelihood ratios and generative classifiers for unsupervised out-of-domain detection in task oriented dialog. In AAAI, 2020.
- Selective classification for deep neural networks. In NeurIPS, 2017.
- Generative adversarial networks. Communications of the ACM, 63(11), 2020.
- Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572, 2014.
- Recent advances in convolutional neural networks. Pattern recognition, 77, 2018.
- Statistical analysis of nearest neighbor methods for anomaly detection. In NeurIPS, 2019.
- Supervised contrastive learning for pre-trained language model fine-tuning. In ICLR, 2021.
- Out-of-distribution detection in unsupervised continual learning. In CVPR, 2022.
- A baseline for detecting misclassified and out-of-distribution examples in neural networks. In ICLR, 2017.
- Deep anomaly detection with outlier exposure. In ICLR, 2019a.
- Using self-supervised learning can improve model robustness and uncertainty. In NeurIPS, 2019b.
- Pretrained transformers improve out-of-distribution robustness. In ACL, 2020.
- Long short-term memory. Neural computation, 9(8), 1997.
- Selective question answering under domain shift. In ACL, 2020.
- Jacob Devlin Ming-Wei Chang Kenton and Lee Kristina Toutanova. Bert: Pre-training of deep bidirectional transformers for language understanding. In NAACL-HLT, 2019.
- Supervised contrastive learning. In NeurIPS, 2020.
- Continual learning based on ood detection and task masking. In CVPR, 2022.
- Pseudo outlier exposure for out-of-distribution detection using pretrained transformers. In ACL, 2023.
- Joint learning of domain classification and out-of-domain detection with dynamic class weighting for satisficing false acceptance rates. arXiv preprint arXiv:1807.00072, 2018.
- Wilds: A benchmark of in-the-wild distribution shifts. In ICML, 2021.
- Internet-augmented dialogue generation. In ACL, 2022.
- Calibrated language model fine-tuning for in-and out-of-distribution data. In EMNLP, 2020.
- Calibration of encoder decoder models for neural machine translation. arXiv preprint arXiv:1903.00802, 2019.
- Simple and scalable predictive uncertainty estimation using deep ensembles. In NeurIPS, 2017.
- Estimating soft labels for out-of-domain intent detection. In EMNLP, 2022.
- Out-of-domain intent detection considering multi-turn dialogue contexts. arXiv preprint arXiv:2305.03237, 2023.
- An evaluation dataset for intent classification and out-of-scope prediction. In EMNLP-IJCNLP, 2019.
- Training confidence-calibrated classifiers for detecting out-of-distribution samples. In ICLR, 2018a.
- A simple unified framework for detecting out-of-distribution samples and adversarial attacks. In NeurIPS, 2018b.
- Learning to classify texts using positive and unlabeled data. In IJCAI, 2003.
- kfolden: k-fold ensemble for out-of-distribution detection-fold ensemble for out-of-distribution detection. In EMNLP, 2021.
- Enhancing the reliability of out-of-distribution image detection in neural networks. In ICLR, 2018.
- Deep unknown intent detection with margin loss. In ACL, 2019.
- Lifelong and continual learning dialogue systems: learning during conversation. In AAAI, 2021.
- Energy-based out-of-distribution detection. In NeurIPS, 2020.
- Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692, 2019.
- Detecting compositionally out-of-distribution examples in semantic parsing. In EMNLP, 2021.
- Foundations of statistical natural language processing. MIT press, 1999.
- Oodgan: Generative adversarial network for out-of-domain data generation. In NAACL, 2021.
- Visual classification via description from large language models. In ICLR, 2023.
- How does fine-tuning impact out-of-distribution detection for vision-language models? International Journal of Computer Vision, 2023.
- Delving into out-of-distribution detection with vision-language representations. In NeurIPS, 2022.
- Provable guarantees for understanding out-of-distribution detection. In AAAI, 2022.
- Star: A schema-guided dialog dataset for transfer learning. arXiv preprint arXiv:2010.11853, 2020.
- Uninl: Aligning representation learning with scoring function for ood detection via unified neighborhood learning. In EMNLP, 2022.
- A framework for anomaly detection using language modeling, and its applications to finance. arXiv preprint arXiv:1908.09156, 2019.
- Energy-based unknown intent detection with data manipulation. In ACL-IJCNLP, 2021.
- On prefix-tuning for lightweight out-of-distribution detection. In ACL, 2023.
- Can you trust your model’s uncertainty? evaluating predictive uncertainty under dataset shift. In NeurIPS, 2019.
- Deep learning for anomaly detection: A review. ACM Computing Surveys (CSUR), 54(2), 2021.
- An empirical analysis of formality in online communication. TACL, 2016.
- Language models as knowledge bases? In EMNLP-IJCNLP, 2019.
- Revisiting mahalanobis distance for transformer-based out-of-domain detection. In AAAI, 2021.
- Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of Machine Learning Research, 2020.
- Neural unsupervised domain adaptation in nlp—a survey. In COLING, 2020.
- Pnpood: Out-of-distribution detection for text classification via plug andplay data augmentation. arXiv preprint arXiv:2111.00506, 2021.
- Likelihood ratios for out-of-distribution detection. In NeurIPS, 2019.
- Out-of-distribution detection and selective generation for conditional language models. In ICLR, 2023.
- High-resolution image synthesis with latent diffusion models. In CVPR, 2022.
- Transfer learning in natural language processing. In NAACL: Tutorials, June 2019.
- Neural sentence embedding using only in-domain sentences for out-of-domain sentence detection in dialog systems. Pattern Recognition Letters, 2017.
- Out-of-domain detection based on generative adversarial network. In EMNLP, 2018.
- Probability models for open set recognition. TPAMI, 2014.
- Cross-lingual transfer learning for multilingual task oriented dialog. In NAACL, 2019.
- Enhancing the generalization for intent classification and out-of-domain detection in slu. In ACL, 2021.
- Doc: Deep open classification of text documents. In EMNLP, 2017.
- Odist: Open world classification via distributionally shifted instances. In EMNLP, 2021.
- Prompting gpt-3 to be reliable. In ICLR, 2023.
- Out-of-distribution detection with deep nearest neighbors. In ICML, 2022.
- Out-of-domain detection for low-resource text classification tasks. In EMNLP-IJCNLP, 2019.
- Non-parametric outlier synthesis. In ICLR, 2023.
- Is fine-tuning needed? pre-trained language models are near perfect for out-of-domain detection. In ACL, 2023.
- Vladimir Vapnik. Principles of risk minimization for learning theory. In NeurIPS, 1991.
- Investigating selective prediction approaches across several tasks in IID, OOD, and adversarial settings. In ACL, 2022.
- Attention is all you need. In NeurIPS, 2017.
- Efficient out-of-domain detection for sequence to sequence models. In ACL, 2023.
- Manifold mixup: Better representations by interpolating hidden states. In ICML, 2019.
- A perspective view and survey of meta-learning. Artificial intelligence review, 18(2), 2002.
- Out-of-distribution detection using an ensemble of self supervised leave-out classifiers. In ECCV, 2018.
- Generalizing to unseen domains: A survey on domain generalization. TKDE, 2022.
- A survey of zero-shot learning: Settings, methods, and applications. TIST, 2019.
- Multi-level knowledge distillation for out-of-distribution detection in text. In ACL, 2023.
- Revisit overconfidence for OOD detection: Reassigned contrastive learning with adaptive class-dependent threshold. In NAACL, 2022.
- The art of abstention: Selective prediction and error regularization for natural language processing. In ACL, 2021.
- Open-world learning and application to product classification. In WWW, 2019.
- Short text clustering via convolutional neural networks. In Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing, 2015.
- Unsupervised out-of-domain detection via pre-trained transformers. In ACL, 2021.
- A survey on log anomaly detection using deep learning. In ICRITO, 2020.
- Unknown intent detection using gaussian mixture model with an application to zero-shot intent classification. In ACL, 2020.
- Generalized out-of-distribution detection: A survey. arXiv preprint arXiv:2110.11334, 2021.
- Openood: Benchmarking generalized out-of-distribution detection. In NeurIPS, 2022.
- Seqgan: Sequence generative adversarial nets with policy gradient. In AAAI, 2017.
- Modeling discriminative representations for out-of-domain detection with supervised contrastive learning. In ACL, 2021a.
- Adversarial generative distance-based classifier for robust out-of-domain detection. In ICASSP, 2021b.
- Out-of-scope intent detection with self-supervision and discriminative training. In ACL, 2021.
- Learning from positive and unlabeled examples: A survey. In International Symposiums on Information Processing, 2008.
- Deep open intent classification with adaptive decision boundary. In AAAI, 2021.
- mixup: Beyond empirical risk minimization. In ICLR, 2018.
- Pegasus: Pre-training with extracted gap-sentences for abstractive summarization. In ICML, 2020.
- Out-of-domain detection for natural language understanding in dialog systems. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020.
- Learning placeholders for open-set recognition. In CVPR, 2021a.
- Domain generalization: A survey. TPAMI, 2022a.
- Contrastive out-of-distribution detection for pretrained transformers. In EMNLP, 2021b.
- KNN-contrastive learning for out-of-domain intent classification. In ACL, 2022b.
- Two birds one stone: Dynamic ensemble for ood intent classification. In ACL, 2023.
- Deep autoencoding gaussian mixture model for unsupervised anomaly detection. In ICLR, 2018.
- Hao Lang (10 papers)
- Yinhe Zheng (30 papers)
- Yixuan Li (183 papers)
- Jian Sun (415 papers)
- Fei Huang (409 papers)
- Yongbin Li (128 papers)