Improving negation detection with negation-focused pre-training (2205.04012v1)
Abstract: Negation is a common linguistic feature that is crucial in many language understanding tasks, yet it remains a hard problem due to diversity in its expression in different types of text. Recent work has shown that state-of-the-art NLP models underperform on samples containing negation in various tasks, and that negation detection models do not transfer well across domains. We propose a new negation-focused pre-training strategy, involving targeted data augmentation and negation masking, to better incorporate negation information into LLMs. Extensive experiments on common benchmarks show that our proposed approach improves negation detection performance and generalizability over the strong baseline NegBERT (Khandewal and Sawant, 2020).
- Thinh Hung Truong (9 papers)
- Timothy Baldwin (125 papers)
- Trevor Cohn (105 papers)
- Karin Verspoor (34 papers)