Probing What Different NLP Tasks Teach Machines about Function Word Comprehension (1904.11544v2)

Published 25 Apr 2019 in cs.CL

Abstract: We introduce a set of nine challenge tasks that test for the understanding of function words. These tasks are created by structurally mutating sentences from existing datasets to target the comprehension of specific types of function words (e.g., prepositions, wh-words). Using these probing tasks, we explore the effects of various pretraining objectives for sentence encoders (e.g., language modeling, CCG supertagging and natural language inference (NLI)) on the learned representations. Our results show that pretraining on language modeling performs the best on average across our probing tasks, supporting its widespread use for pretraining state-of-the-art NLP models, and CCG supertagging and NLI pretraining perform comparably. Overall, no pretraining objective dominates across the board, and our function word probing tasks highlight several intuitive differences between pretraining objectives, e.g., that NLI helps the comprehension of negation.
