2000 character limit reached
PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English (2110.12243v1)
Published 23 Oct 2021 in cs.CL
Abstract: We present the Prepositions Annotated with Supersense Tags in Reddit International English ("PASTRIE") corpus, a new dataset containing manually annotated preposition supersenses of English data from presumed speakers of four L1s: English, French, German, and Spanish. The annotations are comprehensive, covering all preposition types and tokens in the sample. Along with the corpus, we provide analysis of distributional patterns across the included L1s and a discussion of the influence of L1s on L2 preposition choice.