OpenPI2.0: An Improved Dataset for Entity Tracking in Texts (2305.14603v2)

Published 24 May 2023 in cs.CL

Abstract: Much text describes a changing world (e.g., procedures, stories, newswires), and understanding it requires tracking how entities change. An earlier dataset, OpenPI, provided crowdsourced annotations of entity state changes in text. However, a major limitation was that those annotations were free-form and did not identify salient changes, hampering model evaluation. To overcome these limitations, we present an improved dataset, OpenPI2.0, where entities and attributes are fully canonicalized and additional entity salience annotations are added. On our fairer evaluation setting, we find that current state-of-the-art LLMs are far from competent. We also show that using state changes of salient entities as a chain-of-thought prompt improves downstream performance on tasks such as question answering and classical planning, outperforming the setting that includes all related entities indiscriminately. We offer OpenPI2.0 for the continued development of models that can understand the dynamics of entities in text.
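To make the chain-of-thought idea in the abstract concrete, here is a minimal Python sketch of a prompt that lists only the state changes of salient entities before asking a downstream question. Everything in it is an assumption made for illustration: the `StateChange` record, its field names, the 1-to-5 salience scale, the `min_salience` threshold, and the sentence template are hypothetical and are not taken from the OpenPI2.0 release or the paper's prompts.

```python
from dataclasses import dataclass


@dataclass
class StateChange:
    # Hypothetical record; field names are illustrative, not the OpenPI2.0 schema.
    entity: str
    attribute: str
    before: str
    after: str
    salience: int  # assumed scale for this sketch: 1 (least) to 5 (most salient)


def build_cot_prompt(steps, changes, question, min_salience=4):
    """Build a prompt that lists only salient state changes before the question."""
    # Keep changes whose entity meets the (assumed) salience threshold.
    salient = [c for c in changes if c.salience >= min_salience]
    lines = ["Procedure:"]
    lines += [f"{i}. {step}" for i, step in enumerate(steps, start=1)]
    lines.append("Salient entity state changes:")
    lines += [
        f"- {c.attribute} of {c.entity} was {c.before} before and {c.after} afterwards"
        for c in salient
    ]
    lines.append(f"Question: {question}")
    lines.append("Answer:")
    return "\n".join(lines)


# Toy example data, invented for this sketch.
steps = ["Crack the egg into a pan.", "Heat the pan until the egg sets."]
changes = [
    StateChange("egg", "state", "raw", "cooked", salience=5),
    StateChange("pan", "temperature", "cool", "hot", salience=4),
    StateChange("stove knob", "position", "off", "on", salience=2),
]
prompt = build_cot_prompt(steps, changes, "Is the egg safe to eat?")
print(prompt)
```

Setting `min_salience=1` in this sketch would include every related entity, which corresponds to the indiscriminate setting the abstract compares against.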

GitHub

allenai/openpi-dataset: OpenPI dataset for tracking entities in open domain procedural text
