A ripple in time: a discontinuity in American history (2312.01185v6)
Published 2 Dec 2023 in cs.CL, cs.AI, cs.LG, and cs.SI
Abstract: In this technical note we suggest a novel approach to discovering temporal (both related and unrelated to language dilation) and personality (authorship attribution) aspects in historical datasets. We exemplify our approach on the State of the Union addresses given by the past 42 US presidents: this dataset is known for its relatively small amount of data and for the high variability in the size and style of its texts. Nevertheless, we manage to achieve about 95% accuracy on the authorship attribution task, and to pin down the date of writing to a single presidential term.
- Alexander Kolpakov (49 papers)
- Igor Rivin (29 papers)