Still out there: Modeling and Identifying Russian Troll Accounts on Twitter (1901.11162v1)

Published 31 Jan 2019 in cs.SI and cs.CY

Abstract: There is evidence that Russia's Internet Research Agency attempted to interfere with the 2016 U.S. election by running fake accounts on Twitter, often referred to as "Russian trolls". In this work, we: 1) develop machine learning models that predict whether a Twitter account is a Russian troll within a set of 170K control accounts; and, 2) demonstrate that it is possible to use this model to find active accounts on Twitter still likely acting on behalf of the Russian state. Using both behavioral and linguistic features, we show that it is possible to distinguish between a troll and a non-troll with a precision of 78.5% and an AUC of 98.9%, under cross-validation. Applying the model to out-of-sample accounts still active today, we find that up to 2.6% of top journalists' mentions are occupied by Russian trolls. These findings imply that the Russian trolls are very likely still active today. Additional analysis shows that they are not merely software-controlled bots, and manage their online identities in various complex ways. Finally, we argue that if it is possible to discover these accounts using externally accessible data, then the platforms, with access to a variety of private internal signals, should succeed at similar or better rates.
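The abstract reports precision and AUC side by side, so a minimal sketch of how the two metrics are computed may help in reading those numbers. The toy labels, scores, and 0.5 threshold below are invented for illustration; they are not the paper's data, features, or model, and the helper names are hypothetical.

```python
def precision(labels, predictions):
    """Fraction of accounts flagged as trolls that really are trolls."""
    true_pos = sum(1 for y, p in zip(labels, predictions) if y == 1 and p == 1)
    flagged = sum(predictions)
    return true_pos / flagged if flagged else 0.0


def auc(labels, scores):
    """Probability that a random troll outscores a random control account.

    Ties count as half a win (the pairwise definition of ROC AUC).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = troll, 0 = control account.
labels = [1, 1, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.4, 0.6, 0.3, 0.2, 0.1, 0.05, 0.01]

# Assumed decision threshold; the paper's operating point is not stated here.
threshold = 0.5
predictions = [1 if s >= threshold else 0 for s in scores]

toy_precision = precision(labels, predictions)
toy_auc = auc(labels, scores)
print(f"precision={toy_precision:.3f}  auc={toy_auc:.3f}")
```

In this toy set the ranking is nearly perfect, yet a single high-scoring control account halves the precision at the chosen threshold. When positives are rare, as with trolls among a much larger control set, a gap of this kind between AUC and precision is not unusual.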

Authors (9)

Jane Im (5 papers)
Eshwar Chandrasekharan (16 papers)
Jackson Sargent (4 papers)
Paige Lighthammer (1 paper)
Taylor Denby (1 paper)
Ankit Bhargava (1 paper)
Libby Hemphill (33 papers)
David Jurgens (69 papers)
Eric Gilbert (20 papers)

Citations (93)
