Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Still out there: Modeling and Identifying Russian Troll Accounts on Twitter (1901.11162v1)

Published 31 Jan 2019 in cs.SI and cs.CY

Abstract: There is evidence that Russia's Internet Research Agency attempted to interfere with the 2016 U.S. election by running fake accounts on Twitter - often referred to as "Russian trolls". In this work, we: 1) develop machine learning models that predict whether a Twitter account is a Russian troll within a set of 170K control accounts; and, 2) demonstrate that it is possible to use this model to find active accounts on Twitter still likely acting on behalf of the Russian state. Using both behavioral and linguistic features, we show that it is possible to distinguish between a troll and a non-troll with a precision of 78.5% and an AUC of 98.9%, under cross-validation. Applying the model to out-of-sample accounts still active today, we find that up to 2.6% of top journalists' mentions are occupied by Russian trolls. These findings imply that the Russian trolls are very likely still active today. Additional analysis shows that they are not merely software-controlled bots, and manage their online identities in various complex ways. Finally, we argue that if it is possible to discover these accounts using externally - accessible data, then the platforms - with access to a variety of private internal signals - should succeed at similar or better rates.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Jane Im (5 papers)
  2. Eshwar Chandrasekharan (16 papers)
  3. Jackson Sargent (4 papers)
  4. Paige Lighthammer (1 paper)
  5. Taylor Denby (1 paper)
  6. Ankit Bhargava (1 paper)
  7. Libby Hemphill (33 papers)
  8. David Jurgens (69 papers)
  9. Eric Gilbert (20 papers)
Citations (93)