Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

We Don't Speak the Same Language: Interpreting Polarization through Machine Translation (2010.02339v2)

Published 5 Oct 2020 in cs.CL and cs.CY

Abstract: Polarization among US political parties, media and elites is a widely studied topic. Prominent lines of prior research across multiple disciplines have observed and analyzed growing polarization in social media. In this paper, we present a new methodology that offers a fresh perspective on interpreting polarization through the lens of machine translation. With a novel proposition that two sub-communities are speaking in two different \emph{languages}, we demonstrate that modern machine translation methods can provide a simple yet powerful and interpretable framework to understand the differences between two (or more) large-scale social media discussion data sets at the granularity of words. Via a substantial corpus of 86.6 million comments by 6.5 million users on over 200,000 news videos hosted by YouTube channels of four prominent US news networks, we demonstrate that simple word-level and phrase-level translation pairs can reveal deep insights into the current political divide -- what is \emph{black lives matter} to one can be \emph{all lives matter} to the other.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Ashiqur R. KhudaBukhsh (29 papers)
  2. Rupak Sarkar (11 papers)
  3. Mark S. Kamlet (2 papers)
  4. Tom M. Mitchell (20 papers)
Citations (43)
Youtube Logo Streamline Icon: https://streamlinehq.com