We Don't Speak the Same Language: Interpreting Polarization through Machine Translation (2010.02339v2)

Published 5 Oct 2020 in cs.CL and cs.CY

Abstract: Polarization among US political parties, media and elites is a widely studied topic. Prominent lines of prior research across multiple disciplines have observed and analyzed growing polarization in social media. In this paper, we present a new methodology that offers a fresh perspective on interpreting polarization through the lens of machine translation. With a novel proposition that two sub-communities are speaking in two different \emph{languages}, we demonstrate that modern machine translation methods can provide a simple yet powerful and interpretable framework to understand the differences between two (or more) large-scale social media discussion data sets at the granularity of words. Via a substantial corpus of 86.6 million comments by 6.5 million users on over 200,000 news videos hosted by YouTube channels of four prominent US news networks, we demonstrate that simple word-level and phrase-level translation pairs can reveal deep insights into the current political divide -- what is \emph{black lives matter} to one can be \emph{all lives matter} to the other.

PDF Abstract

Summarize Bookmark Chat (Pro)

Authors (4)

Ashiqur R. KhudaBukhsh (29 papers)
Rupak Sarkar (11 papers)
Mark S. Kamlet (2 papers)
Tom M. Mitchell (20 papers)

Citations (43)

View on Semantic Scholar

YouTube

Show All Videos

We Don't Speak the Same Language: Interpreting Polarization through Machine Translation (2010.02339v2)

Related Papers

YouTube