Knowledge Graph Representation for Political Information Sources (2404.03437v1)
Abstract: With the rise of computational social science, many scholars utilize data analysis and natural language processing tools to analyze social media, news articles, and other accessible data sources for examining political and social discourse. Particularly, the study of the emergence of echo-chambers due to the dissemination of specific information has become a topic of interest in mixed methods research areas. In this paper, we analyze data collected from two news portals, Breitbart News (BN) and New York Times (NYT) to prove the hypothesis that the formation of echo-chambers can be partially explained on the level of an individual information consumption rather than a collective topology of individuals' social networks. Our research findings are presented through knowledge graphs, utilizing a dataset spanning 11.5 years gathered from BN and NYT media portals. We demonstrate that the application of knowledge representation techniques to the aforementioned news streams highlights, contrary to common assumptions, shows relative "internal" neutrality of both sources and polarizing attitude towards a small fraction of entities. Additionally, we argue that such characteristics in information sources lead to fundamental disparities in audience worldviews, potentially acting as a catalyst for the formation of echo-chambers.
- Al-Tawil M. Aljarah I. Faris H. Wongthongtham P. Chan K. Y. Abu-Salih, B. and A. Beheshti. 2021. Relational learning analysis of social politics using knowledge graph embedding. Data Mining and Knowledge Discovery, 35(1):1497–1536.
- Alfred V. Aho and Jeffrey D. Ullman. 1972. The Theory of Parsing, Translation and Compiling, volume 1. Prentice-Hall, Englewood Cliffs, NJ.
- Hunt Allcott and Matthew Gentzkow. 2017. Social media and fake news in the 2016 election. Journal of economic perspectives, 31(2):211–36.
- American Psychological Association. 1983. Publications Manual. American Psychological Association, Washington, DC.
- Monica Anderson and Brooke Auxier. 2020. 55political posts and discussions. In Pew Research Center.
- Rie Kubota Ando and Tong Zhang. 2005. A framework for learning predictive structures from multiple tasks and unlabeled data. Journal of Machine Learning Research, 6:1817–1853.
- Galen Andrew and Jianfeng Gao. 2007. Scalable training of L1subscript𝐿1L_{1}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-regularized log-linear models. In Proceedings of the 24th International Conference on Machine Learning, pages 33–40.
- Sven Banisch and Eckehard Olbrich. 2019. Opinion polarization by learning from social feedback. The Journal of Mathematical Sociology, 43(2):76–103.
- Tweeteval: Unified benchmark and comparative evaluation for tweet classification. arXiv preprint arXiv:2010.12421.
- Fast unfolding of communities in large networks. Journal of statistical mechanics: theory and experiment, 2008(10):P10008.
- Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5:135–146.
- E. Borel. 1921. La theorie du jeu et les equations integrales a noyau symetrique. Comptes rendus hebdomadaires des seances de l’Academie des sciences, (173):1304–1308.
- A generalized and adaptive method for community detection.
- Alternation. Journal of the Association for Computing Machinery, 28(1):114–133.
- Opinion-aware knowledge graph for political ideology detection. In IJCAI, pages 3647–3653.
- P. Chilton. 2004. Analysing political discourse: Theory and practice. Routledge.
- David D Clare and Timothy R Levine. 2019. Documenting the truth-default: The low frequency of spontaneous unprompted veracity assessments in deception detection. Human Communication Research, 45(3):286–308.
- Echo chamber or public sphere? predicting political orientation and measuring political homophily in twitter using big data. Journal of communication, 64(2):317–332.
- Manifesto of computational social science. The European Physical Journal Special Topics, 214(1):325–346.
- James W. Cooley and John W. Tukey. 1965. An algorithm for the machine calculation of complex Fourier series. Mathematics of Computation, 19(90):297–301.
- Computational ad hominem detection. pages 203–209.
- Grounding force-directed network layouts with latent space models. Journal of Computational Social Science, pages 1–33.
- Quantifying controversy in social media. pages 33–42.
- Cross-lingual classification of topics in political texts. In Proceedings of the Second Workshop on NLP and Computational Social Science, pages 42–46.
- Using bibliometric and social media analyses to explore the “echo chamber” hypothesis. Educational Policy, 28(2):281–305.
- New platform, old habits? candidates use of twitter during the 2010 british and dutch general election campaigns. New Media & Society, 18(5):765–783.
- O. Gross and R. Wagner. 1950. A continuous colonel blotto game. RAND Research Memorandum.
- The bayesian echo chamber: Modeling social influence via linguistic accommodation. In Artificial Intelligence and Statistics, pages 315–323.
- Dan Gusfield. 1997. Algorithms on Strings, Trees and Sequences. Cambridge University Press, Cambridge, UK.
- Lisa Harris and Paul Harrigan. 2015. Social media in politics: The ultimate voter engagement tool or simply an echo chamber? Journal of Political Marketing, 14(3):251–283.
- Daniel C Hellinger. 2018. Conspiracies and conspiracy theories in the age of trump. Springer.
- Forceatlas2, a continuous graph layout algorithm for handy network visualization designed for the gephi software. PloS one, 9:e98679.
- Social media polarization and echo chambers: A case study of covid-19.
- Logical fallacy detection. page 1.
- Kristen Johnson and Dan Goldwasser. 2018. Classification of moral foundations in microblog political discourse. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 720–730.
- A. Jungherr. 2014. The logic of political coverage on twitter: Temporal dynamics and content. Journal of Communication, 64(2):239–259.
- Jonathan P Kastellec and Eduardo L Leoni. 2007. Using graphs instead of tables in political science. Perspectives on politics, pages 755–771.
- Y. Kim. 2014. Convolutional neural networks for sentence classification. ArXiv:1408.5882.
- Philipp Koehn. 2005. Europarl: A parallel corpus for statistical machine translation. In MT summit, volume 5, pages 79–86. Citeseer.
- D. Kovenock and B. Roberson. 2015. Generalizations of the general lotto and colonel blotto games. CESifo Working Paper 5291.
- Manifesto corpus. WZB Berlin Social Science Center.
- Onawa P Lacewell and Annika Werner. 2013. Coder training: Key to enhancing coding reliability and estimate validity.
- Supervised and traditional term weighting methods for automatic text categorization. IEEE transactions on pattern analysis and machine intelligence, 31(4):721–735.
- Distributive politics and electoral competition. Journal of Economic Theory, (103):106–130.
- The science of fake news. Science, 359(6380):1094–1096.
- Manifesto corpus 2017-1. WZB Berlin Social Science Center.
- Timothy R Levine. 2014. Truth-default theory (tdt) a theory of human deception and deception detection. Journal of Language and Social Psychology, 33(4):378–392.
- Steven Loria et al. 2018. textblob documentation. Release 0.15, 2(8):269.
- Openord: An open-source toolbox for large graph layout. Proc SPIE, 7868:786806.
- The manifesto corpus: A new resource for research on political parties and quantitative text analysis. Research & Politics, 3(2):2053168016643346.
- Coder reliability and misclassification in the human coding of party manifestos. Political Analysis, 20(1):78–91.
- R.B. Myerson. 1993. Incentives to cultivate minorities under alternative electoral systems. American Political Science Review, 87:856–869.
- Transformer based deep intelligent contextual embedding for twitter sentiment analysis. Future Generation Computer Systems, 113.
- The dynamics of public attention: Agenda setting theory meets big data. Journal of Communication, 64(2):193–214.
- Sri Nurdiati and Cornelis Hoede. 2008. 25 years development of knowledge graph theory: the results and the challenge. Memorandum, 1876(2):1–10.
- A. Osorio. 2013. The lottery blotto game. Economics Letters, 120(2):164–166.
- I. Parker. 2014. Discourse Dynamics (Psychology Revivals): Critical Analysis for Social and Individual Psychology. Routledge.
- Tiago P Peixoto. 2020. Latent poisson models for networks with heterogeneous density. Physical Review E, 102(1):012309.
- Semi-supervised sequence tagging with bidirectional language models. ArXiv, abs/1705.00108.
- Fewer people, more flames: How pre-existing beliefs and volume of negative comments impact online news readers’ verbal aggression. Telematics and Informatics, 56:101471.
- Mohammad Sadegh Rasooli and Joel R. Tetreault. 2015. Yara parser: A fast and accurate dependency parser. Computing Research Repository, arXiv:1503.06733. Version 2.
- Text classification for monolingual political manifestos with words out of vocabulary. In COMPLEXIS, pages 149–154.
- B. Roberson. 2006a. The colonel blotto game. Economic Theory, 29(1):1–24.
- B. Roberson. 2006b. Pork-barrel politics, discriminatory policies and fiscal federalism. Social Science Research Center Berlin (WZB).
- Richard Rogers. 2013. Digital methods. MIT press.
- The closed loop between opinion formation and personalised recommendations.
- Martin Rosvall and Carl T Bergstrom. 2008. Maps of random walks on complex networks reveal community structure. Proceedings of the national academy of sciences, 105(4):1118–1123.
- Big data, digital media, and computational social science: Possibilities and perils. The ANNALS of the American Academy of Political and Social Science, 659(1):6–13.
- Network structure and patterns of information diversity on twitter. MIS Quarterly, 42.
- Fake news detection on social media: A data mining perspective. ACM SIGKDD explorations newsletter, 19(1):22–36.
- Supervised open information extraction. In NAACL-HLT.
- Hierarchical structured model for fine-to-coarse manifesto text analysis. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 1964–1974.
- Joint sentence-document model for manifesto text analysis. In Proceedings of the Australasian Language Technology Association Workshop 2017, pages 25–33.
- Claimskg: a knowledge graph of fact-checked claims. In International Semantic Web Conference, pages 309–324. Springer.
- Get out the vote: Determining support or opposition from congressional floor-debate transcripts. arXiv preprint cs/0607062.
- Garbage in, garbage out: can statisticians quantify the effects of poor data? Chance, 7(2):20–27.
- Automatic thematic classification of election manifestos. Information Processing & Management, 50(4):554–567.
- Deep reasoning with knowledge graph for social relationship understanding. In Proceedings of the 27th International Joint Conference on Artificial Intelligence, pages 1021–1028.
- Network analysis and political science. Annual Review of Political Science, 14:245–264.
- A. Washburn. 2013. Blotto politics. Operations Research, 61(3):532–543.
- Ivan P Yamshchikov and Sharwin Rezagholi. 2019. Elephants, donkeys, and colonel blotto. In COMPLEXIS, pages 113–119.
- Classifying topics and detecting topic shifts in political manifestos.