Television Discourse Decoded: Comprehensive Multimodal Analytics at Scale (2402.12629v2)
Abstract: In this paper, we tackle the complex task of analyzing televised debates, focusing on a prime-time news debate show from India. Previous methods, which often relied solely on text, fall short of capturing the multimodal nature of these debates. To address this gap, we introduce a comprehensive automated toolkit for large-scale multimedia analysis. Using state-of-the-art computer vision and speech-to-text methods, we transcribe, diarize, and analyze thousands of YouTube videos of the show. These debates are a central part of Indian media but have been criticized for compromised journalistic integrity and excessive dramatization. Our toolkit provides concrete metrics for assessing bias and incivility across text, audio utterances, and video frames. Our findings reveal significant biases in topic selection and panelist representation, along with alarming levels of incivility. This work offers a scalable, automated approach for future research in multimedia analysis, with implications for the quality of public discourse and democratic debate. To catalyze further research in this area, we also release the code, the collected dataset, and a supplemental PDF.