Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Unsupervised Bias Detection in College Student Newspapers (2309.06557v1)

Published 11 Sep 2023 in cs.CL, cs.AI, and cs.LG

Abstract: This paper presents a pipeline with minimal human influence for scraping and detecting bias on college newspaper archives. This paper introduces a framework for scraping complex archive sites that automated tools fail to grab data from, and subsequently generates a dataset of 14 student papers with 23,154 entries. This data can also then be queried by keyword to calculate bias by comparing the sentiment of a LLM summary to the original article. The advantages of this approach are that it is less comparative than reconstruction bias and requires less labelled data than generating keyword sentiment. Results are calculated on politically charged words as well as control words to show how conclusions can be drawn. The complete method facilitates the extraction of nuanced insights with minimal assumptions and categorizations, paving the way for a more objective understanding of bias within student newspaper sources.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Adam M. Lehavi (4 papers)
  2. William McCormack (1 paper)
  3. Noah Kornfeld (1 paper)
  4. Solomon Glazer (1 paper)

Summary

We haven't generated a summary for this paper yet.