Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Investigating Bias with a Synthetic Data Generator: Empirical Evidence and Philosophical Interpretation (2209.05889v1)

Published 13 Sep 2022 in stat.ML, cs.AI, cs.CY, and cs.LG

Abstract: Machine learning applications are becoming increasingly pervasive in our society. Since these decision-making systems rely on data-driven learning, risk is that they will systematically spread the bias embedded in data. In this paper, we propose to analyze biases by introducing a framework for generating synthetic data with specific types of bias and their combinations. We delve into the nature of these biases discussing their relationship to moral and justice frameworks. Finally, we exploit our proposed synthetic data generator to perform experiments on different scenarios, with various bias combinations. We thus analyze the impact of biases on performance and fairness metrics both in non-mitigated and mitigated machine learning models.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Alessandro Castelnovo (7 papers)
  2. Riccardo Crupi (15 papers)
  3. Nicole Inverardi (2 papers)
  4. Daniele Regoli (13 papers)
  5. Andrea Cosentini (2 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.