Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

An Investigation Into Race Bias in Random Forest Models Based on Breast DCE-MRI Derived Radiomics Features (2309.17197v1)

Published 29 Sep 2023 in cs.LG, cs.AI, and cs.CV

Abstract: Recent research has shown that AI models can exhibit bias in performance when trained using data that are imbalanced by protected attribute(s). Most work to date has focused on deep learning models, but classical AI techniques that make use of hand-crafted features may also be susceptible to such bias. In this paper we investigate the potential for race bias in random forest (RF) models trained using radiomics features. Our application is prediction of tumour molecular subtype from dynamic contrast enhanced magnetic resonance imaging (DCE-MRI) of breast cancer patients. Our results show that radiomics features derived from DCE-MRI data do contain race-identifiable information, and that RF models can be trained to predict White and Black race from these data with 60-70% accuracy, depending on the subset of features used. Furthermore, RF models trained to predict tumour molecular subtype using race-imbalanced data seem to produce biased behaviour, exhibiting better performance on test data from the race on which they were trained.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Mohamed Huti (1 paper)
  2. Tiarna Lee (6 papers)
  3. Elinor Sawyer (1 paper)
  4. Andrew P. King (56 papers)
Citations (1)