Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
125 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

FairEM360: A Suite for Responsible Entity Matching (2404.07354v2)

Published 10 Apr 2024 in cs.DB, cs.CY, and cs.LG

Abstract: Entity matching is one the earliest tasks that occur in the big data pipeline and is alarmingly exposed to unintentional biases that affect the quality of data. Identifying and mitigating the biases that exist in the data or are introduced by the matcher at this stage can contribute to promoting fairness in downstream tasks. This demonstration showcases FairEM360, a framework for 1) auditing the output of entity matchers across a wide range of fairness measures and paradigms, 2) providing potential explanations for the underlying reasons for unfairness, and 3) providing resolutions for the unfairness issues through an exploratory process with human-in-the-loop feedback, utilizing an ensemble of matchers. We aspire for FairEM360 to contribute to the prioritization of fairness as a key consideration in the evaluation of EM pipelines.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (9)
  1. Hierarchical matching network for heterogeneous entity resolution. In IJCAI. 3665–3671.
  2. Pradap Venkatramanan Konda. 2018. Magellan: Toward building entity matching management systems. The University of Wisconsin-Madison.
  3. Deep entity matching with pre-trained language models. PVLDB 14, 1 (2020).
  4. Christoph Molnar. 2020. Interpretable machine learning. Lulu. com.
  5. Deep learning for entity matching: A design space exploration. In SIGMOD. 19–34.
  6. Through the Fairness Lens: Experimental Analysis and Evaluation of Entity Matching. PVLDB 16, 11 (2023), 3279–3292.
  7. Representation bias in data: a survey on identification and resolution techniques. CSUR (2023).
  8. Fairness-aware Data Preparation for Entity Matching. ICDE (2024).
  9. Multi-context attention for entity matching. In TheWebConf. 2634–2640.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com