FairEM360: A Suite for Responsible Entity Matching (2404.07354v2)
Abstract: Entity matching (EM) is one of the earliest tasks in the big data pipeline and is alarmingly exposed to unintentional biases that affect the quality of data. Identifying and mitigating the biases that exist in the data or are introduced by the matcher at this stage can promote fairness in downstream tasks. This demonstration showcases FairEM360, a framework for 1) auditing the output of entity matchers across a wide range of fairness measures and paradigms, 2) providing potential explanations for the underlying reasons for unfairness, and 3) resolving the unfairness issues through an exploratory process with human-in-the-loop feedback, utilizing an ensemble of matchers. We hope that FairEM360 will help make fairness a key consideration in the evaluation of EM pipelines.