Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

HetVis: A Visual Analysis Approach for Identifying Data Heterogeneity in Horizontal Federated Learning (2208.07491v1)

Published 16 Aug 2022 in cs.HC

Abstract: Horizontal federated learning (HFL) enables distributed clients to train a shared model and keep their data privacy. In training high-quality HFL models, the data heterogeneity among clients is one of the major concerns. However, due to the security issue and the complexity of deep learning models, it is challenging to investigate data heterogeneity across different clients. To address this issue, based on a requirement analysis we developed a visual analytics tool, HetVis, for participating clients to explore data heterogeneity. We identify data heterogeneity through comparing prediction behaviors of the global federated model and the stand-alone model trained with local data. Then, a context-aware clustering of the inconsistent records is done, to provide a summary of data heterogeneity. Combining with the proposed comparison techniques, we develop a novel set of visualizations to identify heterogeneity issues in HFL. We designed three case studies to introduce how HetVis can assist client analysts in understanding different types of heterogeneity issues. Expert reviews and a comparative study demonstrate the effectiveness of HetVis.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Xumeng Wang (9 papers)
  2. Wei Chen (1290 papers)
  3. Jiazhi Xia (18 papers)
  4. Zhen Wen (13 papers)
  5. Rongchen Zhu (2 papers)
  6. Tobias Schreck (13 papers)
Citations (17)

Summary

We haven't generated a summary for this paper yet.