Mechanism for Decision-aware Collaborative Federated Learning: A Pitfall of Shapley Values (2403.04753v1)
Abstract: This paper investigates mechanism design for decision-aware collaboration via federated learning (FL) platforms. Our framework consists of a digital platform and multiple decision-aware agents, each endowed with proprietary data sets. The platform offers an infrastructure that enables access to the data, creates incentives for collaborative learning aimed at operational decision-making, and conducts FL to avoid direct raw data sharing. The computation and communication efficiency of the FL process is inherently influenced by the agent participation equilibrium induced by the mechanism. Therefore, assessing the system's efficiency involves two critical factors: the surplus created by coalition formation and the communication costs incurred across the coalition during FL. To evaluate the system efficiency under the intricate interplay between mechanism design, agent participation, operational decision-making, and the performance of FL algorithms, we introduce a multi-action collaborative federated learning (MCFL) framework for decision-aware agents. Under this framework, we further analyze the equilibrium for the renowned Shapley value based mechanisms. Specifically, we examine the issue of false-name manipulation, a form of dishonest behavior where participating agents create duplicate fake identities to split their original data among these identities. By solving the agent participation equilibrium, we demonstrate that while Shapley value effectively maximizes coalition-generated surplus by encouraging full participation, it inadvertently promotes false-name manipulation. This further significantly increases the communication costs when the platform conducts FL. Thus, we highlight a significant pitfall of Shapley value based mechanisms, which implicitly incentivizes data splitting and identity duplication, ultimately impairing the overall efficiency in FL systems.
- Anily S, Haviv M (2010) Cooperation in service systems. Operations Research 58(3):660–673.
- Bergantinos G, Moreno-Ternero JD (2020) Sharing the revenues from broadcasting sport events. Management Science 66(6):2417–2431.
- Choudhury O (2023) Federated learning on aws with fedml: Health analytics without sharing sensitive data URL https://aws.amazon.com/blogs/machine-learning/federated-learning-on-aws-with-fedml-health-analytics-without-sharing-sensitive-data/, aWS Machine Learning Blog.
- Conitzer V, Yokoo M (2010) Using mechanism design to prevent false-name manipulations. AI magazine 31(4):65–78.
- Gopalakrishnan S, Sankaranarayanan S (2023) Cooperative security against interdependent risks. Production and Operations Management 32(11):3504–3520.
- Hsiao CR (2004) The power indices for multi-choice multi-valued games. Taiwanese Journal of Mathematics 8(2):259–270.
- Kemahlıoğlu-Ziya E, Bartholdi III JJ (2011) Centralizing inventory in supply chains by using shapley value to allocate the profits. Manufacturing & Service Operations Management 13(2):146–162.
- Leng M, Parlar M (2009) Allocation of cost savings in a three-level supply chain with demand information sharing: A cooperative-game approach. Operations Research 57(1):200–213.
- Mak HY, Max Shen ZJ (2021) When triple-a supply chains meet digitalization: The case of jd. com’s c2m model. Production and Operations Management 30(3):656–665.
- Mangasarian L (1995) Parallel gradient distribution in unconstrained optimization. SIAM Journal on Control and Optimization 33(6):1916–1925.
- Masters K (2019) Amazon releases free analytics that brands previously paid $30,000-plus per year for. Forbes URL https://www.forbes.com/sites/kirimasters/2019/02/12/amazon-releases-free-analytics-that-brands-previously-paid-30000-per-year-for/?sh=f48b36c44b4e, last Accessed on 19 August 2023.
- Shapley LS, et al. (1953) A value for n-person games .
- Stich SU (2018) Local sgd converges fast and communicates little. arXiv preprint arXiv:1805.09767 .