Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Equilibria of Data Marketplaces with Privacy-Aware Sellers under Endogenous Privacy Costs (2402.08826v1)

Published 13 Feb 2024 in cs.GT

Abstract: We study a two-sided online data ecosystem comprised of an online platform, users on the platform, and downstream learners or data buyers. The learners can buy user data on the platform (to run a statistic or machine learning task). Potential users decide whether to join by looking at the trade-off between i) their benefit from joining the platform and interacting with other users and ii) the privacy costs they incur from sharing their data. First, we introduce a novel modeling element for two-sided data platforms: the privacy costs of the users are endogenous and depend on how much of their data is purchased by the downstream learners. Then, we characterize marketplace equilibria in certain simple settings. In particular, we provide a full characterization in two variants of our model that correspond to different utility functions for the users: i) when each user gets a constant benefit for participating in the platform and ii) when each user's benefit is linearly increasing in the number of other users that participate. In both variants, equilibria in our setting are significantly different from equilibria when privacy costs are exogenous and fixed, highlighting the importance of taking endogeneity in the privacy costs into account. Finally, we provide simulations and semi-synthetic experiments to extend our results to more general assumptions. We experiment with different distributions of users' privacy costs and different functional forms of the users' utilities for joining the platform.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (28)
  1. Low-cost learning via active data procurement. In Proceedings of the Sixteenth ACM Conference on Economics and Computation, pages 619–636, 2015.
  2. A. Acquisti and J. Grossklags. Privacy and rationality in individual decision making. IEEE Security & Privacy, 3(1):26–33, 2005.
  3. A marketplace for data: An algorithmic solution. In Proceedings of the 2019 ACM Conference on Economics and Computation, pages 701–726, 2019.
  4. Americans and privacy: Concerned, confused and feeling lack of control over their personal information. https://www.pewresearch.org/internet/2019/11/15/americans-and-privacy-concerned-confused-and-feeling-lack-of-control-over-their-personal-information/, 2019. Pew Research.
  5. Markets for information: An introduction. Annual Review of Economics, 11:85–107, 2019.
  6. The design and price of information. American economic review, 108(1):1–48, 2018.
  7. Information design: A unified perspective. Journal of Economic Literature, 57(1):44–95, 2019.
  8. Optimum statistical estimation with strategic data sources. In Conference on Learning Theory, pages 280–296, 2015.
  9. Optimal data acquisition for statistical estimation. In Proceedings of the 2018 ACM Conference on Economics and Computation, pages 27–44, 2018.
  10. Prior-free data acquisition for accurate statistical estimation. In Proceedings of the 2019 ACM Conference on Economics and Computation, pages 659–677, 2019.
  11. Optimal data acquisition with privacy-aware agents. In 2023 IEEE Conference on Secure and Trustworthy Machine Learning (SaTML), pages 210–224. IEEE, 2023.
  12. Accuracy for sale: Aggregating data with a variance constraint. In Proceedings of the 2015 conference on innovations in theoretical computer science, pages 317–324, 2015.
  13. Incentive compatible regression learning. Journal of Computer and System Sciences, 76(8):759–777, 2010.
  14. The algorithmic foundations of differential privacy. Foundations and Trends® in Theoretical Computer Science, 9(3–4):211–407, 2014.
  15. Optimal and differentially private data acquisition: Central and local mechanisms. Operations Research, 2023.
  16. Brian Fung. Tax prep companies shared private taxpayer data with google and meta for years, congressional probe finds. https://edition.cnn.com/2023/07/12/tech/tax-prep-companies-taxpayer-data-google-meta/index.html, 2023. CNN News.
  17. Selling privacy at auction. In Proceedings of the 12th ACM conference on Electronic commerce, pages 199–208, 2011.
  18. Pricing private data. Electronic Markets, 25:109–123, 2015.
  19. Conservative or liberal? personalized differential privacy. In 2015 IEEE 31st International Conference on Data Engineering, pages 1023–1034, 2015.
  20. Learning to incentivize: Eliciting effort via output agreement. arXiv preprint arXiv:1604.04928, 2016.
  21. Sequential peer prediction: Learning to elicit effort using posted prices. In Thirty-First AAAI Conference on Artificial Intelligence, 2017.
  22. Surrogate scoring rules and a dominant truth serum for information elicitation. arXiv preprint arXiv:1802.09158, 2018.
  23. Shiona McCallum. Meta settles cambridge analytica scandal case for $725m. https://www.bbc.com/news/technology-64075067, 2022.
  24. Algorithms for strategyproof classification. Artificial Intelligence, 186:123–156, 2012.
  25. Strategyproof classification. ACM SIGecom Exchanges, 10(3):21–25, 2011.
  26. Paypal. List of third parties (other than paypal customers) with whom personal information may be shared. https://www.paypal.com/ie/legalhub/third-parties-list, 2023.
  27. Juan Perote and J Perote-Pena. The impossibility of strategy-proof clustering. Economics Bulletin, 4(23):1–9, 2003.
  28. Conducting truthful surveys, cheaply. In Proceedings of the 13th ACM Conference on Electronic Commerce, pages 826–843, 2012.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com