A Polynomial-Time Approximation for Pairwise Fair $k$-Median Clustering (2405.10378v2)
Abstract: In this work, we study pairwise fair clustering with $\ell \ge 2$ groups, where for every cluster $C$ and every group $i \in [\ell]$, the number of points in $C$ from group $i$ must be at most $t$ times the number of points in $C$ from any other group $j \in [\ell]$, for a given integer $t$. To the best of our knowledge, only bi-criteria approximation and exponential-time algorithms follow for this problem from the prior work on fair clustering problems when $\ell > 2$. In our work, focusing on the $\ell > 2$ case, we design the first polynomial-time $O(k2\cdot \ell \cdot t)$-approximation for this problem with $k$-median cost that does not violate the fairness constraints. We complement our algorithmic result by providing hardness of approximation results, which show that our problem even when $\ell=2$ is almost as hard as the popular uniform capacitated $k$-median, for which no polynomial-time algorithm with an approximation factor of $o(\log k)$ is known.
- Constant approximation for individual preference stable clustering. In Advances in Neural Information Processing Systems (NeurIPS), 2023.
- Fair clustering via equitable group representations. In Proceedings of the Conference on Fairness, Accountability, and Transparency (FAccT), page 504–514, 2021.
- Constant-factor FPT approximation for capacitated k𝑘kitalic_k-median. In Annual European Symposium on Algorithms (ESA), volume 144 of LIPIcs, pages 1:1–1:14, 2019.
- Individual preference stability for clustering. In International Conference on Machine Learning (ICML), 2022.
- Clustering without over-representation. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 267–275, 2019.
- Centrality of trees for capacitated k-center. Math. Program., 154(1-2):29–53, 2015.
- A technique for obtaining true approximations for k𝑘kitalic_k-center with covering constraints. In International conference on integer programming and combinatorial optimization, pages 52–65. Springer, 2020.
- Scalable fair clustering. In International Conference on Machine Learning, pages 405–413, 2019.
- On coresets for fair clustering in metric and euclidean spaces and their applications. In 48th International Colloquium on Automata, Languages, and Programming (ICALP), volume 198 of LIPIcs, pages 23:1–23:15, 2021.
- A constant approximation for colorful k𝑘kitalic_k-center. In 27th Annual European Symposium on Algorithms (ESA 2019). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, 2019.
- Fair algorithms for clustering. In Advances in Neural Information Processing Systems, pages 4954–4965, 2019.
- On the cost of essentially fair clusterings. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM), 2019.
- Fair clustering with multiple colors. arXiv preprint arXiv:2002.07892, 2020.
- Fairness, semi-supervised learning, and more: A general framework for clustering with stochastic pairwise constraints. In Proc. Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI), 2021.
- A pairwise fair and community-preserving approach to k𝑘kitalic_k-center clustering. In International Conference on Machine Learning (ICML), pages 1178–1189, 2020.
- Matroid and knapsack center problems. Algorithmica, 75(1):27–52, 2016.
- Proportionally fair clustering. In International Conference on Machine Learning, pages 1032–1041, 2019.
- Fair clustering through fairlets. In Advances in Neural Information Processing Systems, pages 5029–5037, 2017.
- How to solve fair k𝑘kitalic_k-center in massive data models. In Proceedings of the International Conference on Machine Learning (ICML), pages 1877–1886, 2020.
- Approximating fair clustering with cascaded norm objectives. In Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 2664–2683, 2022.
- Fair representation clustering with several protected classes. In Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency, pages 814–823, 2022.
- The hardness of 3333-uniform hypergraph coloring. Combinatorica, 25(5):519–535, 2005.
- A tight bound on approximating arbitrary metrics by tree metrics. In Proceedings of the thirty-fifth annual ACM symposium on Theory of computing, pages 448–455, 2003.
- Socially fair k𝑘kitalic_k-means clustering. In Madeleine Clare Elish, William Isaac, and Richard S. Zemel, editors, Conference on Fairness, Accountability, and Transparency (FAccT), pages 438–448, 2021.
- Constant-factor approximation algorithms for socially fair k𝑘kitalic_k-clustering. arXiv preprint arXiv:2206.11210, 2022.
- Which lpsubscript𝑙𝑝l_{p}italic_l start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT norm is the fairest? approximations for fair facility location across all “p𝑝pitalic_p”, 2022.
- Approximation algorithms for fair range clustering. In International Conference on Machine Learning, pages 13270–13284. PMLR, 2023.
- Fair colorful k𝑘kitalic_k-center clustering. In International Conference on Integer Programming and Combinatorial Optimization, pages 209–222. Springer, 2020.
- Fair k𝑘kitalic_k-centers via maximum matching. In Proceedings of the International Conference on Machine Learning (ICML), pages 4940–4949, 2020.
- A center in your neighborhood: Fairness in facility location. In Proceedings of the Symposium on Foundations of Responsible Computing (FORC), page 5:1–5:15, 2020.
- The capacitated K-center problem. SIAM J. Discrete Math., 13(3):403–418, 2000.
- Fair k𝑘kitalic_k-center clustering for data summarization. In 36th International Conference on Machine Learning, ICML 2019, pages 5984–6003. International Machine Learning Society (IMLS), 2019.
- The matroid median problem. In Proceedings of the Symposium on Discrete Algorithms (SODA), pages 1117–1130, 2011.
- Constant approximation for k-median and k-means with outliers via iterative rounding. In Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, pages 646–659, 2018.
- Shi Li. On uniform capacitated k𝑘kitalic_k-median beyond the natural lp relaxation. ACM Transactions on Algorithms (TALG), 13(2):1–18, 2017.
- Individual fairness for k𝑘kitalic_k-clustering. In Proceedings of the International Conference on Machine Learning (ICML), pages 6586–6596, 2020.
- Approximation algorithms for socially fair clustering. In Conference on Learning Theory (COLT), pages 3246–3264. PMLR, 2021.
- Proportionally fair clustering revisited. In International Colloquium on Automata, Languages, and Programming (ICALP), 2020.
- Scalable algorithms for individual preference stable clustering. In International Conference on Artificial Intelligence and Statistics, pages 1108–1116, 2024.
- Better algorithms for individually fair k𝑘kitalic_k-clustering. Advances in Neural Information Processing Systems (NeurIPS), 34:13340–13351, 2021.
- Privacy preserving clustering with constraints. In International Colloquium on Automata, Languages, and Programming (ICALP), 2018.
- Fair coresets and streaming algorithms for fair k𝑘kitalic_k-means. In International Workshop on Approximation and Online Algorithms, pages 232–251, 2019.
- Improved approximation algorithms for individually fair clustering. In International Conference on Artificial Intelligence and Statistics (AISTATS), pages 8758–8779. PMLR, 2022.