Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Coresets for Constrained Clustering: General Assignment Constraints and Improved Size Bounds (2301.08460v6)

Published 20 Jan 2023 in cs.DS and cs.CG

Abstract: Designing small-sized \emph{coresets}, which approximately preserve the costs of the solutions for large datasets, has been an important research direction for the past decade. We consider coreset construction for a variety of general constrained clustering problems. We introduce a general class of assignment constraints, including capacity constraints on cluster centers, and assignment structure constraints for data points (modeled by a convex body $\mathcal{B}$). We give coresets for clustering problems with such general assignment constraints that significantly generalize and improve known results. Notable implications include the first $\varepsilon$-coreset for capacitated and fair $k$-Median with $m$ outliers in Euclidean spaces whose size is $\tilde{O}(m + k2 \varepsilon{-4})$, generalizing and improving upon the prior bounds in Braverman et al., FOCS' 22; Huang et al., ICLR' 23, and the first $\epsilon$-coreset of size $\mathrm{poly}(k \varepsilon{-1})$ for fault-tolerant clustering for various types of metric spaces.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Lingxiao Huang (39 papers)
  2. Jian Li (667 papers)
  3. Pinyan Lu (69 papers)
  4. Xuan Wu (59 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.