Fully Dynamic k-Means Coreset in Near-Optimal Update Time (2406.19926v1)

Published 28 Jun 2024 in cs.DS

Abstract: We study in this paper the problem of maintaining a solution to $k$-median and $k$-means clustering in a fully dynamic setting. To do so, we present an algorithm to efficiently maintain a coreset, a compressed version of the dataset, that allows easy computation of a clustering solution at query time. Our coreset algorithm has near-optimal update time of $\tilde O(k)$ in general metric spaces, which reduces to $\tilde O(d)$ in the Euclidean space $\mathbb{R}^d$. The query time is $O(k^2)$ in general metrics, and $O(kd)$ in $\mathbb{R}^d$. To maintain a constant-factor approximation for $k$-median and $k$-means clustering in Euclidean space, this directly leads to an algorithm with update time $\tilde O(d)$ and query time $\tilde O(kd + k^2)$. To maintain an $O(\mathrm{polylog}~k)$-approximation, the query time is reduced to $\tilde O(kd)$.

Authors (3)
  1. Max Dupré la Tour (7 papers)
  2. Monika Henzinger (127 papers)
  3. David Saulpic (21 papers)
Citations (1)
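
The abstract describes a coreset as a compressed version of the dataset from which a clustering can be computed at query time. In the usual sense, this means a small weighted point set whose clustering cost is close to the cost on the full data for every choice of centers. The Python sketch below illustrates only that property and is not the paper's algorithm: it uses a toy construction, collapsing exact duplicate points into one weighted representative, so the costs agree exactly. The names `sq_dist`, `kmeans_cost`, and `merge_duplicates` are hypothetical helpers introduced for this illustration.

```python
from collections import Counter


def sq_dist(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans_cost(weighted_points, centers):
    """Weighted k-means cost: sum over points of weight times the
    squared distance to the nearest center."""
    return sum(
        w * min(sq_dist(p, c) for c in centers)
        for p, w in weighted_points
    )


def merge_duplicates(points):
    """Toy stand-in for a coreset construction (an assumption for this
    sketch, not the paper's method): collapse identical points into a
    single representative weighted by its multiplicity."""
    return list(Counter(points).items())


# Nine input points, only three of them distinct.
points = [(0.0, 0.0)] * 4 + [(1.0, 0.0)] * 2 + [(10.0, 10.0)] * 3
full = [(p, 1) for p in points]      # every point carries weight 1
coreset = merge_duplicates(points)   # three weighted representatives

centers = [(0.0, 0.0), (10.0, 10.0)]
print(len(full), len(coreset))
print(kmeans_cost(full, centers), kmeans_cost(coreset, centers))
```

Because the weights record multiplicities, evaluating the cost on the three weighted points gives the same value as evaluating it on all nine, for any set of centers. A real coreset construction accepts a small relative error in exchange for compressing data that has no exact duplicates.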