Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
167 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

On Estimating the First Frequency Moment of Data Streams (1005.0809v1)

Published 5 May 2010 in cs.DS

Abstract: Estimating the first moment of a data stream defined as $F_1 = \sum_{i \in {1, 2, \ldots, n}} \abs{f_i}$ to within $1 \pm \epsilon$-relative error with high probability is a basic and influential problem in data stream processing. A tight space bound of $O(\epsilon{-2} \log (mM))$ is known from the work of [Kane-Nelson-Woodruff-SODA10]. However, all known algorithms for this problem require per-update stream processing time of $\Omega(\epsilon{-2})$, with the only exception being the algorithm of [Ganguly-Cormode-RANDOM07] that requires per-update processing time of $O(\log2(mM)(\log n))$ albeit with sub-optimal space $O(\epsilon{-3}\log2(mM))$. In this paper, we present an algorithm for estimating $F_1$ that achieves near-optimality in both space and update processing time. The space requirement is $O(\epsilon{-2}(\log n + (\log \epsilon{-1})\log(mM)))$ and the per-update processing time is $O( (\log n)\log (\epsilon{-1}))$.

Summary

We haven't generated a summary for this paper yet.