Space lower bound for heavy hitters when only the list (no estimates) is output
Determine whether computing the deterministic heavy hitters problem \(\msf{HeavyHitters}[n, U, k]\) in the streaming model—when the algorithm is required to output only a list of \(k\) elements containing all items with frequency at least \(n/k\) but not the frequency estimates—requires \(\Omega(k\log(n/k))\) bits of space in the worst case for integers \(n, U, k\) with \(\min\{n, U\} \gg k \ge 2\).
Sponsor
References
We remark that the proof above depends on the estimates $\wtilde{f_1},\cs,\wtilde{f_k}$. If we are only required to output a list ${u_1,\cs,u_k}$, we do not know whether we can prove the streaming lower bound. We leave this as an open problem. Prove or disprove: for any integers $n,U,k$ such that $\min{n,U}\gg k>=2$, computing $\msf{HeavyHitters}[n,U,k]$ (without outputting $\wtilde{f_1},\cs,\wtilde{f_k}$) requires $\Omegak\log(n/k)$ bits of space in the streaming model.