Are We Still Missing an Item? (2401.06547v2)
Abstract: The missing item problem, as introduced by Stoeckl in his work at SODA 23, focuses on continually identifying a missing element $e$ in a stream of elements ${e_1, ..., e_{\ell}}$ from the set ${1,2,...,n}$, such that $e \neq e_i$ for any $i \in {1,...,\ell}$. Stoeckl's investigation primarily delves into scenarios with $\ell<n$, providing bounds for the (i) deterministic case, (ii) the static case -- where the algorithm might be randomized but the stream is fixed in advanced and (iii) the adversarially robust case -- where the algorithm is randomized and each stream element can be chosen depending on earlier algorithm outputs. Building upon this foundation, our paper addresses previously unexplored aspects of the missing item problem. In the first segment, we examine the static setting with a long stream, where the length of the steam $\ell$ is close to or even exceeds the size of the universe $n$. We present an algorithm demonstrating that even when $\ell$ is very close to $n$ (say $\ell=n-1$), polylog($n$) bits of memory suffice to identify the missing item. When the stream's length $\ell$ exceeds the size of the universe $n$ i.e. $\ell = n +k$, we show a tight bound of roughly $\Theta(k)$. The second segment focuses on the adversarially robust setting. We show a lower bound for a pseudo-deterministic error-zero (where the algorithm reports its errors) algorithm of approximating $\Omega(\ell)$, up to polylog factors. Based on Stoeckl's work and the previous result, we establish a tight bound for a random-start (only use randomness at initialization) error-zero streaming algorithm of roughly $\Theta(\sqrt{\ell})$.
- A framework for adversarially robust streaming algorithms. ACM Journal of the ACM (JACM), 69(2):1–33, 2022.
- An information complexity approach to extended formulations. In Proceedings of the forty-fifth annual ACM symposium on Theory of computing, pages 161–170, 2013.
- When a random tape is not enough: lower bounds for a problem in adversarially robust streaming. arXiv preprint arXiv:2310.03634, 2023.
- Adversarially robust coloring for graph streams. arXiv preprint arXiv:2109.11130, 2021.
- Uriel Feige. A randomized strategy in the mirror game. arXiv preprint arXiv:1901.07809, 2019.
- The space complexity of mirror games. In 10th Innovations in Theoretical Computer Science Conference (ITCS 2019). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, 2018.
- Communication complexity of set-disjointness for all probabilities. Theory of Computing, 12(1):1–23, 2016.
- Perfect l_p sampling in a data stream. SIAM Journal on Computing, 50(2):382–439, 2021.
- Separating adaptive streaming from oblivious streaming using the bounded storage model. In Annual International Cryptology Conference, pages 94–121. Springer, 2021.
- Mirror games against an open book player. Theoretical Computer Science, page 114159, 2023.
- Keep that card in mind: Card guessing with limited memory. arXiv preprint arXiv:2107.03885, 2021.
- Communication Complexity: and Applications. Cambridge University Press, 2020.
- Manuel Stoeckl. Streaming algorithms for the missing item finding problem. In Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 793–818. SIAM, 2023.