Lempel-Ziv Factorization May Be Harder Than Computing All Runs (1409.5641v1)

Published 19 Sep 2014 in cs.DS

Abstract: The complexity of computing the Lempel-Ziv factorization and the set of all runs (= maximal repetitions) is studied in the decision tree model of computation over an ordered alphabet. It is known that both problems can be solved by RAM algorithms in $O(n\log\sigma)$ time, where $n$ is the length of the input string and $\sigma$ is the number of distinct letters in it. We prove an $\Omega(n\log\sigma)$ lower bound on the number of comparisons required to construct the Lempel-Ziv factorization and thereby conclude that the popular technique of computing runs via the Lempel-Ziv factorization cannot achieve an $o(n\log\sigma)$ time bound. In contrast, we exhibit an $O(n)$ decision tree algorithm that finds all runs in a string. Therefore, in the decision tree model the runs problem is easier than the Lempel-Ziv factorization. This supports the conjecture that there is a linear RAM algorithm finding all runs.
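
To make the two objects named in the abstract concrete, here is a minimal brute-force sketch in Python. It is not the algorithm from the paper and it runs in polynomial rather than linear time. It assumes one common definition of each object, which may differ in detail from the paper's: a Lempel-Ziv factor is either a letter not seen before or the longest prefix of the remaining text that also starts at an earlier position (overlap allowed), and a run is a maximal substring whose smallest period `p` fits at least twice into it. The function names `lz_factorize` and `find_runs` are illustrative only.

```python
def lz_factorize(text):
    """Naive Lempel-Ziv factorization (illustrative, not linear time)."""
    factors = []
    n = len(text)
    i = 0
    while i < n:
        longest = 0
        # Try every earlier start j < i; the earlier copy may overlap position i.
        for j in range(i):
            k = 0
            while i + k < n and text[j + k] == text[i + k]:
                k += 1
            longest = max(longest, k)
        # A letter with no earlier occurrence forms a factor of length 1.
        length = max(longest, 1)
        factors.append(text[i:i + length])
        i += length
    return factors


def find_runs(text):
    """Naive search for all runs, returned as (start, end_exclusive, period)."""
    n = len(text)
    found = {}  # (start, end_exclusive) -> smallest period
    # Periods are tried in increasing order, so the first period recorded
    # for an interval is its smallest one.
    for p in range(1, n // 2 + 1):
        i = 0
        while i + p < n:
            if text[i] != text[i + p]:
                i += 1
                continue
            start = i
            # Extend while the text keeps repeating with period p.
            while i + p < n and text[i] == text[i + p]:
                i += 1
            end = i + p
            # Keep only substrings containing at least two full periods.
            if end - start >= 2 * p and (start, end) not in found:
                found[(start, end)] = p
    return sorted((s, e, p) for (s, e), p in found.items())


if __name__ == "__main__":
    print(lz_factorize("abababb"))
    print(find_runs("abababb"))
```

For the string `abababb`, this sketch yields the factors `a`, `b`, `abab`, `b` and the runs `(0, 6, 2)` (the substring `ababab`) and `(5, 7, 1)` (the substring `bb`). Both functions compare letters only for equality, so they say nothing about the comparison bounds discussed in the abstract.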

Citations (19)

Related Papers

Lempel-Ziv Computation In Compressed Space (LZ-CICS) (2015)
Computing Runs on a General Alphabet (2015)
The "Runs" Theorem (2014)
Faster Compact On-Line Lempel-Ziv Factorization (2013)
Computing Lempel-Ziv Factorization Online (2012)