2000 character limit reached
Constructing LZ78 Tries and Position Heaps in Linear Time for Large Alphabets (1501.06619v1)
Published 26 Jan 2015 in cs.DS
Abstract: We present the first worst-case linear-time algorithm to compute the Lempel-Ziv 78 factorization of a given string over an integer alphabet. Our algorithm is based on nearest marked ancestor queries on the suffix tree of the given string. We also show that the same technique can be used to construct the position heap of a set of strings in worst-case linear time, when the set of strings is given as a trie.