Practical Repetition-Aware Grammar Compression (1910.13479v1)

Published 29 Oct 2019 in cs.DS

Abstract: The goal of grammar compression is to construct a small context-free grammar that uniquely generates the input text. Among grammar compression methods, RePair is known for its good practical compression performance. MR-RePair was recently proposed as an improvement to RePair for constructing small context-free grammars for repetitive text data. However, a compact encoding scheme has not been discussed for MR-RePair. We propose a practical encoding method for MR-RePair and show its effectiveness through comparative experiments. Moreover, we extend MR-RePair to run-length context-free grammars and design a novel variant called RL-MR-RePair. We experimentally demonstrate that a compression scheme consisting of RL-MR-RePair and the proposed encoding method shows good performance on real repetitive datasets.
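Since both MR-RePair and RL-MR-RePair build on RePair, the following is a minimal, illustrative sketch of the core RePair idea: repeatedly replace the most frequent adjacent symbol pair with a fresh nonterminal until no pair occurs twice. The naive frequency scan, the rule naming, and the function name are assumptions made for clarity; they are not the authors' implementation, which also covers the compact encoding and run-length extensions described in the paper.

```python
# Naive RePair sketch: replace the most frequent adjacent pair with a new
# nonterminal until every pair occurs at most once. Real implementations use
# priority queues and linked sequences to run in linear time.
from collections import Counter

def repair(text):
    seq = list(text)            # working sequence of terminals/nonterminals
    rules = {}                  # nonterminal -> (left symbol, right symbol)
    next_id = 0
    while True:
        pairs = Counter(zip(seq, seq[1:]))
        if not pairs:
            break
        pair, freq = pairs.most_common(1)[0]
        if freq < 2:
            break
        nt = f"R{next_id}"      # fresh nonterminal name (illustrative)
        next_id += 1
        rules[nt] = pair
        # Rewrite the sequence, replacing non-overlapping occurrences of `pair`.
        out, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
                out.append(nt)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
    return seq, rules           # start sequence plus grammar rules

if __name__ == "__main__":
    start, grammar = repair("abracadabra abracadabra")
    print(start)
    print(grammar)
```

MR-RePair differs from this sketch by replacing maximal repeats rather than single pairs per step, and RL-MR-RePair additionally introduces run-length rules for repeated symbols, which is what makes the resulting run-length context-free grammar compact on repetitive inputs.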
