
Fast Matching of Regular Patterns with Synchronizing Counting (Technical Report) (2301.12851v1)

Published 30 Jan 2023 in cs.FL

Abstract: Fast matching of regular expressions with bounded repetition (also known as counting), such as (ab){50,100}, has been an open problem for at least two decades. Here, fast means matching in time linear in the length of the text and independent of the repetition bounds. We show that fast matching is possible for a wide class of regular expressions with counting, which we call synchronizing. We empirically show that the class covers nearly all counting used in usual applications of regex matching. This complexity result is based on an improvement and analysis of a recent matching algorithm that compiles regexes to deterministic counting-set automata (automata with registers that hold sets of numbers).
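
To make the idea of a register that holds a set of numbers more concrete, here is a minimal sketch in Python. It is a simplified illustration and not the paper's construction: it handles only the hypothetical pattern shape `.*a.{lo,hi}` (matched against the whole text), and the function name `matches_suffix_count` and the queue-plus-offset representation of the set are assumptions made for this example. Each counter in the set records how many characters have been read since some earlier `a`; storing the insertion time instead of the value lets all counters be incremented at once, so each input character costs constant time regardless of `lo` and `hi`.

```python
from collections import deque


def matches_suffix_count(text: str, lo: int, hi: int) -> bool:
    """Decide whether the whole text matches the illustrative regex .*a.{lo,hi}.

    The counting set is kept as a queue of insertion times plus a global
    offset. A stored time t represents the counter value (offset - t), so
    incrementing every counter is a single addition to the offset.
    """
    offset = 0        # number of characters read so far
    births = deque()  # insertion times, oldest (largest counter value) first

    for ch in text:
        offset += 1   # increments every counter in the set at once

        # Values are distinct and grow by one per step, so at most the
        # oldest counter can exceed the upper bound in a given step.
        if births and offset - births[0] > hi:
            births.popleft()

        if ch == "a":
            births.append(offset)  # new counter with value 0

    # All remaining values are <= hi, so some value lies in [lo, hi]
    # exactly when the largest one (front of the queue) is >= lo.
    return bool(births) and offset - births[0] >= lo
```

For example, `matches_suffix_count("a" + "x" * 75, 50, 100)` checks the text against `.*a.{50,100}` while doing a constant amount of work per character, whereas a naive unfolding of the repetition would create a number of states proportional to the bound.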

Authors (4)
  1. Lukáš Holík (30 papers)
  2. Juraj Síč (7 papers)
  3. Lenka Turoňová (2 papers)
  4. Tomáš Vojnar (31 papers)
Citations (4)
