A* shortest string decoding for non-idempotent semirings (2204.07236v2)

Published 14 Apr 2022 in cs.FL and cs.CL

Abstract: The single shortest path algorithm is undefined for weighted finite-state automata over non-idempotent semirings because such semirings do not guarantee the existence of a shortest path. However, in non-idempotent semirings admitting an order satisfying a monotonicity condition (such as the plus-times or log semirings), the notion of shortest string is well-defined. We describe an algorithm which finds the shortest string for a weighted non-deterministic automaton over such semirings using the backwards shortest distance of an equivalent deterministic automaton (DFA) as a heuristic for A* search performed over a companion idempotent semiring, which is proven to return the shortest string. While there may be exponentially more states in the DFA, this algorithm needs to visit only a small fraction of them if determinization is performed "on the fly".
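To make the distinction between shortest path and shortest string concrete, here is a minimal sketch in Python. It is not the paper's A* algorithm: it enumerates every path of a tiny acyclic automaton by brute force, which is only feasible for toy inputs. The automaton, its weights, and all names (`ARCS`, `paths`, `best_path_and_string`) are hypothetical, and the sketch assumes the plus-times (probability) semiring with "shortest" read as "highest total weight".

```python
from collections import defaultdict

# Hypothetical toy acyclic NFA over the plus-times (probability) semiring.
# ARCS[state] is a list of (label, weight, next_state) triples.
ARCS = {
    0: [("a", 0.4, 1), ("b", 0.3, 2), ("b", 0.3, 3)],
    1: [("x", 1.0, 4)],
    2: [("x", 1.0, 4)],
    3: [("x", 1.0, 4)],
}
START = 0
FINAL = {4}


def paths(state=START, labels="", weight=1.0):
    """Yield (string, path_weight) for every accepting path.

    Path weights are combined with times (ordinary multiplication).
    Assumes the automaton is acyclic, so the recursion terminates.
    """
    if state in FINAL:
        yield labels, weight
    for label, arc_weight, next_state in ARCS.get(state, []):
        yield from paths(next_state, labels + label, weight * arc_weight)


def best_path_and_string():
    """Return the single best path and the best string, by brute force."""
    all_paths = list(paths())
    # Best single path: compare path weights directly.
    best_path = max(all_paths, key=lambda item: item[1])
    # Best string: sum (the semiring's plus) over all paths sharing a string.
    totals = defaultdict(float)
    for labels, weight in all_paths:
        totals[labels] += weight
    best_string = max(totals.items(), key=lambda item: item[1])
    return best_path, best_string


if __name__ == "__main__":
    path, string = best_path_and_string()
    print("best path:  ", path)
    print("best string:", string)
```

In this toy automaton the heaviest single path spells "ax", but "bx" is spelled by two paths whose weights add up to more, so the best string is "bx". Because plus is ordinary addition rather than max, ranking paths alone cannot answer the string question; the abstract's approach addresses this by working with an equivalent deterministic automaton, where each string labels at most one path, and determinizing it on the fly.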

