
Messenger RNA Design via Expected Partition Function and Continuous Optimization (2401.00037v2)

Published 29 Dec 2023 in q-bio.BM, cs.AI, and cs.LG

Abstract: RNA design tasks are discrete optimization problems, and several versions of these problems are NP-hard. As an alternative to commonly used local search methods, we formulate these problems as continuous optimization and develop a general framework for this optimization based on a generalization of the classical partition function, which we call the "expected partition function". The basic idea is to start with a distribution over all possible candidate sequences and to extend the objective function from a single sequence to a distribution. We then use gradient descent-based optimization methods to improve the extended objective function, and the distribution gradually shrinks towards a one-hot sequence (i.e., a single sequence). As a case study, we consider the important problem of mRNA design, which has wide applications in vaccines and therapeutics. While the recent work of LinearDesign can efficiently optimize mRNAs for minimum free energy (MFE), optimizing for ensemble free energy is much harder and likely intractable. Our approach consistently improves over the LinearDesign solution in terms of ensemble free energy, with bigger improvements on longer sequences.
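To make the idea of extending the objective from a sequence to a distribution concrete, here is a minimal sketch in Python. It is not the paper's implementation. It assumes a toy Nussinov-style energy model with made-up pair energies (`PAIR_ENERGY`), a hypothetical minimum hairpin loop size (`MIN_LOOP`), and a distribution that factorizes over positions; the paper's actual energy model and parametrization may differ. Under these assumptions, the expectation of the partition function over all sequences can be computed by the same dynamic program as for a single sequence, with each pair's Boltzmann weight replaced by its expected value. The sketch checks this against brute-force enumeration.

```python
import itertools
import math

ALPHABET = "ACGU"
# Hypothetical toy pair energies in kcal/mol; NOT a real thermodynamic model.
PAIR_ENERGY = {("C", "G"): -3.0, ("G", "C"): -3.0,
               ("A", "U"): -2.0, ("U", "A"): -2.0,
               ("G", "U"): -1.0, ("U", "G"): -1.0}
RT = 0.6163    # kcal/mol, approximately R * 310.15 K
MIN_LOOP = 1   # assumption: at least one unpaired base inside a hairpin


def boltzmann(a, b):
    """Boltzmann weight of pairing a with b; 0 if they cannot pair."""
    energy = PAIR_ENERGY.get((a, b))
    return 0.0 if energy is None else math.exp(-energy / RT)


def partition(n, weight):
    """Sum over all nested structures of the product of pair weights.

    Q[i][j] covers positions i..j-1 (half-open); empty spans have Q = 1.
    weight(k, j) is the weight for pairing position k with position j.
    """
    Q = [[1.0] * (n + 1) for _ in range(n + 1)]
    for length in range(1, n + 1):
        for i in range(0, n - length + 1):
            j = i + length
            last = j - 1
            total = Q[i][last]                      # last base unpaired
            for k in range(i, last - MIN_LOOP):     # last base pairs with k
                total += Q[i][k] * weight(k, last) * Q[k + 1][last]
            Q[i][j] = total
    return Q[0][n]


def partition_of_sequence(x):
    """Classical partition function of one concrete sequence."""
    return partition(len(x), lambda k, j: boltzmann(x[k], x[j]))


def expected_partition(dist):
    """Expected partition function; dist[i] maps nucleotide -> probability."""
    def weight(k, j):
        return sum(dist[k][a] * dist[j][b] * boltzmann(a, b)
                   for a in ALPHABET for b in ALPHABET)
    return partition(len(dist), weight)


def brute_force_expectation(dist):
    """Reference value: enumerate every sequence and average Q(x)."""
    total = 0.0
    for x in itertools.product(ALPHABET, repeat=len(dist)):
        prob = math.prod(dist[i][c] for i, c in enumerate(x))
        if prob > 0.0:
            total += prob * partition_of_sequence(x)
    return total


def one_hot(sequence):
    """Distribution that puts all probability on one sequence."""
    return [{c: 1.0 if c == ch else 0.0 for c in ALPHABET} for ch in sequence]


# A small non-uniform distribution over 5 positions (illustrative values).
demo_dist = [
    {"A": 0.1, "C": 0.2, "G": 0.6, "U": 0.1},
    {"A": 0.25, "C": 0.25, "G": 0.25, "U": 0.25},
    {"A": 0.7, "C": 0.1, "G": 0.1, "U": 0.1},
    {"A": 0.25, "C": 0.25, "G": 0.25, "U": 0.25},
    {"A": 0.1, "C": 0.6, "G": 0.2, "U": 0.1},
]
print(expected_partition(demo_dist), brute_force_expectation(demo_dist))
```

In this toy setting the expected partition function is a polynomial in the per-position probabilities, so it is differentiable and could be handed to a gradient-based optimizer (for example through an automatic differentiation library). For a one-hot distribution it reduces to the classical partition function of that single sequence, and the ensemble free energy of a sequence is `-RT * math.log(Q)`.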
