Evolutionary Algorithms Simulating Molecular Evolution: A New Field Proposal (2403.08797v2)
Abstract: The genetic blueprint for the essential functions of life is encoded in DNA, which is translated into proteins, the engines driving most of our metabolic processes. Recent advancements in genome sequencing have unveiled a vast diversity of protein families, but compared to the massive search space of all possible amino acid sequences, the set of known functional families is minimal. One could say nature has a limited protein "vocabulary." The major question for computational biologists, therefore, is whether this vocabulary can be expanded to include useful proteins that went extinct long ago or never evolved in the first place. We outline a computational approach to solving this problem. By merging evolutionary algorithms, machine learning, and bioinformatics, we can facilitate the development of completely novel proteins that have never existed before. We envision this work forming a new sub-field of computational evolution we dub evolutionary algorithms simulating molecular evolution (EASME).
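To make the idea of an evolutionary algorithm searching amino acid sequence space concrete, here is a minimal sketch. It is an illustration under stated assumptions, not the method proposed in the paper: the fitness function `toy_fitness`, the `target` sequence, and all parameter values are hypothetical stand-ins for the machine-learning or bioinformatics scoring an EASME-style system would presumably use. The helper `search_space_size` shows why exhaustive search is infeasible, since a sequence of length L has 20^L possible variants.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def search_space_size(length):
    """Number of distinct amino acid sequences of a given length."""
    return len(AMINO_ACIDS) ** length


def toy_fitness(seq, target):
    """Hypothetical scoring function (an assumption for illustration):
    counts positions that match an arbitrary target sequence."""
    return sum(a == b for a, b in zip(seq, target))


def mutate(seq, rate, rng):
    """Point mutation: redraw each residue at random with probability `rate`."""
    return "".join(
        rng.choice(AMINO_ACIDS) if rng.random() < rate else aa for aa in seq
    )


def crossover(a, b, rng):
    """Single-point crossover between two equal-length parents."""
    point = rng.randrange(1, len(a))
    return a[:point] + b[point:]


def evolve(target, pop_size=50, generations=100, rate=0.05, seed=0):
    """Evolve random sequences toward higher toy fitness.

    Returns the best final sequence and the best fitness seen at the
    start of each generation.
    """
    rng = random.Random(seed)
    length = len(target)
    population = [
        "".join(rng.choice(AMINO_ACIDS) for _ in range(length))
        for _ in range(pop_size)
    ]
    best_history = []
    for _ in range(generations):
        ranked = sorted(
            population, key=lambda s: toy_fitness(s, target), reverse=True
        )
        best_history.append(toy_fitness(ranked[0], target))
        parents = ranked[: pop_size // 2]  # truncation selection
        children = [ranked[0]]  # elitism: carry the best individual over unchanged
        while len(children) < pop_size:
            p1, p2 = rng.sample(parents, 2)
            children.append(mutate(crossover(p1, p2, rng), rate, rng))
        population = children
    best = max(population, key=lambda s: toy_fitness(s, target))
    return best, best_history


if __name__ == "__main__":
    target = "MKTAYIAKQR"  # arbitrary example sequence, not a real design goal
    best, history = evolve(target)
    print("search space size:", search_space_size(len(target)))
    print("best sequence:", best, "fitness:", toy_fitness(best, target))
```

Because the best individual is copied unchanged into each new generation, the best fitness never decreases between generations. In a realistic setting, the target-matching score would be replaced by a predictor of some functional property, and that predictor is where the machine learning and bioinformatics components would enter.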