- The paper introduces EvoGraph, a framework that uses hybrid directed graph evolution and SLMs to autonomously modernize legacy software.
- It demonstrates significant improvements with a 93% semantic equivalence in language translation and reduced latency and lead times over traditional methods.
- Leveraging cost-effective SLMs, EvoGraph supports multi-language modernization while enforcing safety-aware, multi-objective optimization.
EvoGraph: Hybrid Directed Graph Evolution toward Software 3.0
The paper "EvoGraph: Hybrid Directed Graph Evolution toward Software 3.0" introduces EvoGraph, a sophisticated framework designed to autonomously evolve software systems, particularly focusing on legacy code modernization. The system leverages hybrid directed graph evolution, integrating small LLMs (SLMs) to apply mutations and assess fitness across various software artefacts. EvoGraph addresses critical challenges in legacy modernization, offering efficiency and semantic equivalency via cost-effective SLMs.
Framework Overview
EvoGraph capitalizes on directed graphs to represent and manage software artefacts, including source code, build pipelines, documentation, and other operational elements. Each node within the graph corresponds to a specific artefact type, such as code or documentation, while edges represent dependencies and relationships. This representation enables comprehensive management and modernization of legacy systems by capturing dynamic aspects like runtime traces, often overlooked by static analyzers.
Mutation operators like Weight Merge and Code Patch facilitate the evolution of these graphs, informed by expert systems for each respective language. EvoGraph's integration of safety-aware multi-objective selection mechanisms ensures that evolutionary changes not only improve system utility but also comply with safety and performance constraints (Figure 1).
Figure 1: EvoGraph system architecture showing the hybrid directed graph evolution process. Left: Artefact graph representation with typed nodes (code, docs, build, etc.) and mutation operators. Right: evoGraph specialization where language-specific SLMs handle targeted mutations for different legacy languages, coordinated by a central controller that maintains safety constraints and fitness optimization.
Experimental Validation
The framework was empirically validated against a variety of benchmarks representing real-world scenarios. EvoGraph demonstrated substantial improvements over existing methods, achieving a 93% functional equivalence in COBOL to Java translations and an 83% correction rate for security vulnerabilities on test systems. Moreover, it significantly reduced latency and feature lead times, outperforming traditional baselines such as hybrid automated program repair tools (Table 1).
Task |
Metric |
Copilot |
APR |
DGM |
EvoGraph |
OSS Bank CVEs fixed |
/18 (up) |
6 |
4 |
9 |
15 |
Tele Shop p95 latency |
ms (down) |
420 |
395 |
330 |
250 |
Lead time |
h (down) |
168 |
120 |
24 |
3 |
COBOL to Java equiv. |
% (up) |
-- |
-- |
71 |
93 |
Doc freshness BLEU |
(up) |
0.33 |
0.31 |
0.47 |
0.82 |
Table 1: Main experimental results comparing EvoGraph against baselines.
The performance metrics established EvoGraph's superior capability to manage multi-objective optimization tasks that are crucial for effective software modernization.
Implications of SLMs
A standout feature of EvoGraph is its use of SLMs, which provide a substantial reduction in computational costs without compromising performance. SLMs are specifically tuned for different legacy languages, capturing nuanced patterns and reducing the need for extensive computational resources. Compared to large models like GPT-4, SLMs offer an economical alternative for agentic AI applications, which is a crucial consideration for enterprises operating with budget constraints but requiring high-efficiency processing.
EvoGraph harnesses SLMs not only to streamline operations but also to facilitate rapid iteration and deployment flexibility, supporting on-premises modernization and edge computing solutions.
Multi-Language Modernization
EvoGraph's ability to support multi-language modernization is demonstrated through its capability to achieve high semantic equivalency across diverse language pairs. This flexibility highlights its potential for wider adoption in heterogeneous codebase environments.
Language Pair |
LOC |
Sem. Equiv. |
Perf. Gain |
Mod. Time |
COBOL → Java |
12,000 |
93% |
2.1× |
3h |
.NET → Java |
25,000 |
87% |
1.5× |
5h |
Lisp → Python |
8,000 |
91% |
1.3× |
2h |
Table 2: Multi-language modernization results showing semantic equivalence, performance gains, and modernization times across different language pairs.
The seamless modular architecture of EvoGraph allows it to integrate new languages with ease. Such adaptability positions EvoGraph as a valuable asset for tackling a broad range of modernization projects across industries.
Conclusion
EvoGraph presents a sophisticated approach for software modernization through directed graph-based evolution and the utilization of SLMs. Its results indicate not only practical improvements in key performance metrics but also alignment with industry needs for cost efficiency and flexibility in legacy modernization initiatives. EvoGraph's demonstrated ability to improve semantic equivalence while significantly cutting costs supports the use of SLMs in agentic systems, providing a compelling argument for their further adoption and development in this field.