Deterministic Dynamic Execution
- Deterministic Dynamic Execution is a scheduling paradigm that enforces pre-determined resource allocation and execution order to guarantee timing-anomaly-free execution of multi-typed DAG tasks.
- The approach uses an offline simulation with WCET assignments to derive both safe and tight worst-case response time estimates under deterministic constraints.
- Empirical evaluations reveal significant reductions in worst-case response times and jitter with minimal scheduling overhead on heterogeneous systems.
Deterministic Dynamic Execution (DDE) is a scheduling paradigm introduced to guarantee timing-anomaly-free execution for dynamic scheduling of multi-typed directed acyclic graph (DAG) tasks on heterogeneous platforms. By enforcing deterministic constraints on resource allocation and task execution order, DDE eliminates the pitfalls of conventional dynamic schedulers, notably timing anomalies that complicate or invalidate standard worst-case response time (WCRT) analyses. DDE achieves both safety and tightness in WCRT estimation through a single offline simulation, with minimal overhead and favorable empirical properties even on large-scale heterogeneous systems (Zhu et al., 28 Jan 2026).
1. System and Task Model
DDE is formulated for heterogeneous platforms comprising multiple processing unit types (e.g., CPUs, GPUs), each possibly with several identical instances. Let denote the set of processing-unit types, and the instance count for type .
The task model consists of a multi-typed DAG :
- : tasks; : precedence constraints.
- : eligible resource types for each task.
- giving for
where BCET/WCET denote best/worst-case execution times, capturing workload- and resource-dependent variability.
For task :
- : start time; : finish time; (relative to the DAG source).
- System response: .
Let be the set of all feasible schedules respecting precedence and resource constraints. Then the system WCRT is:
2. Timing Anomalies in Dynamic Scheduling
A timing anomaly (TA) refers to the counterintuitive phenomenon where a locally accelerated event (e.g., a task finishes early at BCET) can increase the global WCRT relative to the all-WCET simulation bound. This undermines the validity of conventional WCRT analyses and frequently forces unsafe or overly pessimistic estimation procedures.
The paper defines a strict timing anomaly precisely. Consider two system states in the execution-progress domain (see below), with for all tasks in some , and for all future evolution, allocated execution times in are at least those in . A strict timing anomaly is exhibited if, after transitions, , i.e., the “faster” state falls behind due to dynamically shifting resource contention or precedence scheduling.
An explicit example demonstrates this: an early completion on a GPU causes downstream loss of parallelism, making a critical task await resource availability longer, prolonging the overall response beyond the offline bound (Zhu et al., 28 Jan 2026).
3. Deterministic Dynamic Execution Algorithm
3.1 Execution-Progress Model
Execution progress is represented by the domain
and , with indicating the current stage and the remaining time at that stage. A partial order is imposed:
where .
A system state maps each task to its progress, ordered pointwise: iff for all .
The state transition is defined such that in each scheduling tick, for each task :
- If ready to move stage (all dependencies met and resources available) or Exec done: transition to next stage () and set time to required allocation, otherwise decrement remaining time.
3.2 Deterministic Execution Constraints
To exclude timing anomalies, DDE enforces:
- Resource Allocation Determinism: Every task is always scheduled on a predetermined unit type :
- Execution Order Determinism: Tasks must enter Exec in a single total order extending the DAG’s partial order. Task cannot start until all have started.
The DDE scheduler executes as follows (pseudocode as stated):
1 2 3 4 5 6 7 |
while exists tick(c)==0: c ← upd(c) for each t in ReadyQueue in order O_D: if all predecessors of t in O_D have begun Exec: assign t to an idle instance of res_t set its remaining time = WCET(t, res_t) mark t as begun Exec |
4. WCRT Estimation via Offline Simulation
Given the enforced determinism, the WCRT is derived from a single offline execution:
- Assign all tasks their WCETs and simulate from all tasks in Block.
- Upon sink completion, response time is both safe (no under-approximation) and tight (no excessive pessimism), as deterministic constraints preclude timing anomaly deviation.
Formally,
where is the finish time across all ticks.
5. Construction of Deterministic Execution Constraints
DDE constraints can be generated offline by either of two methods:
- Trace Extraction: Run any dynamic scheduler (e.g., HBFS) offline under all-WCET settings. For each task, record its start time, finish time, and chosen resource. Build by sorting tasks by (ties broken by task ID), set .
- Heuristic-Based HACPA: Compute for each task a critical-path “rank”:
with . Sort tasks by descending rank. For each , choose the resource leading to earliest finish, update and in assignment order.
6. Formal Guarantees and Correctness
The DDE paradigm is supported by a series of formal results:
- Monotonicity: Under DDE, execution progress is monotonic—no fast-execution state can ever fall behind a slower one under any evolution.
- Strict TA Elimination: Lemmas ensure forward progress, dependency monotonicity, and that staged transitions preserve the progress order. The general strict TA-freedom theorem constructs a general proof by induction and repeated application of monotonicity lemmas, showing that no dynamic execution under DDE can exhibit a strict timing anomaly.
This ensures the single all-WCET simulation bound is both sound (never optimistic) and maximally precise given the enforced constraints.
7. Empirical Evaluation and Observed Impact
Experiments employed random DAG task sets generated via the model, with and edge-probability . Variability in was chosen to represent both wide and narrow WCET-BCET spreads.
Key findings:
- TA Probability: In unconstrained HFCFS, TAs occurred in up to 44.8% of tested systems at low , vanishing as DAG density increased. HBFS displayed much lower rates (.
- WCRT Reduction: In TA-affected systems, DDE imposed on HFCFS reduced WCRT by between 5–25% (best observed: 72%) relative to the unconstrained maximum observed WCRT. HACPA-derived constraints further reduced WCRT.
- Jitter: Enforced determinism cut response-time jitter by 7–9% on average.
- Average Response Time Impact: The average overhead on AVRT was minor (1–12%; avg 4.2%). In TA-affected systems, AVRT sometimes decreased (average –1.2%, up to –40%).
- Overhead: Scheduling overhead for DDE was minimal (one priority-queue lookup per ready task per tick) relative to task execution costs.
A plausible implication is that the robustness and tight WCRT estimation provided by DDE are achieved with negligible runtime cost and limited sacrifice in average-case performance, supporting its practical suitability for real-world safety- or real-time-critical heterogeneous applications (Zhu et al., 28 Jan 2026).