Papers
Topics
Authors
Recent
Search
2000 character limit reached

Electric Vehicle Routing with Time Windows

Updated 28 January 2026
  • EVRPTW is a complex optimization problem combining routing, scheduling, and recharging decisions for electric vehicles within strict time and energy constraints.
  • Advanced methodologies include MILP formulations, metaheuristic algorithms, and reinforcement learning, enhancing route feasibility and operational efficiency.
  • Extensions address uncertainty, partial/nonlinear charging, shared mobility, and V2G integration, enabling robust and sustainable logistics solutions.

The Electric Vehicle Routing Problem with Time Windows (EVRPTW) is a constrained combinatorial optimization problem central to sustainable logistics, aiming to route a fleet of electric vehicles (EVs) to serve customer demands within specified time intervals while respecting battery, cargo, and operational constraints. The problem’s intricacy stems from tightly coupled decision variables—routing, recharging, scheduling—under the complex operational realities of electric fleets. Advanced modeling integrates partial and nonlinear charging, charging station queuing, time-dependent travel, uncertainty, shared mobility, grid interactions (vehicle-to-grid), pickup-delivery couplings, and operational heterogeneity, as well as exact, metaheuristic, and learning-based solution methodologies.

1. Mathematical and Operational Formulation

The canonical EVRPTW is defined on a directed graph G=(V,A)G=(V, A) with depot(s), set CC of customers, set FF of charging stations, and, in extensions, variants such as pickup-delivery pairs, discharging stations, or time-dependent arcs. Each customer ii specifies a demand qiq_i and a time window [ei,li][e_i, l_i]. Each route is subject to cargo capacity QQ, battery capacity BB, and trip-specific energy costs eije_{ij} that depend on distance, vehicle load, and sometimes exogenous factors (e.g., weather or traffic for stochastic consumption).

Decision variables typically include route-selection binaries (xijkx_{ij}^k), arrival times (tikt_i^k), state-of-charge (SoCikSoC_i^k), and per-stop charge quantities (cikc_i^k), with possible per-route, per-customer, and per-charging station variations.

The objective can lexicographically minimize (i) number of vehicles, (ii) total operational cost (distance/time), and (iii) resource usage (energy/cost), as in the MILP and hybrid models presented in (Bruglieri et al., 2015, Khanda et al., 2 Aug 2025, Köster et al., 19 Dec 2025), and (Duc et al., 9 Sep 2025). Constraints enforce unique customer visitation, flow conservation, time-window adherence, battery/energy propagation, and, in advanced versions, further operational couplings.

Significantly, advanced formulations address:

2. Core Algorithmic and Metaheuristic Approaches

Due to the problem’s NP-hardness, scalable solution techniques combine exact and metaheuristic strategies:

A schematic overview of main strategies is provided:

Methodology Key Features Scalability
MILP (exact) Tightest model, optimal on small instances N ≤ 15–30
VNSB/HMA/ALNS/ACO Domain-tailored metaheuristics, destroy/repair, cross-domain N ≤ 100 or more
Compositional (SMT/CP) Decomposition into submodels, supports conflicts/robustness Industrial-scale when conflict structure is critical
RL/DRL-based Learning for high-dimensional/rapid dispatch, generalization N ≤ 100–1000 (inference)

3. Extensions: Uncertainty, Sharing, V2G, and Advanced Constraints

Energy Consumption Uncertainty and Adaptive Robustness

Energy uncertainty is introduced via polyhedral uncertainty sets. Adaptive robust EVRPTW (AR-EVRPTW) is modeled as a two-stage min–max–min program, routing decisions are fixed before energy realization, with recourse for adaptive charging, and master-subproblem decomposition exploiting VNS and alternating direction (AD) algorithms as in (Jeong et al., 2022). Increased uncertainty budgets yield higher energy costs but dramatically increase route robustness and feasibility.

Time-Dependency, Shared Mobility, and Nonlinear Charging

Time-dependent EVRPTW frameworks model per-arc travel times as step functions (e.g., peaks/valleys for congestion), queueing at charging stations, partial charging, and different infrastructure types. Shared-mobility variants introduce pickup–drop-off pairings, in-vehicle passenger couplings, and time-window pairing for shared rides (Yazdiani et al., 16 Sep 2025). Nonlinear charging is approximated by piecewise-linear segments, with activation through binary variables and convex combinations. Results show significant reductions (up to 20%) in total vehicle-kilometers with small penalties in travel time.

Vehicle-to-Grid and Grid Price-Integrated Models

Some EVRPTWs integrate active grid-feedback: vehicles can discharge stored energy at grid node(s) under time-variant prices (Lin et al., 2020, Narayanan et al., 2022), with the objective function trading off travel/delivery cost and price-based discharge revenue. This links route and charge/discharge scheduling tightly to grid economics and time-of-use cost curves.

4. Deep Learning and Reinforcement Learning Approaches

Deep RL frameworks (e.g., curriculum-based DRL (Daysalilar et al., 21 Jan 2026), Edge-DIRECT (Mozhdehi et al., 2024), attention-pointer RL (Lin et al., 2020)) leverage graph-based encoders with spatial, temporal, energy, and time-window contextualization, and decoder architectures that explicitly model the heterogeneity of EVs and charging stations. Curriculum learning decomposes the constraint set incrementally (first routing, then energy, then strict time windows), mitigating policy collapse and improving feasibility under tight EVRPTW constraints (Daysalilar et al., 21 Jan 2026). Edge-enhanced attention, time-window overlap graph representation, and dual-attention decoders (vehicle and node) directly boost empirical performance, especially under highly heterogeneous fleet or network settings (Mozhdehi et al., 2024).

Key empirical observations:

  • Properly structured DRL architectures with domain priors (e.g., explicit SoC masking, charging detours) can generalize models trained on N=10 to N=100; feasibility and cost performance are competitive or superior to metaheuristics for large NN (Daysalilar et al., 21 Jan 2026, Lin et al., 2020, Mozhdehi et al., 2024).
  • RL approaches can infer competitive routes 10–100× faster than metaheuristics, albeit with a moderate optimality gap (typically <20%) (Lin et al., 2020, Narayanan et al., 2022).
  • Limitations include the lack of feasibility guarantees, difficulties with stochastic/dynamic inputs, and current restriction to static network data.

5. Industrial-Scale, Conflict-Free, and Multi-Period Routing

Conflict-free variants with road-segment capacity constraints, often motivated by automated or plant-based EV routing, require strict schedule separation. The problem is addressed by compositional SMT solving, with “master” routing and “checker” scheduling phases—the latter injecting “conflict cuts” generated from unsatisfiable core analysis (Roselli et al., 2021, Roselli et al., 2022, Roselli et al., 2021). This approach drastically outperforms monolithic encodings and guarantees completeness for practical plant layouts.

Multi-day EVRPTW is realized by introducing separator nodes, time-window linking across days, and flexible appointment planning, with specialized tabu search, ALNS, and ACO metaheuristics (Köster et al., 19 Dec 2025). For small problem sizes (N ≤ 15), MIP is optimal; for larger cases, tailored metaheuristics achieve near-optimal quality orders of magnitude faster, supporting realistic, high-variance daily appointment chains.

6. Benchmarking, Empirical Findings, and Managerial Insights

Extensive computational benchmarking shows that heuristic and metaheuristic algorithms provide near-optimal solutions at far larger scale than exact solvers (Table 1 below), with hybrid metaheuristics (HMA), cluster-sort-assign (CSA), and compositional solvers consistently outperforming generic baselines (EDF/NDF) by 10–40% in operational cost (Zheng et al., 2024, Khanda et al., 2 Aug 2025).

Approach Instance Scale (N) Typical Gaps vs. Optimal Solve Time (indicative)
MILP N ≤ 15–20 0% >103 s
HMA/LNS/CDNS 20 < N ≤ 200 <5% (avg), <2% (best) 1–300 s
CSA (cluster-sort) N ≈ 200 <1 delivery lost, ~20% cost reduction vs. EDF ~101 s
DRL/Edge-DIRECT N ≤ 100 2–10% gap, much faster ~100 s (greedy), ~103 s (sampling)

Key insights:

  • Partial recharging and flexibility in charging station visit timing are crucial for feasible routing and marked reductions in “time outside depot” (ΔTTOD ~ −30%) (Bruglieri et al., 2015, Zheng et al., 2024).
  • Introduction of time-dependent and shared mobility factors enables significant reductions in fleet miles with only minor increases in per-vehicle time, providing robust evidence for urban deployment of shared EV fleets (Yazdiani et al., 16 Sep 2025).
  • Adaptive robust models accommodate inherent uncertainties in consumption, providing a tunable tradeoff between operational cost and reliability (out-of-sample feasibility reaching >95%) (Jeong et al., 2022).
  • Conflict-free formulations with compositional SMT allow practical industrial deployment when capacity or collision constraints are present.

7. Future Directions and Methodological Gaps

Recent research points toward several frontiers:

  • Stochastic and dynamic routing: Inclusion of uncertainty in travel times, demands, and real-time order insertion remains a major challenge.
  • Integration of nonlinear, dynamically varying charging curves, and real-time grid coupling: Fine-grained models of charging behavior, battery degradation, and time-variant electricity prices demand hybrid optimization-stochastic control.
  • Hybridization of RL with metaheuristics may combine neural inference speed with local search optimality refinement.
  • Scalability beyond N=1000 likely requires hierarchical decompositions, distributed optimization, or real-time learning-based feedback (dynamic RL).
  • Open-source benchmark sets of real-world scale and complexity, coupled with metaheuristic and learning baselines, such as those introduced in (Zheng et al., 2024), are pivotal for progress.

The field continues to advance toward increasingly realistic, robust, and scalable models and solution methods, with cross-fertilization from operations research, constraint programming, and machine learning fueling rapid methodological evolution.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (15)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Electric Vehicle Routing Problem with Time Windows (EVRPTW).