Discrete Traffic State Encoding
- DTSE is a discrete representation of traffic states, encoding vehicle occupancy, speed, and signal phases into finite spatial and velocity cells.
- It employs systematic quantization methods in both deep reinforcement learning and kinetic traffic models, balancing granularity and computational tractability.
- DTSE enhances traffic control performance by reducing delays and queue lengths and accurately reproducing traffic dynamics, underpinning adaptive signal control strategies.
A Discrete Traffic State Encoding (DTSE) is a formalism by which the state of a vehicular traffic system is represented as a collection of discrete units—either spatial, velocity, or phase variables—to enable efficient and information-dense state description for control, simulation, and analytic purposes. DTSEs have been developed and utilized in both deep reinforcement learning for traffic signal control and in kinetic theory models of vehicular traffic, providing a flexible and precise bridge between microscopic reality and macroscopic modeling. The following sections systematically detail DTSE’s foundations, mathematical definitions, construction procedures, and impact in recent research.
1. Formal Foundations of Discrete Traffic State Encoding
The core principle underlying DTSE is the partition of position and velocity space into finite, discrete sets, such that the state of traffic at any moment is fully specified by occupancy and vehicle properties in each cell and velocity class. In the reinforcement learning for traffic control context, (Genders et al., 2016) defines the instantaneous state as
where:
- indicates presence (1) or absence (0) of a vehicle in spatial cell of lane ,
- is the normalized speed for occupied cells (else zero),
- is a one-hot phase vector representing the current traffic signal phase.
Analogously, (Fermo et al., 2012) describes DTSE in a kinetic theory framework as placing each vehicle into one of finitely many pairs, where marks spatial cell, and labels velocity class. The state space is then
and the system is described by the distribution function .
This formalization enables systemic discretization of both space and velocity, such that all dynamical, control, or analytical processes are executed on a finite-dimensional space.
2. DTSE Construction Methodologies
DTSE construction proceeds via sequential quantization and encoding steps. In the reinforcement learning setting (Genders et al., 2016), the process entails:
- Parameter Selection: Determining the observation horizon and cell length ; cells per lane.
- Lane Discretization: Partition of roadway segment into contiguous cells for each of incoming lanes.
- Occupancy Encoding : Each vehicle detected in on lane is mapped to for .
- Speed Encoding : For each occupied cell, retrieve speed and set ; else set to zero.
- Phase Encoding : Generate a one-hot vector specifying the current phase among the set of available actions.
In the kinetic theory approach (Fermo et al., 2012), the road is divided into cells of length , and velocities are quantized into classes. The discrete state is thus , with the population measured by .
Both frameworks rely on consistent and lossless mappings from raw simulation or measurement data to the respective discrete state representations, ensuring preservation of spatial and velocity granularity.
3. Mathematical Dynamics and Numerical Implementation
Reinforcement Learning DTSE: The DTSE forms the input to a deep CNN agent. The network processes and as separate channels, using convolutional towers with the following architecture (Genders et al., 2016):
- Conv1: 16 filters, size , stride 2, ReLU activation.
- Conv2: 32 filters, size , stride 1, ReLU activation.
- Towers are flattened, concatenated with , then processed by two dense layers (sizes 128 and 64, ReLU).
- Final linear layer produces Q-values for action selection.
Kinetic Theory DTSE: Dynamic evolution is governed by a set of ODEs for : where is a flux-limiter (enforcing local density constraints), and encodes acceleration, braking, and tandem interactions via probabilistic kernels . Numerical integration employs explicit Euler schemes under CFL-type conditions to ensure stability.
4. Parameter Choices and Representation Rationale
Reinforcement Learning Example Parameters (Genders et al., 2016):
- (approximate span of 15 car lengths), (average vehicle length), ,
- lanes,
- phases,
- Input tensor: -dimensional for , plus 4-dimensional .
This configuration captures individual vehicle granularity while maintaining computational tractability. Observation and quantization parameters reflect a balance between spatial/velocity resolution and input dimensionality.
Kinetic Theory Example Parameters (Fermo et al., 2012):
- cells, velocity classes, , , , , time step .
- Key parameters (road condition), (anticipation).
These parameters allow macroscopic traffic patterns—such as free/congested flow phase transition at empirically realistic critical densities—to emerge from a finite, interpretable set of microstates.
5. Expressiveness and Analytical Properties
DTSE provides a rich, yet tractable, embedding of the traffic state. In the deep RL context, the DTSE outperforms low-dimensional state representations, yielding:
- 82% reduction in cumulative delay,
- 66% reduction in average queue length,
- 20% reduction in average travel time,
- Comparable throughput to baselines
(Genders et al., 2016). The scheme’s expressive power arises from explicit modeling of individual vehicle positions and velocities per cell, as opposed to summary statistics (e.g., queue length per lane).
In kinetic traffic models, discrete state encoding enables exact reproduction of granular effects—such as queue formation behind bottlenecks and stop-and-go traffic at signals—through local and pairwise state updates (Fermo et al., 2012). The emergence of fundamental diagrams (flow vs. density, speed vs. density) and phase transitions is obtained directly from the dynamics over the discrete state space.
6. Applications in Traffic Control and Modeling
DTSE has proven especially impactful in:
- Deep RL for Adaptive Traffic Signal Control: Feeding DTSE into convolutional agents controlling intersection signals in micro-simulated environments (SUMO) enables context-sensitive, adaptive policies with robust performance improvements over traditional methods (Genders et al., 2016).
- Analytically Tractable Kinetic Models: DTSE forms the phase space for ODE systems in kinetic traffic theories, supporting rigorous paper of large-scale, realistic networks while encapsulating microscopic features (Fermo et al., 2012).
- Reproducibility and Benchmarking: Because the encoding is formally defined and constructed directly from raw data or simulation, DTSE-based studies are reproducible and facilitate robust cross-comparison.
These applications collectively demonstrate that information-dense, finite-dimensional representations such as DTSE are foundational to modern approaches for modeling, controlling, and analyzing vehicular traffic systems.