Trip-Based Sampling Optimization
- The paper presents a novel optimization framework that leverages timetabled bus trips and trip chains to maximize spatial-temporal sensor coverage in urban settings.
- It employs a sequential three-stage formulation—including bus-line pre-selection, minimum-fleet sizing, and sensor allocation—to efficiently manage computational complexity while ensuring high coverage.
- The joint bi-level formulation co-optimizes scheduling and sensor placement, reducing sensor requirements by up to 22% and substantially increasing grid–time coverage.
Trip-based sampling is an optimization framework for the deployment of a limited number of mobile sensors on fleet buses, aiming to maximize spatial-temporal coverage for drive-by sensing tasks (such as air quality, traffic state, and road roughness monitoring). The methodology explicitly incorporates timetabled bus trips, exploits the structure of trip chains (ordered sequences of trips served by the same bus), and reconciles operational constraints on minimal fleet size with coverage maximization, all while maintaining computational tractability at city scale (Ji et al., 2023).
1. Problem Setting and Definitions
The spatial domain is discretized into grids (e.g., ), and time is segmented into intervals of fixed length (e.g., 60 min). The bus network consists of lines, each with a fixed timetable. A trip on line is specified as : start/end terminals, scheduled departure time, and duration. Dead-heading time defines the non-service interval between consecutive trips by the same vehicle.
A trip chain is a feasible, time-respecting sequence of timetabled trips a physical bus can serve in a day. Chains are feasible only if the dead-heading constraints are satisfied: for all transitions in the chain. Up to identical sensors may be installed, each assigned to a trip chain (i.e., bus), conferring sensing ability for the whole chain.
Coverage is defined at the grid–time pair level: if at least one sensor-equipped bus is in grid during , zero otherwise. Spatial () and temporal () weights (normalized to sum to 1) model heterogeneous monitoring priorities. The global coverage (sensing reward) is
Operational constraints include complete fulfillment of the timetable with the minimal fleet size (minimum-fleet principle), ensuring sensor assignment does not compromise service.
2. Sequential Three-Stage Formulation
Stage A: Bus-Line Pre-Selection
To reduce problem size, a set cover is solved to select a minimal subset of lines covering at least a fraction of all reachable grids (with yielding full coverage). Let if line covers grid . The binary program minimizes subject to constraints ensuring sufficient grid coverage and logical consistency.
Stage B: Minimum-Fleet Sizing per Line
For each selected line , a bipartite matching is solved to minimize the number of buses required while chaining trips into feasible sequences. Variables indicate whether trip is served immediately after . The minimum fleet for line is , with the number of trips on . Matched pairs are extracted to form all trip chains .
Stage C: Sensor Allocation to Trip Chains
Sensor assignment is phrased as a 0-1 integer program over all trip chains. Binary variable flags instrumented chains. For each trip and grid–time pair, indicator marks if trip covers . Constraints ensure no more than sensors are assigned, and that every covered grid–time pair is supported by at least one equipped bus.
These distinct stages—pre-selection, fleet sizing, sensor allocation—frame the trip-based sampling approach as a sequence of linked optimization problems.
3. Joint Bi-level Formulation
The joint bi-level model addresses the sub-optimality arising from fixing trip chains in advance, instead co-optimizing scheduling and sensing assignments per line.
- Upper Level: Across all lines, integer variables distribute the available sensors, maximizing total coverage by blending information on how many sensors to assign per line (subject to per-line saturation ).
- Lower Level (per line): For a given , the problem is to select trip chains to be instrumented, optimizing the coverage contributed by that line. Variables model which chains and trip transitions are chosen, and their resulting sensing impact.
The bi-level structure is separable by line, allowing parallel solution, with sensor allocation at the upper level guided by lower-level computations of attainable coverage for each .
The two levels interact only through the mappings , with denoting grid–time coverage from line equipped with sensors.
4. Algorithmic Workflow and Computational Properties
The algorithm proceeds as follows:
- Line Pre-Selection: The set cover step significantly reduces the problem size, selecting relevant lines.
- Per-Line Optimization: For each chosen line,
- The fleet sizing (bipartite matching) is solved in time (max-flow/assignment).
- Model reduction prunes superfluous link variables where idle times exceed a threshold , preserving optimal fleet size and saving up to 90% in problem dimensionality.
- For , the pruned mixed-integer program is solved to find and associated coverage. Computation stops when further sensors do not increase coverage (at saturation ).
- Global Sensor Allocation: The upper-level knapsack-like integer program (in variables) allocates sensors to lines.
Each line's lower-level problem is independent, and the reduced after pre-selection enables sub-linear scaling in . In contrast, a naïve vehicle-based approach is combinatorial in the total number of buses or trip chains.
5. Empirical Study: Chengdu Case
A comprehensive real-world test covers within Chengdu’s 4th Ring Road, with $400$ one-kilometer grids and service from am to pm. Three temporal granularities ( min) are examined, . Spatial weights are derived from traffic and emission data.
Of $167$ bus lines, pre-selection (with ) yields lines ensuring full grid coverage. These require a minimum fleet of $684$ buses for $6,006$ trips. To achieve coverage of grid–time pairs at min, the sequential approach requires $49$ sensors; the joint bi-level model needs only $38$ (a reduction of ). The number of grids fully covered in every interval increases by $41$– under the joint model. Almost every line saturates at sensors for min, and for min.
Computation times are significantly improved after pre-selection: for fleet-sizing on all $167$ lines versus on $38$ lines; sensor allocation MILPs take minutes instead of . Pruning with min (idle time) reduces solution time by $25$– without degrading coverage.
| Aspect | Sequential Approach | Joint Bi-level Approach |
|---|---|---|
| Sensors for 90% cover | 49 | 38 |
| Increase in 100% grids | Baseline | +41–238% |
| Saturation per line | (60 min) | (60 min) |
6. Model Extensions and Practical Recommendations
Multiple model extensions are available for operational realism:
- Service gaps: Dummy trips with fixed time windows (e.g., for breaks or charging) can be inserted, with chain assignment constraints.
- Bus relocations: Forbidden by taking or penalized with a multi-objective cost term ().
- Operational costs: Additional terms for total fleet size () and dead-heading ( or ).
- Uncertain service times/speeds: Addressable through robust or stochastic variants, or corrected via subsequent data processing.
A practical rule of thumb is to assign one sensor per selected line and prioritize a second sensor to lines with large one-way trip durations, to close temporal coverage gaps for coarse .
The trip-based methodology thus tightly integrates the combinatorics of fleet scheduling with the needs of optimal spatial-temporal sensor allocation. It achieves near-optimal city-scale coverage under realistic operational constraints and computational budgets, with the decoupling by lines ensuring both tractability and deployment feasibility (Ji et al., 2023).