Compositional Online Learning for Multi-Objective System Co-Design

Published 24 Apr 2026 in math.OC and eess.SY | (2604.22624v1)

Abstract: Many engineered systems must balance competing objectives, such as performance and safety, cost and reliability, or efficiency and sustainability, and are naturally modeled as compositions of interacting subsystems. We study online multi-objective decision-making in monotone co-design, where functionalities and resources are partially ordered, and the goal is to identify the target-feasible antichain of non-dominated trade-offs using few expensive evaluations. We introduce optimistic evaluators: history-dependent bounds on functionality and resource mappings that enable safe elimination of implementations before full evaluation. Based on these evaluators, we develop an elimination-based rejection-sampling algorithm, prove its soundness, and show that the admissible region shrinks monotonically as information accumulates. We instantiate the framework under monotonicity, Lipschitz continuity, and linear-parametric structure. For compositional co-design problems modeled by multigraphs, we show how local optimistic certificates propagate through the tractable remainder of the graph to yield system-level optimistic feasibility and resource bounds. Experiments on multi-robot fleet design, intermodal mobility systems, and synthetic monotone and Lipschitz benchmarks show substantial sample-efficiency gains over uniform sampling, Bayesian optimization, and multi-objective evolutionary algorithms.

Abstract PDF Upgrade to Chat

Authors (3)

Summary

The paper presents an order-theoretic framework that formally captures Pareto-optimal trade-offs using compositional design principles.
It introduces an online learning protocol with safe elimination, reducing costly evaluations by propagating optimistic bounds.
Empirical results on synthetic and real-world benchmarks show up to 3x performance gains over state-of-the-art multi-objective optimization methods.

Compositional Online Learning for Multi-Objective System Co-Design: Formal Summary and Analysis

Problem Formulation and Order-Theoretic Foundations

The paper "Compositional Online Learning for Multi-Objective System Co-Design" (2604.22624) establishes a rigorous order-theoretic framework for multi-objective system design, emphasizing compositionality, partial orders, and antichains to capture sets of Pareto-optimal trade-offs between functionalities and resources. Unlike scalarization or Euclidean reduction, system-level design is modeled directly on arbitrary posets, enabling reasoning about coupled subsystems and their interfaces within multigraph-structured co-design diagrams.

Definitions for monotone maps, antichains, lattice operations, and upper closures enable canonical representation of non-dominated solutions, forming the foundation for the compositional approach. The mapping between implementations, functionalities, and resources is via monotone relations, preserving native order structures and ensuring compositional reasoning across interconnected subsystems.

Online Multi-Objective Learning and Elimination-Based Sampling

The paper introduces a formal online learning protocol where an agent sequentially queries candidate implementations from an unknown subsystem. The objective is to recover the antichain of non-dominated, target-feasible resource trade-offs with minimal expensive evaluations, guided by the order structure of functionalities and resources.

Optimistic evaluators are constructed as history-dependent lower bounds for required resources and upper bounds for provided functionalities. The agent employs these evaluators to safely eliminate candidate implementations before incurring costly evaluations based on two criteria: (a) optimistic resource bounds are already dominated by the current antichain or (b) optimistic functionality bounds fail to satisfy the target.

Figure 1: Example for an implementation $DarkOrange2{i}$ eliminated by optimistic evaluation; left: functionality filtering, right: antichain dominance in the resource lattice.

The induced admissible set shrinks monotonically as information accumulates due to the monotonicity of the evaluators, guaranteeing sound elimination. The rejection sampler algorithm focuses evaluation budget on candidates that remain potentially informative, achieving safe elimination and monotone tightening.

Structural instantiations are given for monotonicity, Lipschitz continuity, and linear-parametric settings. For monotone resource maps, optimistic lower bounds are propagated via join operations over queried predecessors; for Lipschitz maps, bounds exploit local smoothness; for linear models, the confidence set over parameter vectors tightens with accumulated data.

Compositional Propagation and Multigraph System-Level Reasoning

Extending to compositional multi-block co-design, the framework leverages local optimistic evaluators in the expensive subsystem and propagates these through the tractable remainder of the multigraph-based system via monotone operators and fixed-point computation. The completion set for a candidate in the expensive block is solved over the tractable part, yielding propagated optimistic system-level resource antichains.

Safe elimination is possible at the system level via the propagated antichain: if all completions are dominated by the current antichain, the local candidate can be discarded. Monotonicity in history ensures that admissible sets shrink for both local and compositional settings, yielding correctness guarantees for the elimination sampler in the multigraph framework.

Numerical Experiments: Synthetic and Realistic Co-Design Case Studies

The paper benchmarks the approach against leading multi-objective evolutionary algorithms (MOEA/D, NSGA-III, RVEA, KGB), Bayesian optimization (qLogEHVI), and structure-free sampling (Halton sequence). Experiments span synthetic monotone and Lipschitz settings, as well as realistic co-design problems (intermodal mobility and heterogeneous multi-agent fleet design).

On synthetic monotone and Lipschitz benchmarks, elimination-based online learning achieves the lowest cumulative hypervolume difference ( $HVD$ ) across nearly all tested instances, outperforming evolutionary and Bayesian baselines by margins up to 3x. The improvement over Halton demonstrates the value of structural elimination beyond uniform coverage.

Figure 2: Hypervolume difference $HVD$ vs. iterations for intermodal mobility co-design; structural elimination substantially reduces budget for exact Pareto recovery.

Figure 3: $HVD$ vs. iterations for heterogeneous multi-agent system co-design; the compositional algorithm improves over quasi-random search and evolutionary baselines.

In real-world co-design benchmarks, the structural elimination method achieves rapid exact Pareto recovery with far fewer expensive evaluations. Ablation studies show robustness to mild model misspecification via relaxed rejection rules, and per-step CPU time remains two orders-of-magnitude below the cost of a single co-design simulation.

Theoretical and Practical Implications

The proposed framework enables modular, sample-efficient online learning directly over compositional systems with possibly partially ordered, heterogeneous interfaces. The order-theoretic paradigm allows integrating continuous and discrete components, propagating subsystem-level optimism to system-level feasibility certificates, and focusing evaluation resources on informative candidates.

Implications include:

Rigorous guarantees for safe elimination and monotone tightening, independent of particular performance metrics.
Substantial reduction in sample complexity for expensive simulations, critical for engineering domains such as mobility planning and robotics.
A formal pathway for integrating structural knowledge (monotonicity, smoothness, linearity) into practical system design optimization workflows.

Future Directions

Open problems identified include the derivation of quantitative sample complexity bounds for admissible set shrinkage, extension to coordinated exploration across multiple expensive subsystems, incorporation of stochastic confidence bounds for noisy evaluations, and development of advanced acquisition strategies (e.g., information-directed sampling, pessimistic prioritization).

Conclusion

The paper sets forth a compositional, order-theoretic framework for online multi-objective system co-design, providing algorithms and guarantees for optimally efficient exploration under structural assumptions and in compositional diagrams. Empirical and formal results demonstrate significant gains in sample efficiency for expensive design domains, with broad applicability to modular engineering systems and multidisciplinary optimization.

Markdown Report Issue