TradeFM: Multi-Domain Market & Model Framework

Updated 2 March 2026

TradeFM is a multi-domain framework that integrates generative modeling, market-based allocation, and auction mechanisms to optimize diverse systems.
It employs rigorous mathematical techniques, including scale-invariant tokenization, linear programming, and reinforcement learning to enhance performance and generalization.
Validated across finance, air traffic flow, and federated learning, TradeFM demonstrates improved synthetic data generation, operational efficiency, and model trading accuracy.

TradeFM encompasses several distinct frameworks and models across domains, each leveraging market or foundation-model-based mechanisms to address allocation, generative modeling, or collaborative optimization. These include: (1) a generative foundation model for trade flow and market microstructure, (2) a market-based allocation protocol for air traffic flow management, and (3) an auction-driven marketplace for model trading in federated learning. Each variant is underpinned by rigorous mathematical formulations and domain-specific equilibrium, generative, or auction-theoretic machinery.

1. TradeFM in Market Microstructure Modeling

TradeFM, introduced as a “Generative Foundation Model for Trade-flow and Market Microstructure,” is a 524M-parameter decoder-only Transformer tailored for financial market event streams (Kawawa-Beaudan et al., 27 Feb 2026). The architecture comprises 32 Llama-style layers with grouped-query attention, rotary positional encodings, a model width of $d_{\text{model}}=1024$ , and a feed-forward dimension of $d_{\text{ff}}=4096$ . The model operates autoregressively: at each step $t$ , it predicts the next trade token $i_{\text{trade},t}$ based on all prior discrete tokens $i_{<t}$ .

The input per step is a tuple $[i_l, I_{MP}, i_{\Delta p_t}, i_{\text{trade}}]$ , reflecting liquidity bin, market/participant indicator, price-level change bin, and a composite trade token. The training objective is cross-entropy maximization over a discretized vocabulary comprising 16,384 composite tokens.

A key methodological advancement is the construction of scale-invariant features—interarrival time, log-transformed volume, normalized price depth, and relative mid-price—discretized via quantile or log-binning to remove scale and unit dependencies across assets. The mid-price $\hat{p}_t^{\rm EW\text{-}VWAP}$ is estimated using an exponentially-weighted VWAP, robustly adapting to multi-asset scenarios.

This universal tokenization, combined with robust feature engineering, permits TradeFM to generalize zero-shot across geographies and liquidity tiers, circumventing asset-level calibration.

2. Closed-Loop Simulation and Stylized Facts

TradeFM integrates with a deterministic market simulator maintaining a limit order book with price-time priority. In a closed loop: TradeFM samples the next trade event token, decodes it, the simulator updates the LOB and mid-price, and updated context tokens are fed to the next prediction step.

This rollout mechanism enables evaluation of whether generated order flow reproduces stylized empirical facts, including:

Heavy-tailed log-return distributions (kurtosis $\kappa(\Delta t_r)$ for short intervals).
Volatility clustering (slow decay of $|\text{returns}|$ autocorrelation).
Absence of return autocorrelation at standard lags.
Emergent aggregational Gaussianity at larger time scales (kurtosis approaching 3).

TradeFM's rollouts were shown to quantitatively align with real-data statistics across these metrics.

3. Quantitative Evaluation and Cross-Market Generalization

The benchmark includes Kolmogorov–Smirnov (KS) and Wasserstein-1 (W₁) distances for log-return and order flow features, compared to Compound Hawkes process baselines. Across intervals from 10–120s, TradeFM achieves approximately 2–3x lower KS and W₁ distances versus Hawkes models; e.g., at $10$ s returns, TradeFM's KS is $0.013$ versus Hawkes' $0.039$.

The model generalizes robustly, with zero-shot perplexity degradations of less than 20% on APAC equities, supporting the claim that scale-invariant representations enable transfer across diverse markets without per-asset retraining.

TradeFM's main applications are synthetic data generation (for privacy-preserving backtesting and illiquid asset augmentation), stress-testing via counterfactual market evolution, reinforcement learning for optimal execution and agent-based modeling, and statistical transfer learning across market regimes (Kawawa-Beaudan et al., 27 Feb 2026).

4. TradeFM in Air Traffic Flow Management

A separate line of research coined the “TradeFM” moniker for a market-based mechanism in air traffic flow management (ATFM), focusing on efficiency and fairness in landing slot allocations (Vazirani, 2011). This framework establishes a market where airlines purchase reductions in flight delays.

Goods traded: units of landing delay.
Agents: each flight (buyer) and the FAA (seller of landing slots with delay).
Private parameter: each flight's “criticality factor” $a_i>0$ , representing the dollar cost per unit delay.
Supply/demand: flights demand exactly one slot, with slot capacities $c(s)$ ; delays $d_i(s)$ are slot-specific.

The core mathematical formulation is a primal-dual LP for minimizing total weighted delay cost: $\min_{x_{is}} \sum_{i \in A} \sum_{s \in W(i)} a_i d_i(s) x_{is}$

$\text{subject to}: \sum_{s \in W(i)} x_{is} \ge 1,\quad \sum_{i: s \in W(i)} x_{is} \le c(s),\quad x_{is} \ge 0$

The dual LP yields equilibrium slot prices $p_s$ and total flight costs $t_i$ with Walrasian equilibrium properties. The constraint matrix is totally unimodular, ensuring integral allocation (i.e., binary $x_{is}$ ). A minimum-cost perfect $b$ -matching in a bipartite graph (flights and slots, plus a dummy vertex) yields the allocation and prices efficiently via strongly polynomial algorithms.

The model's fairness paradigm is “pay for priority”: higher $a_i$ flights secure lower delays, while slack capacity slots are priced at zero. FAA revenue can be recycled to subsidize low- $a_i$ flights, achieving a spectrum of service levels (Vazirani, 2011).

5. TradeFM as a Marketplace for Federated Learning

In federated learning, TradeFM refers to an auction-based market where clients trade models to maximize both revenue and downstream task accuracy (Cui et al., 2024). Each client in round $t$ acts as:

Seller of its current local model parameters $A_i^t$ .
Buyer submitting a vector $b_i^t$ (bid per unit of expected performance gain) for others' models.

Key constructs:

Performance gain for buyer $i$ at round $t$ :

$G_i^t = \frac{M_i^t - M_i^0}{M_i^0},\quad \Delta G_i^t = G_i^t - G_i^{t-1}$

where $M_i^t$ is post-aggregation accuracy.

Auction mechanism: server selects authorized sellers in each round, collects bids, computes allocation and payment via functions $f_a$ and $f_p$ , enforcing monotonicity and critical-bid pricing (generalized second-price).
Theoretical guarantees: truthfulness is optimal under restricted-copy and selective-authorization conditions.
Payments: per-unit price for each winner is determined by the next-best bid, scaled by realized gain.
Post-auction aggregation: FedAvg across purchased models.

To maximize trading revenue and fairness, allocation is learned adaptively by an advantage actor-critic (A2C) reinforcement learning policy. The allocation action $a^t$ is $a^t = b^t \circ \pi(s^t;\theta)$ , where $\pi$ is a softmax actor over state $s^t$ (bids, model and performance info from authorized sellers). The reward includes total revenue and fairness penalties.

TradeFM empirically outperforms standard GSP auctions in both revenue and accuracy (e.g., on MNIST: $68.0 \pm 3.8$ vs. $45.4 \pm 4.5$ in trading volume, $97.8\%$ vs. $97.3\%$ in accuracy), and remains robust against diverse bidding strategies (Cui et al., 2024).

6. Comparative Overview

Setting	Key Mechanism	Objective/Pricing Rule
Financial microstructure (Kawawa-Beaudan et al., 27 Feb 2026)	Generative Transformer, scale-invariant tokens	Synthetic event simulation, distributional fidelity
Air traffic (Vazirani, 2011)	Market for delay/slot allocation (LP + b-matching)	Minimize total weighted delay, Walrasian equilibrium prices
Federated learning (Cui et al., 2024)	Auction market, learning-based allocation (RL)	Maximize trade revenue, accuracy, incentive compatibility

Each variant of TradeFM adapts “market,” “foundation model,” or “auction” principles to the algebraic, statistical, or social utility structures of its respective domain. All share an orientation toward optimizing allocation under information asymmetry and scalable computation.

7. Context, Significance, and Implications

TradeFM's financial foundation model demonstrates that scale-invariant tokenization enables cross-asset generalization, outperforming classical statistical point-process models and providing a pathway for broader synthetic data generation and agent-based research. In federated learning, the TradeFM market paradigm operationalizes distributed model exchange as auction-enabled resource sharing with provable incentive properties and measurable gains in economic and accuracy metrics. The air traffic TradeFM instantiation efficiently resolves resource allocation via algorithmic market design, delivering integral equilibria and polynomial complexity.

A plausible implication is that abstracting heterogeneous, dynamic systems into market or generative frameworks with scale-agnostic features or truthful-incentive mechanisms yields transferable, robust solutions in both algorithmic and economic terms. The various TradeFM frameworks collectively illustrate the confluence of machine learning, optimization, and market theory in contemporary systems engineering and data-driven modeling (Vazirani, 2011, Cui et al., 2024, Kawawa-Beaudan et al., 27 Feb 2026).