Adaptive Fused Dual-domain Seasonal MLP
- Adaptive Fused Dual-domain Seasonal MLP is a neural forecasting module that separates robust frequency signals and subtle time-domain patterns for accurate seasonal predictions.
- It employs parallel MLP branches—with FFT-based frequency processing and time-domain weak signal detection—that are adaptively fused using learnable channel-wise weights.
- Empirical results from architectures like MDMLP-EIA and xPatch show improved MSE/MAE metrics, confirming its effectiveness in capturing intricate seasonal patterns.
An Adaptive Fused Dual-domain Seasonal MLP is an architectural module for time series forecasting that decomposes seasonality into strong and weak spectral components, processes them through dedicated frequency- and time-domain MLP branches, and adaptively fuses their predictions using a learnable channel-wise mechanism. This approach is designed to capture both high- and low-energy seasonal signals, mitigate noise amplification in weak channels, and yield robust, accurate forecasts within an efficient, channel-independent neural forecasting framework (Zhang et al., 13 Nov 2025, Stitsyuk et al., 23 Dec 2024). Two recent instantiations include the MDMLP-EIA architecture, which introduces a frequency/time-split with adaptive channel fusion, and the dual-stream xPatch, which fuses MLP and CNN streams after exponential seasonal-trend decomposition.
1. Motivation and Context
Neural time series forecasting models, particularly channel-independent MLP-based methods, excel in efficiency and parameter economy but traditionally suffer from two weaknesses: inefficient integration of weak seasonal signals and poor noise handling in the fusion of seasonality features. Addressing these gaps requires separating seasonality into orthogonal domains (frequency and time), applying domain-specialized feature extraction, and adaptively fusing their outputs. Classical anomalies—where strong periodicities are well-modeled but weak, informative cycles are discarded as noise—necessitate this dual-domain approach. Adaptive fusion, with minimal additional parameters, counters overfitting and noise magnification while harnessing all informative signals in a scalable way (Zhang et al., 13 Nov 2025).
2. Problem Formulation and Seasonal Decomposition
Given a multivariate input series (length , channels), the input is first normalized (RevIN) and then decomposed via Exponential Moving Average (EMA):
- Trend extraction: For ,
(trend), (seasonal + residual).
- In MDMLP-EIA, the seasonal signal is modeled as , distinguishing dominant from weak periodicities by spectral energy (Zhang et al., 13 Nov 2025). In xPatch, the residual from EMA represents all seasonality, passed for nonlinear extraction (Stitsyuk et al., 23 Dec 2024).
3. Dual-domain Model Architecture
The Adaptive Fused Dual-domain Seasonal MLP comprises parallel paths for the seasonal component:
Frequency-domain branch (strong seasonality)
- Embedding and real FFT: Transform to frequency domain.
- FreMLP: Two-layer MLP processes FFT coefficients.
- Inverse FFT: Reconstruct time-frequency features.
- StrongSeasonalMLP: Maps features to forecast horizon.
Formally:
Time-domain branch (weak seasonality)
- WeakSeasonalMLP: Two-layer MLP (, dropout) on (shape ) for low-energy cycles, output .
Fusion via Adaptive Zero-initialized Channel Fusion (AZCF)
- Fused prediction:
with , initialized at zero, learned per channel, and broadcast over the output horizon. This mechanism ensures weak-branch integration only where it reduces loss, minimizing noise risk (Zhang et al., 13 Nov 2025).
In xPatch (Stitsyuk et al., 23 Dec 2024), a related structure uses an MLP linear stream for trend and a CNN nonlinear stream for the EMA residual, with late fusion by concatenation followed by a projection.
4. Integration With Forecasting Pipeline
Within MDMLP-EIA (Zhang et al., 13 Nov 2025):
- Decomposition: Input → RevIN + EMA → (trend), (season)
- Trend branch: → TrendMLP (→ per channel)
- Seasonal branch: → [Freq and time MLPs] → AZCF fusion →
- Energy-Invariant Attention (EIA): Merges trend () and seasonal () predictions:
is produced by a channel-attention MLP. The factor 2 keeps the -energy of invariant under all .
- Final output: Inverse RevIN.
In xPatch (Stitsyuk et al., 23 Dec 2024), after EMA decomposition, each channel processes its trend part via linear MLPs and its seasonality via patch embedding, grouped depthwise CNNs, and MLP heads; outputs are concatenated and linearly combined for each forecast vector.
5. Implementation and Complexity Considerations
Key architectural parameters (Zhang et al., 13 Nov 2025):
- Dynamic Capacity Adjustment: Number of MLP neurons per layer times the input length, ensuring scalability.
- MLPs: Channel-independent weight sharing, LeakyReLU (strong), (weak), dropout $0.1$.
- Fusion parameters: AZCF introduces only new parameters; EIA adds .
- Overall computational complexity: per step.
For xPatch (Stitsyuk et al., 23 Dec 2024), EMA smoothing is grid-searched and fixed, not learned. CNN seasonal stream uses patching, GUEL activations, batch norm, depthwise and pointwise convolutions for signal-specific nonlinear modeling. Fusion is via concatenation and a final linear projection.
6. Empirical Performance and Benchmarking
Ablation studies in MDMLP-EIA show (Zhang et al., 13 Nov 2025):
- MSE/MAE (five datasets, ; lower is better):
| Model Variant | MSE | MAE | |:-------------:|:-----:|:-----:| | w/o WeakSeasonal | 0.339 | 0.355 | | MLP fusion | 0.341 | 0.355 | | AZCF (ours) | 0.338 | 0.353 |
- On ETTh2, AZCF yields lower MSE compared to "w/o WS" and over standard fusion. The method preserves full oscillation amplitude and extreme values in long-horizon scenarios, outperforming baselines that fail to recover low-energy cycles.
xPatch achieves state-of-the-art mean MSE/MAE on 60–90% of benchmarks under various hyperparameter regimes, consistently outperforming prior non-transformer models (CARD, TimeMixer, PatchTST) by 2–8% absolute MSE and MAE margins (Stitsyuk et al., 23 Dec 2024). Each structural component—including dual-domain design and fusion—offers additive improvements in ablation, confirming the contribution of adaptive, fused dual-domain modeling for robust and accurate prediction.
7. Significance, Limitations, and Extensions
The Adaptive Fused Dual-domain Seasonal MLP advances channel-independent neural forecasting by:
- Systematically recovering both dominant and subtle periodicities.
- Limiting noise amplification via learnable, regularized per-channel fusion.
- Enabling scalable capacity through dynamic adjustment tied to channel count.
The AZCF rule is parameter-efficient ( vs for full attention). A plausible implication is that similar adaptive, domain-split architectures may generalize to other multivariate sequence domains where weak signals carry crucial predictive information.
Limitations include marginal incremental gains for some datasets and fixed (rather than learned) EMA factors in certain variants (Stitsyuk et al., 23 Dec 2024). Future research may explore more expressive domain decompositions, non-linear fusion rules, or domain-adaptive smoothing parameters to further enhance model flexibility.
References:
- MDMLP-EIA: "MDMLP-EIA: Multi-domain Dynamic MLPs with Energy Invariant Attention for Time Series Forecasting" (Zhang et al., 13 Nov 2025)
- xPatch: "xPatch: Dual-Stream Time Series Forecasting with Exponential Seasonal-Trend Decomposition" (Stitsyuk et al., 23 Dec 2024)