Papers
Topics
Authors
Recent
Search
2000 character limit reached

Channel-wise Conditional Prompting (CCP)

Updated 1 April 2026
  • Channel-wise Conditional Prompting (CCP) is a method integrating channel state into DeepJSCC, enabling dynamic adaptation to varying wireless conditions.
  • It employs a Channel State Prompt (CSP) mechanism that modulates intermediate feature maps via element-wise multiplication and Transformer fusion.
  • Empirical results show significant improvements in PSNR and memory efficiency, demonstrating CCP's scalability and robustness across diverse channel environments.

Channel-wise Conditional Prompting (CCP), as instantiated by the Channel State Prompt (CSP) mechanism in Prompt JSCC (PJSCC), is a differentiable architectural approach for integrating physical channel state information into deep learning-based joint source-channel coding (DeepJSCC). This design enables wireless semantic communication systems to adapt dynamically to varying signal-to-noise ratios (SNR) and channel distributions without the need for multiple specialized models, addressing fundamental generalization limitations in prior DeepJSCC architectures (Zhang et al., 2024).

1. Architectural Structure of CSP in DeepJSCC

The CSP module is embedded within the PJSCC pipeline between Swin-Transformer stages in both encoder (EϕE_\phi) and decoder (DθD_\theta) branches. In the encoder, an intermediate feature map y1∈Rc×h×w\mathbf{y}_1 \in \mathbb{R}^{c \times h \times w} from preceding Transformer Feature Extraction (TFE) blocks is passed to the CSP:

  • GAP+MLP Feature Extraction: Global Average Pooling (GAP) reduces y1\mathbf{y}_1 spatially, and an MLP maps it to a compact vector y2∈Rc′\mathbf{y}_2 \in \mathbb{R}^{c'} representing channel-aware features.
  • Prompt Retrieval: Based on the current channel state (βSNR,βdis)(\beta_{SNR}, \beta_{dis}), a prompt tensor pije∈Rc′×h′×w′\mathbf{p}_{ij}^e \in \mathbb{R}^{c' \times h' \times w'} is selected from the prompt set Cγ\mathbf{C}_\gamma.
  • Element-wise Modulation: The compact feature y2\mathbf{y}_2 and prompt tensor pije\mathbf{p}_{ij}^e are combined via element-wise multiplication to yield DθD_\theta0.
  • Spatial Expansion: A convolution is applied to DθD_\theta1 to restore spatial dimensions.
  • Transformer Fusion: This modulated feature is concatenated with the original DθD_\theta2, processed through a Transformer block and a convolution to yield the output feature DθD_\theta3.

Decoder-side CSPs are structurally identical, operating on feature maps within DθD_\theta4 and using dedicated prompt tensors DθD_\theta5. CSP modules are inserted between each pair of Transformer blocks along both encoding and decoding pathways (Zhang et al., 2024).

2. Mathematical Formulation of Prompt Generation and Modulation

The CSP employs a discrete set of learnable prompt tensors DθD_\theta6, indexed by all combinations of SNR levels and channel distributions. For DθD_\theta7 CSP stages and DθD_\theta8 channel states:

DθD_\theta9

For input feature y1∈Rc×h×w\mathbf{y}_1 \in \mathbb{R}^{c \times h \times w}0, channel-aware projection:

y1∈Rc×h×w\mathbf{y}_1 \in \mathbb{R}^{c \times h \times w}1

Retrieval and fusion use the matching prompt tensor y1∈Rc×h×w\mathbf{y}_1 \in \mathbb{R}^{c \times h \times w}2:

y1∈Rc×h×w\mathbf{y}_1 \in \mathbb{R}^{c \times h \times w}3

y1∈Rc×h×w\mathbf{y}_1 \in \mathbb{R}^{c \times h \times w}4

y1∈Rc×h×w\mathbf{y}_1 \in \mathbb{R}^{c \times h \times w}5

Only the prompt from y1∈Rc×h×w\mathbf{y}_1 \in \mathbb{R}^{c \times h \times w}6 corresponding to the sampled y1∈Rc×h×w\mathbf{y}_1 \in \mathbb{R}^{c \times h \times w}7 is used at each forward pass.

3. Feature and Prompt Fusion Strategy

CSP’s fusion scheme consists of three stages:

  1. Channel-wise Modulation: Element-wise multiplication of the compacted feature vector and prompt tensor aligns semantic information with channel state.
  2. Spatial Merging: A small convolutional operator reconstructs the modulated feature into full spatial dimensions for the next network stage.
  3. Transformer Fusion: Concatenation with the original feature map is followed by a Transformer block and convolution for joint feature aggregation.

This design provides a fully differentiable, unified mechanism for channel-adaptive feature modulation, allowing the backbone network to learn to prioritize information transmission based on granular channel conditions.

4. Training Objective and Adaptability

The PJSCC framework with CSP modules is trained end-to-end using mean-squared error (MSE) loss between the input image y1∈Rc×h×w\mathbf{y}_1 \in \mathbb{R}^{c \times h \times w}8 and its reconstruction y1∈Rc×h×w\mathbf{y}_1 \in \mathbb{R}^{c \times h \times w}9:

y1\mathbf{y}_10

where y1\mathbf{y}_11 and y1\mathbf{y}_12 denotes the received signal realizations. Model parameters y1\mathbf{y}_13, y1\mathbf{y}_14, and prompts y1\mathbf{y}_15, are optimized jointly. Sampling from across SNR ∈ [1, 13] dB and both AWGN and Rayleigh channels during training ensures generalization to a diverse range of conditions without subsequent fine-tuning or separate model deployments (Zhang et al., 2024).

5. Empirical Performance

Comprehensive evaluations demonstrate that the inclusion of CSP modules in PJSCC delivers superior image reconstruction quality over ablated and baseline systems. On CIFAR-10 under AWGN, CBR = 1/3, the reported PSNR (dB) at various SNR points is:

Method 1 dB 4 dB 7 dB 10 dB 13 dB
w/ CSP (PJSCC) 30.30 33.28 35.92 38.17 39.97
w/ AF 29.64 32.64 34.84 36.83 38.58
w/o CSP 29.56 32.50 34.68 36.65 38.40

Consistent improvements are observed on LPIPS metrics as well. Over full PSNR vs SNR curves and multiple datasets (CIFAR-10, Kodak, CLIC), PJSCC with CSP outperforms DeepJSCC, ADJSCC, WITT, and BPG+LDPC across both AWGN and Rayleigh channels. These gains indicate the efficacy of channel-aware prompting in enabling robust semantic transmission without a proliferation of models (Zhang et al., 2024).

6. Memory Efficiency and Scalability

PJSCC with CSP modules achieves competitive inference times and computational cost. For high-resolution images, it requires 52.3 ms and 39.5 G FLOPs per sample; for low-resolution, 62.4 µs and 767 M FLOPs. This is comparable to WITT and an order of magnitude more efficient than ADJSCC and DeepJSCC (y1\mathbf{y}_1610× FLOPs).

In terms of model storage, PJSCC-U can integrate both AWGN and Rayleigh channels in a single model, resulting in significantly reduced memory requirements compared to baselines that require separate models for each channel type:

Model One Channel Two Channels
PJSCC-U 178 MB 216 MB
WITT 120 MB 240 MB
ADJSCC 131 MB 262 MB
DeepJSCC 1469 MB 2938 MB

This storage compactness is attributed to the modularity and sharing of learnable prompts, eliminating the need for redundant parameterization per channel environment.

7. Ablation Studies and Robustness

Ablations establish the critical value of CSP prompts compared to both simple SNR concatenation (AF) and no prompting. Across all SNRs, using CSP increases PSNR by ≈1 dB over AF and ≈1.4 dB over no-prompt baselines. Each prompt tensor y1\mathbf{y}_17 is sized to match the spatial and channel dimensions of the corresponding feature map stage (e.g., y1\mathbf{y}_18 in high-resolution blocks). Robustness tests confirm that the model generalizes seamlessly to previously unseen intermediate SNR values and both supported channel types, maintaining high reconstruction quality without additional fine-tuning or retraining (Zhang et al., 2024).


CCP, as operationalized via CSP in PJSCC, constitutes a lightweight, fully differentiable prompt-based conditioning mechanism. It incorporates structured channel priors and achieves dynamic adaptation, memory efficiency, and high-fidelity semantic transmission across a broad spectrum of wireless conditions, all within a single unified DeepJSCC model (Zhang et al., 2024).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Channel-wise Conditional Prompting (CCP).