Feedback Alignment in Neural Networks
- Feedback Alignment (FA) is a biologically plausible learning rule that uses fixed random matrices to embed target information into hidden neural representations.
- FA eliminates the need for weight symmetry and synchronous updates, addressing major biological implausibilities inherent in backpropagation.
- Both experimental and theoretical studies demonstrate that FA and its variants achieve competitive performance, offering pathways for scalable and energy-efficient training.
Feedback Alignment (FA) is a synaptic learning rule for artificial neural networks that replaces the exact transmission of forward weights required by backpropagation (BP) with fixed, often random, matrices for propagating target signals or errors in the backward pass. By removing the requirement for weight symmetry and synchronous, global updates, FA addresses key biological implausibility issues of BP. Recent theoretical advances have reframed FA as a mechanism for embedding target information into hidden representations, supporting its utility across a wide range of architectures without sacrificing performance relative to BP.
1. Biological Motivation and Formulation
Backpropagation, the dominant algorithm in modern neural network training, demands two mechanisms regarded as biologically implausible: weight symmetry (the “weight transport problem,” requiring backward propagation of precise synaptic strengths) and globally synchronous, layer-ordered backward passes. FA, introduced by Lillicrap et al., circumvents these by using a fixed, typically random feedback matrix instead of the exact transpose of the forward weights for error propagation. This modification enables local, asynchronous, and architecture-agnostic learning, with empirical support across linear, multilayer, convolutional, and recurrent networks (Cheng et al., 2023).
Formally, for a network with input $x$, target output $y$, and prediction $\hat{y}$, FA splits the network into an encoder $f: x \mapsto h$ and a decoder $g: h \mapsto \hat{y}$. The prediction error $e = y - \hat{y}$ is propagated to the hidden code $h$ via a fixed random matrix $B$:

$$\delta h = B\,e = B\,(y - \hat{y})$$

This utilizes $B$ as a channel through which target information, not just gradient estimates, is injected into the representation (Cheng et al., 2023).
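The contrast with backpropagation can be sketched in a few lines of NumPy for a single linear encoder/decoder pair; dimensions, learning rate, and initialization scales here are illustrative choices, not values from the cited work:

```python
import numpy as np

rng = np.random.default_rng(0)
d_x, d_h, d_y = 8, 16, 4          # illustrative input/hidden/output sizes

W_enc = rng.standard_normal((d_h, d_x)) * 0.1   # encoder f: x -> h
W_dec = rng.standard_normal((d_y, d_h)) * 0.1   # decoder g: h -> y_hat
B = rng.standard_normal((d_h, d_y))             # fixed random feedback matrix

x = rng.standard_normal(d_x)
y = rng.standard_normal(d_y)

h = W_enc @ x            # hidden code
y_hat = W_dec @ h        # prediction
e = y - y_hat            # prediction error

delta_h_bp = W_dec.T @ e   # BP would transport the transposed forward weights
delta_h_fa = B @ e         # FA injects the error through the fixed random B

# Local weight updates driven by the FA signal.
lr = 0.01
W_enc += lr * np.outer(delta_h_fa, x)
W_dec += lr * np.outer(e, h)
```

The only difference from BP is the single line computing the hidden-layer signal: `B` never changes and need not match `W_dec.T`.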
2. Information-Theoretic Perspective and Dynamics
Cheng and Brown proposed that FA should be interpreted as a process of target information embedding. Under continuous-time idealized dynamics, the update for the batch of hidden activations $H$ is:

$$\dot{H} = B\,(Y - \hat{Y})$$

This ODE drives $H$ toward a subspace where the target $Y$ is linearly decodable from $H$, ensuring information about $Y$ is embedded in $H$ and increasing the mutual information $I(H; Y)$ during training (Cheng et al., 2023). The regression loss $\|Y - W^{*}H\|_F^2$ (where $W^{*}$ is the least-squares decoder) decreases monotonically under FA flow, provided $B$ is full rank.
Experimental and analytical evidence shows that the FA update is initially orthogonal to the true gradient but progressively aligns as $H$ approaches subspaces embedding the target. After convergence, the signals become largely orthogonal again, indicating no further information needs embedding (Cheng et al., 2023).
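A discrete-time version of this flow can be simulated to watch the FA signal and the true gradient signal interact; the network sizes, batch, and learning rate below are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(1)
d_x, d_h, d_y, n = 5, 20, 3, 50

X = rng.standard_normal((d_x, n))
Y = rng.standard_normal((d_y, n))
W1 = rng.standard_normal((d_h, d_x)) * 0.05          # encoder
W2 = rng.standard_normal((d_y, d_h)) * 0.05          # decoder
B = rng.standard_normal((d_h, d_y)) / np.sqrt(d_y)   # fixed feedback

def cosine(u, v):
    return float(np.sum(u * v) /
                 (np.linalg.norm(u) * np.linalg.norm(v) + 1e-12))

lr, angles = 0.05, []
loss_init = np.linalg.norm(Y - W2 @ (W1 @ X))**2
for _ in range(2000):
    H = W1 @ X
    E = Y - W2 @ H                 # batch prediction error
    fa = B @ E                     # FA teaching signal for H
    bp = W2.T @ E                  # signal true gradient descent would send
    angles.append(cosine(fa, bp))  # typically near 0 at init, then rises
    W1 += lr * (fa @ X.T) / n
    W2 += lr * (E @ H.T) / n
loss_final = np.linalg.norm(Y - W2 @ (W1 @ X))**2
```

Plotting `angles` over training typically reproduces the reported pattern: near-orthogonality at initialization, growing alignment during learning, and decay again once the fit saturates.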
3. Mathematical Guarantees, Variants, and Regularization
Theoretical results in the linear regime establish rigorous convergence conditions for FA:
- In over-parameterized settings, FA converges to the global minimizer for low-rank matrix factorization and deep linear networks (Garg et al., 2021, Girotti et al., 2021).
- In under-parameterized settings, FA may yield suboptimal solutions, with error bounded away from the optimal matrix approximation (Garg et al., 2021).
- FA and gradient descent (GD) can yield nearly orthogonal hidden representations, even with similar output error (Garg et al., 2021).
- Empirical studies confirm robust convergence for both continuous and discrete-time FA when proper initialization ensures implicit regularization—large singular modes of the target mapping are learned first, analogous to principal components in GD (Girotti et al., 2021).
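The over-parameterized matrix-factorization setting is small enough to check numerically. Below is a minimal sketch, assuming a rank-2 target and a square (hence almost surely full-rank) feedback matrix; step size and iteration count are illustrative:

```python
import numpy as np

rng = np.random.default_rng(2)
d, r, k = 6, 2, 6                      # inner dim k >= rank(A): over-parameterized

A = rng.standard_normal((d, r)) @ rng.standard_normal((r, d))  # rank-2 target
W1 = rng.standard_normal((k, d)) * 0.1
W2 = rng.standard_normal((d, k)) * 0.1
B = rng.standard_normal((k, d)) / np.sqrt(d)   # fixed random feedback

err_init = np.linalg.norm(A - W2 @ W1)
lr = 0.005
for _ in range(8000):
    E = A - W2 @ W1
    W1 += lr * (B @ E)                 # FA: B @ E replaces W2.T @ E
    W2 += lr * (E @ W1.T)              # second factor uses its local gradient
err_final = np.linalg.norm(A - W2 @ W1)
```

Shrinking `k` below `rank(A)` would move the run into the under-parameterized regime where the residual stays bounded away from the best rank-`k` approximation.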
FA itself admits several biologically informed variants (Cheng et al., 2023):
| Variant | Update Channel | Biological Analogy |
|---|---|---|
| Standard FA | Fixed random matrix | Random anatomical projections |
| Noisy FA (NFA) | Fixed random matrix plus Gaussian noise | Synaptic fluctuation, drift |
| Network FB (NF) | Nonlinear, random, fixed network | Dendritic/interneuron nonlinearity |
| Target FB (TF) | Target broadcast through a fixed matrix, plus decorrelation term | Global broadcast, neuromodulation |
Each variant preserves monotonic information embedding; NFA, in particular, models representational drift seen in neuronal population codes (Cheng et al., 2023).
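As a sketch of how little separates these variants in code, Noisy FA differs from standard FA only in resampling a perturbation of the fixed channel at every call (the noise scale `sigma` is an illustrative choice, not a value from the cited work):

```python
import numpy as np

rng = np.random.default_rng(3)
d_h, d_y = 16, 4
B = rng.standard_normal((d_h, d_y))    # fixed random feedback channel
e = rng.standard_normal(d_y)           # output-layer error

def fa_signal(e):
    """Standard FA: error through the fixed random channel B."""
    return B @ e

def nfa_signal(e, sigma=0.1):
    """Noisy FA: the channel fluctuates around B with fresh Gaussian
    noise each call, modeling synaptic fluctuation and drift."""
    return (B + sigma * rng.standard_normal(B.shape)) @ e
```

Network FB would replace the matrix product with a fixed random nonlinear network, and Target FB would broadcast the target itself rather than the error.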
4. Conservation Laws, Alignment, and Practical Implications
FA exhibits an inherent conservation law that links neuronwise alignment of forward and feedback weights to growth in weight norms. Specifically, for (leaky) ReLU FA or sign-FA, the change in the alignment between a neuron's forward and feedback weights is exactly balanced by the increase in the squared norm of its incoming weights. This self-maintains an acute angle between forward and feedback weights and supports convergence under an alignment-dominance condition (Robertson et al., 2023).
Empirical evidence shows that better alignment correlates with improved FA performance, particularly on multi-class tasks. Strategies such as norm-matched initialization, sign-locked feedback (sign-FA), or feedback mirroring (adaFA) further enhance convergence and accuracy. Notably, sign-FA’s one-bit feedback also points toward privacy-preserving, communication-efficient schemes (Robertson et al., 2023).
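The neuronwise alignment being tracked can be measured directly: for hidden neuron $i$, compare its outgoing forward weights (column $i$ of the decoder) with its incoming feedback weights (row $i$ of $B$). A minimal ReLU sketch, with illustrative sizes and learning rate:

```python
import numpy as np

rng = np.random.default_rng(4)
d_x, d_h, d_y, n = 10, 32, 5, 200

X = rng.standard_normal((d_x, n))
Y = rng.standard_normal((d_y, n))
W1 = rng.standard_normal((d_h, d_x)) * 0.1
W2 = rng.standard_normal((d_y, d_h)) * 0.1
B = rng.standard_normal((d_h, d_y)) / np.sqrt(d_y)

def neuron_alignment(W2, B):
    """Per-neuron cosine between outgoing forward weights W2[:, i]
    and incoming feedback weights B[i, :]."""
    num = np.einsum('ji,ij->i', W2, B)
    den = np.linalg.norm(W2, axis=0) * np.linalg.norm(B, axis=1) + 1e-12
    return num / den

lr = 0.05
loss_init = np.linalg.norm(Y - W2 @ np.maximum(W1 @ X, 0.0))**2
for _ in range(2000):
    H = np.maximum(W1 @ X, 0.0)        # ReLU hidden layer
    E = Y - W2 @ H
    D = (B @ E) * (H > 0)              # FA signal gated by ReLU derivative
    W1 += lr * (D @ X.T) / n
    W2 += lr * (E @ H.T) / n
loss_final = np.linalg.norm(Y - W2 @ np.maximum(W1 @ X, 0.0))**2
cos_mean = neuron_alignment(W2, B).mean()  # typically drifts toward acute angles
```

Monitoring `neuron_alignment` alongside the squared norms of each neuron's incoming weights is how the conservation-law balance would be checked empirically.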
5. Direct Feedback Alignment, Sparsity, and Hardware Realizations
Direct Feedback Alignment (DFA) generalizes FA by projecting the output error directly to each hidden layer with fixed, randomly initialized matrices, removing the necessity for sequential backward error propagation. Sparsifying the feedback matrix (Single-Signal DFA, SSDFA) drastically reduces data movement and energy consumption, particularly attractive for hardware implementations and near-memory architectures (Crafton et al., 2019, Nøkland, 2016).
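Structurally, DFA's defining change is that each hidden layer's teaching signal is computed from the output error alone, with no backward chain through later layers. A minimal two-hidden-layer sketch with tanh units (sizes and step size are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(5)
d_x, d1, d2, d_y, n = 8, 32, 32, 4, 100

X = rng.standard_normal((d_x, n))
Y = rng.standard_normal((d_y, n))

W1 = rng.standard_normal((d1, d_x)) * 0.1
W2 = rng.standard_normal((d2, d1)) * 0.1
W3 = rng.standard_normal((d_y, d2)) * 0.1

# DFA: each hidden layer gets the output error through its own fixed
# random matrix -- no sequential backward pass through W3 and W2.
B1 = rng.standard_normal((d1, d_y)) / np.sqrt(d_y)
B2 = rng.standard_normal((d2, d_y)) / np.sqrt(d_y)

lr = 0.05
loss_init = np.linalg.norm(Y - W3 @ np.tanh(W2 @ np.tanh(W1 @ X)))**2
for _ in range(500):
    H1 = np.tanh(W1 @ X)
    H2 = np.tanh(W2 @ H1)
    E = Y - W3 @ H2
    D2 = (B2 @ E) * (1 - H2**2)        # direct projection, gated by tanh'
    D1 = (B1 @ E) * (1 - H1**2)        # computable in parallel with D2
    W1 += lr * (D1 @ X.T) / n
    W2 += lr * (D2 @ H1.T) / n
    W3 += lr * (E @ H2.T) / n
loss_final = np.linalg.norm(Y - W3 @ np.tanh(W2 @ np.tanh(W1 @ X)))**2
```

Because `D1` and `D2` depend only on `E`, all layer updates can be computed in parallel once the forward pass finishes; sparsifying `B1`/`B2` (as in SSDFA) reduces the data movement further.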
Key empirical results:
| Task/Network | BP Accuracy | DFA/SSDFA Accuracy | Remarks |
|---|---|---|---|
| FC MNIST | 98.2% | 97.8–97.5% | SSDFA: negligible loss |
| CIFAR-10 FC | 59.9% | 58.9–58.6% | |
| Conv MNIST | 99.1% | 98.8–98.9% | |
| Conv CIFAR-10 | 79.6% | 72.3–73.1% |
For deep convolutional models, competitive performance is restored when combining DFA/SSDFA for the fully connected layers with BP-trained (or transferred) convolutional layers. Memory and data-movement savings are substantial relative to BP (Crafton et al., 2019).
6. Extensions: Federated Learning, Adaptive Feedback, and Neuroscientific Connections
Recent advances incorporate FA into federated learning (FLFA), using the current global model's weights as shared feedback matrices for local training. By aligning local gradients to the global model, this method mitigates local drift caused by non-IID data while incurring negligible computational overhead and no communication overhead. Empirical FLFA results report accuracy gains of 1–2.5% and consistent drift reduction across diverse benchmarks (Baek et al., 14 Dec 2025).
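One way to realize this idea, sketched below under the assumption that clients use the transpose of the global output weights as their shared feedback matrix before a standard FedAvg aggregation (the details of the actual FLFA protocol may differ):

```python
import numpy as np

rng = np.random.default_rng(6)
d_x, d_h, d_y = 6, 12, 3

# Global linear model weights (server state).
W1_g = rng.standard_normal((d_h, d_x)) * 0.1
W2_g = rng.standard_normal((d_y, d_h)) * 0.1

def local_fa_round(W1, W2, B, X, Y, lr=0.05, steps=20):
    """One client's local FA training with the shared feedback matrix B."""
    n = X.shape[1]
    for _ in range(steps):
        H = W1 @ X
        E = Y - W2 @ H
        W1 += lr * ((B @ E) @ X.T) / n   # FA update through shared B
        W2 += lr * (E @ H.T) / n
    return W1, W2

# Clients hold different (here randomly drawn) local datasets.
clients = [(rng.standard_normal((d_x, 40)), rng.standard_normal((d_y, 40)))
           for _ in range(3)]

# Shared feedback anchored to the *global* model's output weights.
B_shared = W2_g.T.copy()
updates = [local_fa_round(W1_g.copy(), W2_g.copy(), B_shared, X, Y)
           for X, Y in clients]

# Server step: FedAvg over the returned client models.
W1_g = np.mean([u[0] for u in updates], axis=0)
W2_g = np.mean([u[1] for u in updates], axis=0)
```

Because `B_shared` is derived from state every client already receives, the scheme adds no communication beyond ordinary federated averaging.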
FA-inspired “forward-only” algorithms (e.g., PEPITA, adaptive FA) and dual-objective alignment schemes (Feedback-Feedforward Alignment, FFA) connect FA to biologically hypothesized operations including predictive coding, feedback-based credit assignment, and representational drift. These frameworks unify local learning, inference, and generative feedback within a plausible biological substrate, as well as suggest new avenues for robust, privacy-aware, and distributed machine learning (Toosi et al., 2023, Srinivasan et al., 2023, Cheng et al., 2023).
7. Theoretical and Practical Frontiers
FA’s convergence and alignment phenomena are now well understood in linear and certain nonlinear network regimes, but considerable challenges remain:
- Characterizing partial and imperfect alignment’s impact in deep, nonlinear nets and realistic high-dimensional settings (Garg et al., 2021, Cheng et al., 2023).
- Developing initialization and scaling heuristics that optimize implicit regularization, avoid anti-regularization, and ensure principal-component-first learning (Girotti et al., 2021, Robertson et al., 2023).
- Formulating local, biologically plausible proxies for mutual information which can be maximized in hidden layers, as opposed to using gradients as in BP (Cheng et al., 2023).
- Extending FA to architectures beyond MLPs—transformers, large convolutional networks, and multi-task models—where empirical success is mixed and hybrid strategies often yield the best trade-offs (Toosi et al., 2023, Crafton et al., 2019).
- Integrating FA with distributed and privacy-preserving learning, where low-bandwidth, quantized, or one-bit feedback naturally aligns with practical deployment constraints (Robertson et al., 2023, Baek et al., 14 Dec 2025).
Overall, the information-embedding theory of FA reframes it as a general-purpose, biologically plausible, and architecture-agnostic learning framework. By leveraging arbitrary but linearly independent feedback pathways to inject target information into hidden codes, FA and its variants enable local, scalable, and robust training dynamics that rival the performance of backpropagation while opening new theoretical and applied research directions (Cheng et al., 2023, Robertson et al., 2023, Garg et al., 2021, Baek et al., 14 Dec 2025).