BrainPy: Differentiable Brain Simulator

Updated 21 April 2026

BrainPy is a differentiable brain simulator that integrates realistic neural dynamics, scalable JIT compilation, and gradient-based optimization within a modular Python ecosystem.
Leveraging JAX’s automatic differentiation and vectorization, BrainPy employs efficient JIT compilation and sparse connectivity techniques to accelerate neural simulations.
Its advanced framework supports multi-scale modeling from single-neuron biophysics to large-scale connectomes, enabling novel approaches in brain-inspired machine learning and neuroengineering.

BrainPy is a differentiable brain simulator that unifies biologically realistic neural modeling, brain-inspired computing, and scalable, gradient-based optimization. Built as a pure Python library on top of JAX and utilizing accelerated linear algebra (XLA), BrainPy enables efficient just-in-time (JIT) compilation, automatic vectorization, and reverse-mode automatic differentiation. It is designed for multi-scale modeling, providing a modular, extensible ecosystem that supports single-neuron biophysics, large-scale connectome-driven networks, and end-to-end training on cognitive tasks (Wang et al., 2024, Wang et al., 2023).

1. Software Architecture and Core Modules

BrainPy’s architecture is tightly integrated with JAX, leveraging JAX’s JIT compilation, autodiff, and device parallelism (vmap, pmap) to achieve high computational efficiency.

Object-Oriented JIT Compilation: Neural models, synaptic operations, and networks are implemented as Python classes, typically subclasses of brainpy.DynamicalSystem. Class tracing (cls_jit) fuses method calls and event-driven computations into XLA graphs, maximizing kernel fusion and eliminating Python call overhead (Wang et al., 2023).
Sparse and Memory-Efficient Connectomics: Connectivity is handled via compressed sparse row (CSR) and just-in-time (JIT) operators. CSR format stores only nonzero elements for large networks, while JIT connectivity operators (e.g., regenerating Erdős–Rényi connections on the fly) maintain nearly constant memory usage as network size increases (Wang et al., 2023).
Core Modules:
- bp.dyn: numerical solvers and neuron/synapse definitions
- bp.math: tensor operations, surrogate gradients, event-driven/sparse operators
- bp.losses: standard objectives for voltage and spike-train fitting
- bp.opt: high-performance optimizers (L-BFGS-B, Adam, SGD, etc.)
- bp.network: multi-scale interfaces for constructing networks with connectomic data
- BrainScale: optional module for memory-efficient spiking RNN training via online eligibility traces (Wang et al., 2024).

2. Differentiable Modeling Across Scales

BrainPy’s central innovation is differentiability from subcellular to behavioral scales.

Single-Neuron Models: Implemented as explicit ODEs with continuous-time or discretized solvers (Euler, Runge–Kutta, SDE, adaptive). Supported models include generalized leaky integrate-and-fire (GIF), Hodgkin–Huxley (HH), adaptive LIF, Izhikevich, and more (Wang et al., 2024, Wang et al., 2023).
Spike Surrogate Gradients: Non-differentiable thresholding is replaced by smooth surrogates (e.g., triangular functions) to enable gradient flow through spiking discontinuities (Wang et al., 2023).
Network Level: Event-driven sparse operators (CSR-based) propagate gradients via JAX custom_vjp rules. Only nonzero spike events trigger computation, leading to substantial speedups (Wang et al., 2024).
End-to-End Task Training: Behavioral-level models—spiking networks with differentiable read-outs—are trainable using surrogate gradients and online eligibility traces (BrainScale), enabling gradient signals to pass from task loss all the way to neuron and synapse parameters (Wang et al., 2024, Wang et al., 2023).

3. Mathematical Formulation and Model Types

BrainPy supports a range of neuron and synapse dynamics within its differentiable pipeline.

GIF Neuron:

$\tau_{I1} \frac{dI_1}{dt} = -I_1 \ \tau_{I2} \frac{dI_2}{dt} = -I_2 \ \tau_V \frac{dV}{dt} = -V + V_{rest} + R(I_1 + I_2 + I_{ext})$

Spike at $V(t) \geq V_{th} \rightarrow$ resets $I_1, I_2, V$ (Wang et al., 2024).

Hodgkin–Huxley:

$C \frac{dV}{dt} = -g_L (V-E_L) - g_{Na} m^3 h (V-E_{Na}) - g_K n^4 (V-E_K) + I_{ext}$

Gating variable ODEs as in HH (Wang et al., 2024).

Synaptic Dynamics:

$\tau_{syn} \frac{dg_{exc}}{dt} = -g_{exc}, \quad \text{on spike: } g_{exc} \leftarrow g_{exc} + W_{ij}$

Similar for $g_{inh}$ ; input current sums over conductances (Wang et al., 2024, Wang et al., 2023).

Event-Driven Operators:

Efficient implementation of

$y = W \cdot x$

is realized as: For $i$ where $x[i]=1$ , increment all $y[col\_ind[idx]]$ by $V(t) \geq V_{th} \rightarrow$ 0 for indices $V(t) \geq V_{th} \rightarrow$ 1 in $V(t) \geq V_{th} \rightarrow$ 2 to $V(t) \geq V_{th} \rightarrow$ 3 (Wang et al., 2024).

4. Gradient-Based Fitting, Optimization, and Learning

BrainPy extends JAX’s autodiff to the neuroscientific domain, supporting both classical and modern optimization methods.

Objective Functions:
- Membrane voltage: Mean Squared Error
$V(t) \geq V_{th} \rightarrow$ 4 - Spike train similarity: Gamma factor and spike-matching loss (Wang et al., 2024).
Optimization Algorithms:

Gradient-based (L-BFGS-B, Adam, SGD, RMSProp) and black-box (DE, PSO, Bayesian) methods, with L-BFGS-B offering superior convergence for mechanistic models. Gradient updates can be efficiently batched (vmap) for parallelized parameter fitting (Wang et al., 2024).

Online Learning (BrainScale):

Surrogate backpropagation through time is replaced by local eligibility traces,

$V(t) \geq V_{th} \rightarrow$ 5

and the weight update,

$V(t) \geq V_{th} \rightarrow$ 6

(Wang et al., 2024).

5. Large-Scale Network Construction and Connectomic Integration

BrainPy enables construction of multi-scale brain models constrained by empirical data.

Connectomic Data Handling:

Inputs/recurrence specified as CSR matrices containing empirical connection probabilities or strengths. Anatomical constraints are mapped directly onto network weights, eliminating the need for post hoc parameter matching (Wang et al., 2024).

Memory Scaling:

Memory usage is held approximately constant in JIT-connectivity mode, compared to quadratic scaling in dense formats—allowing simulation of networks with millions of neurons and connections (Wang et al., 2023).

6. Performance Benchmarks and Comparative Results

Empirical benchmarks highlight BrainPy’s computational efficiency and accuracy.

Task	BrainPy Performance	Comparator Performance
Event-driven mat-vec (CPU/GPU)	100×–100,000× faster vs dense/sparse	Conventional routines (CPU/GPU)
COBA-LIF/COBA-HH simulation (10³–10⁵ neurons)	2×–10× faster vs NEURON, NEST, Brian2, ANNarchy, BindsNet	NEURON, NEST, Brian2, etc.
GIF neuron fit (L-BFGS-B, s/fit)	Loss 6.80 ± 4.62; 5.40 s (L-BFGS-B)	Higher loss or runtime for DE, PSO, Bayesian
HH neuron fit (L-BFGS-B, s/fit)	Loss 2.3e-8 ± 1.6e-8; 3.82 s (L-BFGS-B)	Higher loss/runtime (DE, PSO, Bayesian)
Large SNN memory (T=600, batch=128)	Constant ≈0.5 GB (BrainScale online); BPTT OOM beyond T≈600	BPTT linear in T, OOM at moderate T
VGG-SNN training (epoch, GPU)	50 s (BrainPy)	104 s (SpikingJelly)

Gradient-based single-neuron fitting consistently outperforms black-box methods on loss, with comparable or superior runtime. BrainPy’s BrainScale algorithm enables stable memory usage and speedups versus backpropagation through time (BPTT) for long sequences (Wang et al., 2024, Wang et al., 2023).

7. Use Cases, Extensibility, and Integration

BrainPy is directly applicable in several subfields:

Biologically Plausible SNN Training: Data-driven SNNs reproducing in vivo prefrontal-cortex activity, trained on tasks such as delayed match-to-sample, with observed emergence of biologically accurate firing and synaptic dynamics (Wang et al., 2023).
Brain-Inspired Machine Learning: Reservoir computing architectures achieving state-of-the-art results on KTH and MNIST benchmarks (over 94% and 98% accuracy, respectively), with accelerated inference and scaling via JIT connectivity (Wang et al., 2023).
Distributed and Multi-Device Modeling: Experimental support for large-scale, multi-area models distributed across multiple GPUs/TPUs using JAX’s pmap/pjit (Wang et al., 2023).
Ecosystem Integrations: Direct composability with JAX/Flax/Haiku models; e.g., supplementing or embedding BrainPy networks as modules within larger deep learning architectures (Wang et al., 2023).

A plausible implication is the increasing convergence of mechanistically detailed brain simulation and scalable AI platforms, as enabled by object-oriented, differentiable, and memory-efficient frameworks such as BrainPy.

References:

(Wang et al., 2024) A Differentiable Approach to Multi-scale Brain Modeling (Wang et al., 2023) A differentiable brain simulator bridging brain simulation and brain-inspired computing

Markdown Report Issue Upgrade to Chat

References (2)

A Differentiable Approach to Multi-scale Brain Modeling (2024)

A differentiable brain simulator bridging brain simulation and brain-inspired computing (2023)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to BrainPy.