Optical Dot Products

Updated 27 November 2025

Optical dot products are computed by encoding vector elements into coherent light fields, directly performing scalar multiplications via interference.
Architectures leverage amplitude modulation, interferometric methods, and inverse-designed nanophotonic structures for high-throughput, energy-efficient operations.
Robust calibration techniques like backpropagation help achieve sub-percent error, advancing applications in neural network inference and scientific imaging.

Optical dot products are fundamental computational primitives in photonic hardware for machine learning and scientific computing, enabling the direct mapping of electronic linear algebra onto ultrafast, energy-efficient optical substrates. An optical dot product refers to the process by which two vectors, typically a data input vector $\mathbf{x}$ and a weight vector $\mathbf{w}$ , are encoded into optical fields such that their scalar product $y = \sum_i x_i w_i$ is computed through the physical interaction of those fields and measured at a detector. Developments in optical dot-product engines leverage the coherence, parallelism, and multiplexing capabilities of light, and exploit both traditional photonic circuit elements and inverse-designed nanophotonic structures to scale throughput, energy efficiency, and integration density.

1. Principles of Optical Dot Product Computation

The optical computation of dot products exploits the linearity and superposition of electromagnetic fields. Two principal encoding paradigms are established:

Amplitude Modulation in Coherent Photonic Cores (Xu et al., 2021): Each data element $x_i$ and corresponding weight $w_i$ is separately imposed as an amplitude modulation on a branch of a split, phase-aligned laser beam via Mach–Zehnder modulators (MZMs). The resulting fields, $E_{w,i} = E_0 x_i w_i$ , are coherently recombined. Detection of the summed field, in conjunction with a reference branch, results in a photocurrent $I_\text{photo}$ that, by virtue of the cross-term in $|E_\text{ref}+\sum_i E_{w,i}|^2$ , is linearly proportional to the desired scalar product.
Interference-Based Multiplication (Mathur, 18 Jul 2025): By encoding two numbers $x, y$ onto separate coherent input sources, arranging for constructive (addition) and destructive (subtraction) interference at outputs, and exploiting the identity $(x+y)^2 - (x-y)^2 = 4xy$ , the difference in photodetector signals yields a value proportional to $xy$ . These principles generalize to higher-dimensional dot products via parallel or multiplexed implementations.

Additionally, mode-division and wavelength-division multiplexing harness different optical degrees of freedom to compactly represent and process higher-dimensional vectors (Zhu et al., 18 Jan 2024).

2. Architectures and Implementation Strategies

Optical dot products are realized in several architectures:

Silicon-Based Coherent Dot Product Chips: The optical coherent dot-product chip (OCDC) splits a single laser source into $M$ “data” branches and one “local reference” branch. Each data branch sequentially imposes $x_i$ and $w_i$ using cascaded Mach–Zehnder interferometers. Thermo-optic phase shifters coarsely phase-align branches. The M data and reference fields are recombined in an output cascade of directional couplers and read out by a single photodiode. Time-multiplexed sub-block operations extend the $M$ -core to arbitrary matrix–vector multiplies (“MVMs”) and convolutions through sequential loading (Xu et al., 2021).
Inverse-Designed Nanophotonic Cavities: Optical cavities designed via adjoint-based inverse-design mathematics achieve direct “add” and “sub” interference within a nanoscopic silicon domain. Two analog inputs, encoded in amplitude and sign by the emission direction, excite the cavity; output ports are positioned such that one measures $|x+y|^2$ and the other $|x-y|^2$ , with the difference proportional to $xy$ (Mathur, 18 Jul 2025). This compact approach replaces complex conventional interferometer cascades, yielding substantial reductions in physical area and laser power.
Mode-Multiplexed Cores: Inverse-designed photonic cores encode vector elements in the complex amplitudes of orthogonal spatial modes (e.g., $\psi_1$ , $\psi_2$ ). These are coherently interfered via a single topologically optimized region, combining multiplexers and multimode interferometers. Balanced photodetection with engineered phase shifts produces an output proportional to the real part of the dot product $\mathbf{a} \cdot \mathbf{b}$ (Zhu et al., 18 Jan 2024).

3. Calibration and Error Correction

Fabrication-induced nonuniformities (e.g., imperfect splitting, modulator crosstalk, drift) necessitate in situ calibration:

Backpropagation Calibration (BPC): The OCDC architecture implements a feedback loop wherein a known training set is used to iteratively adjust modulator biases, minimizing the output mismatch between chip and software reference. Gradients are estimated numerically as $\partial L/\partial w_j \propto \sum (I_\text{photo}^{(n)}-\hat{y}^{(n)})x_j^{(n)}$ . Two BPC iterations typically reduce normalized dot product error $\sigma$ from $\sim$ 0.06 to $\sim$ 0.03, enabling sub-percent-level regression fidelity (Xu et al., 2021).
Digital and Phase Calibration: Inverse-designed mode-multiplexed cores may require a digital header or off-chip phase adjustment to compensate for static splitting errors and crosstalk; this preserves dot-product accuracy with minimal overhead (Zhu et al., 18 Jan 2024).
Error decomposition: Single-ended detection architectures are limited by quadratic field dependence ( $I\propto |E|^2$ ) and nonidealities from higher-order crosstalk, limiting accuracy unless balanced (homodyne) detection is implemented.

4. Performance Metrics and Experimental Results

Key experimental metrics and results include:

Architecture	Area (µm² or mm²)	Peak Accuracy	Max Throughput	Energy per MAC	Key Results
OCDC (Xu et al., 2021)	O(M × chip area)	σ=0.0074–0.01	>10¹⁰ MAC/s (proj.)	≤10 fJ/MAC (proj.)	AUTOMAP MRI regression error: OCDC matches 32-bit ref.
Mode-mux core (Zhu et al., 18 Jan 2024)	15	NMSE ≈ 6% (sim)	Up to 10¹³ ops/s/mm²	—	Complex dot, optical flow at cosine sim. 81.8%
Inverse-design cavity (Mathur, 18 Jul 2025)	1.35 mm²	$R^2$ =0.88	—	−0.88% total (DeiT)	Area −88%, laser −23%, O $(xy)$ linearity: $1.057xy+0.249$

OCDC achieves full real-valued encoding for both $x_i,w_i\in\mathbb{R}$ via amplitude modulation, outperforming prior ONNs limited to nonnegative domains (Xu et al., 2021).
Measured root-mean-square image-reconstruction error for benchmark MRI tasks (AUTOMAP): OCDC is within a factor of 2× of 32-bit digital (Xu et al., 2021).
The mode-multiplexed core realizes direct 2-element dot products in a 15 μm² footprint at sub-10% normalized mean squared error, with feasible extension to $10^3$ TOPS/mm² via spatial and wavelength multiplexing (Zhu et al., 18 Jan 2024).
Inverse-designed nanocavities provide direct proportionality $I_\text{out}\propto xy$ over the full input range with $R^2=0.88$ linearity, and enable an 88% area reduction versus legacy photonic transformer cores (Mathur, 18 Jul 2025).

5. Scaling, Limitations, and Architectural Trade-offs

Optical dot-product hardware faces several scaling and architectural constraints:

Temporal and Spatial Multiplexing: To scale beyond the native branch width $M$ of a chip or core, large dot products are synthesized by time-multiplexing input sub-vectors. Additional heads (spatial) can extend aggregate throughput, especially with low-loss waveguide crossings supporting 2D arrays (Xu et al., 2021).
Degree-of-Freedom Multiplexing: Mode-division multiplexing is limited by crosstalk growth and increased fabrication difficulty for $N>2$ spatial modes; promising ultimate scaling combines MDM, wavelength-division, and spatial-location multiplexing (Zhu et al., 18 Jan 2024).
Density vs. Numeric Domain: MZI-based architectures occupy more area than ring-resonator arrays but uniquely support full real-valued domains and systematic calibration, enabling regression tasks previously unattainable in ONN hardware (Xu et al., 2021).
Accuracy Limits: Backpropagation calibration compensates only for (quasi-)linear device variations; nonlinearity, detector noise, and drift phenomena (thermal, environmental) dictate the need for periodic recalibration or feedback.
Photon-budget and Loss: High-speed operation incurs noise and signal-to-noise limits from shot noise, photodiode design, and coupler insertion loss; optimal design may integrate optical amplification or balanced detection.

6. Applications and Prospects

Optical dot products underpin several domains:

Neural Network Inference and Training: Real-valued OCDC architectures enable regression, classification, convolution, and application to models such as AUTOMAP for MRI and potentially for advanced transformers (Xu et al., 2021, Mathur, 18 Jul 2025).
Scientific Imaging and Computer Vision: Direct dot-product cores support matrix operations in optical flow, complex vector algebra, and image reconstruction, demonstrated through Lucas–Kanade flow and complex multiplication (Zhu et al., 18 Jan 2024).
Energetics and Compactness: Inverse-designed cores minimize area and power, enabling chip-scale linear-algebra engines projected to exceed electronic SIMD accelerators in throughput per mm² (Zhu et al., 18 Jan 2024, Mathur, 18 Jul 2025).
Deployment Prospects: Full real-value linearity, sub-percent computational error, and multi-domain encoding position optical dot products for adoption in AI accelerators, scientific instrumentation, and real-time data processing pipelines.

7. Outlook and Research Directions

Recent results position optical dot-product chips and nanophotonic cores as viable components for next-generation deep learning and scientific computing accelerators. Future research is directed toward:

Integrating on-chip high-speed electro-optic modulators and low-power/non-volatile phase shifters to minimize MAC energy.
Expanding the dimensionality of mode-multiplexed and cavity-based designs, involving advances in low-crosstalk inverse design and robust digital calibration.
Developing scalable hybrid electronic–photonic control, especially for memory and feedback.
Investigating ultimate scaling strategies that combine all optical degrees of freedom for multi-teraflop-class photonic linear algebra engines.

The demonstrated transition from toy-scale “classification” ONN chips to regression-competent, high-accuracy, fully optical dot-product hardware marks a significant advancement in photonic computing (Xu et al., 2021, Zhu et al., 18 Jan 2024, Mathur, 18 Jul 2025).