Papers
Topics
Authors
Recent
2000 character limit reached

Reproduction Numbers R_0, R_t for COVID-19 Infections with Gaussian Distribution of Generation Times, and of Serial Intervals including Presymptomatic Transmission (2511.04574v1)

Published 6 Nov 2025 in q-bio.PE

Abstract: Basic and instantaneous reproduction numbers, "R" _"0" and "R" _"t" , are important metrics to assess progress of an epidemic and effectiveness of preventative interventions undertaken, and also to estimate coverage needed for vaccination. Reproduction numbers are related to the daily number of positive cases recorded by the national public health authorities, via the renewal equation. During periods of exponential growth or decay they are linked also to the rate constants by the Lotka-Euler equation. For either application, we need the distribution of generation times between primary and secondary infections. In practice, we use instead the directly observable serial interval between symptoms onset of infector and infectee. Pre-symptomatic transmission that occurs in COVID infection causes serial intervals to extend to negative values, which can be described with a Gaussian distribution. Consistent application of the two approaches requires careful attention to lower limits imposed on the distribution. Allowing Gaussian-distributed serial intervals to extend to minus infinity with the Lotka-Euler equation, as commonly is done, results in lower reproduction numbers than predicted from the discretized renewal equation. Here, we formulate the Lotka-Euler equation for Gaussian distributions including an explicit lower cut-off, and use this to explore the consequences of presymptomatic transmission for COVID-19 infections.

Summary

  • The paper demonstrates that properly truncating Gaussian distributions for serial intervals is crucial in accurately estimating R0 and Rt values in COVID-19.
  • It employs renewal equations and the Lotka-Euler formalism to rigorously compare generation time and serial interval-based reproduction numbers.
  • Empirical evaluations reveal systematic underestimations in common methods, underscoring significant implications for epidemiological surveillance and intervention strategies.

Reproduction Numbers for COVID-19: Implications of Gaussian Serial Intervals with Presymptomatic Transmission

Overview

This paper provides a rigorous mathematical and epidemiological framework for calculating the basic and instantaneous reproduction numbers (R0R_0, RtR_t) for COVID-19, incorporating the observed phenomenon of presymptomatic transmission. The author focuses on the consequences of utilizing Gaussian-distributed serial intervals (SIs)—which necessarily extend to negative values due to infectiousness occurring prior to symptom onset—and discusses the implications for commonly used methods relating incidence data to reproduction numbers. Explicit attention is given to the need for lower truncation of Gaussian distributions for both serial intervals and generation times, a factor that is often neglected in previous literature and which leads to systematic underestimation of reproduction metrics when omitted.

Mathematical Framework for Generation Times and Serial Intervals

Renewal Equations and Lotka-Euler Formalism

The established approach links incidence of infection, serial intervals, and reproduction numbers via renewal equations:

C(t)=R0S(t)∫Tm∞C(t−τ)g(τ)dτC(t) = R_0 S(t) \int_{T_m}^{\infty} C(t-\tau) g(\tau) d\tau

where g(Ï„)g(\tau) is the probability density function (PDF) for generation time or its proxy, the serial interval.

In periods of exponential growth/decline, the Lotka-Euler equation connects the reproduction number with the exponential rate constant rr:

1R0=∫Tm∞e−rτg(τ)dτ\frac{1}{R_0} = \int_{T_m}^{\infty} e^{-r\tau} g(\tau) d\tau

Necessity of Distribution Truncation

The paper emphasizes that, due to presymptomatic transmission in COVID-19, the serial interval can take negative values—unlike the actual generation time, which must be strictly positive. Using a Gaussian distribution for SIs yields a nonzero probability density for arbitrarily large negative intervals, which is unphysical given the constraints imposed by human incubation periods.

Explicit truncation at a lower limit (roughly the mean incubation duration) is essential:

  • For generation time, Tm≃+1T_m \simeq +1 day.
  • For serial interval, Tm≃−5T_m \simeq -5 to −6-6 days (COVID-19 mean incubation period).

Failure to impose these limits, as is common when naively applying continuous Gaussians, results in underestimation of R0R_0 and RtR_t; this is quantitatively substantiated in this work.

Empirical Evaluation and Results

Comparative Analysis: Generation Times vs Serial Intervals

Using parameterizations from Ganyani et al. (2020), the author evaluates reproduction numbers for COVID-19 in Germany across epidemic waves:

  • GT means: 3.86 days (Singapore), 2.90 days (Tianjin)
  • SI means: similar, but with greater variance

Key finding: Reproduction numbers calculated using generation times are systematically higher than those calculated using serial intervals, especially when using properly truncated Gaussian distributions. This divergence increases with distance from the Rt=1R_t=1 threshold.

For example:

  • With the Singapore data, R0R_0 for Gaussian GT-distribution: 2.84; for gamma GT-distribution: 2.27; for Gaussian SI-distribution: 1.67; and for untruncated Gaussian (Tm→−∞T_m \to -\infty): 1.29.
  • For Tianjin, R0R_0 are: 2.57 (Gaussian GT), 1.76 (gamma GT), 1.29 (Gaussian SI), 0.95 (untruncated Gaussian).

The claim that standard application of Eq. 11 (untruncated Gaussian) "mostly predicts values of R0R_0 that are too low" is strongly supported by these numerical comparisons, highlighting an underappreciated bias in the literature.

Serial Interval Distributions and Implications of Truncation

Analysis of the SI datasets from Ali et al. (2020) and Du et al. (2020) illustrates:

  • Inclusion of negative intervals (presymptomatic transmission) leads to broader SI distributions and alters the inferred temporal profile of RtR_t.
  • As interventions progress during an epidemic, the mean SI shortens (reflecting the impact of NPIs), with corresponding reductions in RtR_t.
  • Excluding negative SIs (e.g., using only t>0t > 0) erases this effect and leads to distribution mis-specification; truncated data best fit lognormal rather than normal distributions in this regime.

Distributional Choices: Gaussian vs Gamma and Lognormal

For measured generation times (always positive), gamma distributions are more appropriate. The comparison in the appendix shows:

  • Gamma-distributed GTs yield lower R0R_0 and RtR_t values compared to Gaussians with the same moments, while both remain above those computed using SI proxies or untruncated Gaussians.
  • The shapes of temporal RtR_t profiles are robust to the choice of distribution, but the excursion magnitudes vary.

Practical Implications

Epidemiological Surveillance and Model Calibration

  • Truncation of SIs is essential for correct estimation of reproduction numbers in ongoing surveillance and model calibration tasks.
  • Estimation of vaccination coverage and intervention effectiveness critically depends on accurate R0R_0, necessitating proper treatment of serial interval distributions in renewal-based and Lotka-Euler-based approaches.
  • Retrospective estimation is required for RtR_t when negative serial intervals are present because future incidence informs past infection events.

Guidance for Statistical Modelling

  • When fitting SI data for COVID-19 or similar pathogens with presymptomatic transmission, Gaussian models must be truncated at the lower bound defined by biological constraints.
  • For generation time estimation and prediction, gamma or lognormal distributions should be preferred unless the empirical distribution is decisively symmetric and sharply peaked.

Speculation on Future Developments

  • As more pathogens with presymptomatic transmission are identified, standardized procedures for truncation and distribution selection must be codified in surveillance practice.
  • Expansion to joint inference frameworks incorporating incubation periods, infectiousness profiles, and SI distributions will enhance reproduction number estimation.
  • Further methodological research may develop more flexible distributions or copula approaches capturing the full range of observed intervals.

Conclusion

This paper systematically demonstrates that accurate estimation of reproduction numbers in epidemics with presymptomatic transmission—such as COVID-19—requires Gaussian distributions of serial intervals to be truncated at lower limits consistent with biological reality. Neglecting this truncation, as is common, leads to underestimation of R0R_0 and RtR_t, with downstream consequences for intervention strategy and policy. The findings underscore the necessity for robust epidemiological methods and careful statistical practice in real-time infectious disease modelling. The theoretical developments and comparative analyses herein suggest avenues for improved surveillance, reporting, and calibration of dynamic models in future epidemic settings.

Whiteboard

Open Problems

We're still in the process of identifying open problems mentioned in this paper. Please check back in a few minutes.

Authors (1)

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 0 likes about this paper.