Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 175 tok/s
Gemini 2.5 Pro 52 tok/s Pro
GPT-5 Medium 36 tok/s Pro
GPT-5 High 38 tok/s Pro
GPT-4o 92 tok/s Pro
Kimi K2 218 tok/s Pro
GPT OSS 120B 442 tok/s Pro
Claude Sonnet 4.5 38 tok/s Pro
2000 character limit reached

M/M/c Queue Model: Analysis & Extensions

Updated 9 November 2025
  • M/M/c Queue Model is a continuous-time, Markovian multi-server queue characterized by Poisson arrivals, exponential service times, and Erlang’s C formula.
  • The model enables closed-form computation of key metrics such as server utilization, waiting probability, and mean sojourn time, supporting robust system design.
  • Extensions include priority schemes, state-dependent service, and extreme-value analysis, broadening its application in telecommunications and operational research.

The M/M/cM/M/c queue is a classical continuous-time, Markovian multi-server queueing model central to queueing theory, performance analysis, and operations research. Defined by Poisson arrivals, cc parallel servers, and independent exponential service times, it underlies a wide range of performance models in e.g. telecommunications, call centers, and service systems. Recent research continues to extend, compute, and apply its probabilistic properties: from joint queue-length distributions and extreme-value statistics to busy-period durations, multi-priority systems, and state-dependent service mechanisms.

1. Formal Model Description

Let arrivals form a Poisson process of rate λ\lambda. There are cc identical servers, each independently serving one customer at exponential rate μ\mu. The system allows infinite waiting room; arrivals that find fewer than cc customers in service immediately begin service, while those finding all cc servers occupied join a first-come-first-served backlog queue. Stability requires ρ:=λ/(cμ)<1\rho := \lambda/(c\mu) < 1. The system state is X(t)=nX(t) = n (0n<0 \le n < \infty), the total number of customers present. The process is birth–death, with rates: λn=λn,\lambda_n = \lambda \quad \forall n,

μn={nμ,nc cμ,nc\mu_n = \begin{cases} n\mu, & n \le c \ c\mu, & n \ge c \end{cases}

Balance equations yield the steady-state probabilities Pn=limtP{X(t)=n}P_n = \lim_{t\to\infty}\mathbb{P}\{X(t)=n\}. The normalization is: 1P0=j=0c1(cρ)jj!+(cρ)cc!(1ρ)\frac1{P_0} = \sum_{j=0}^{c-1} \frac{(c\rho)^j}{j!} + \frac{(c\rho)^c}{c!(1-\rho)} The explicit forms are: Pn={P0(cρ)nn!,n<c P0(cρ)cc!ρnc,ncP_n = \begin{cases} P_0 \frac{(c\rho)^n}{n!}, & n < c \ P_0 \frac{(c\rho)^c}{c!}\rho^{n-c}, & n \geq c \end{cases} The probability that an arrival must wait (all servers busy) is: P{wait}=(cρ)cc!(1ρ)P0P\{\text{wait}\} = \frac{(c\rho)^c}{c!(1-\rho)}P_0 This is Erlang’s CC formula and is foundational across contexts requiring multi-server delay analysis.

2. Key Performance Metrics and First-Order Properties

The principal performance metrics and their closed forms are as follows:

Measure Expression Comments
Utilization per server ρ=λcμ\rho = \frac{\lambda}{c\mu} <1< 1 for stability
Mean number in queue (LqL_q) Lq=P{wait}ρ1ρ=(cρ)cc!(1ρ)2P0L_q = P\{\text{wait}\}\frac{\rho}{1-\rho} = \frac{(c\rho)^c}{c!(1-\rho)^2}P_0 Excludes those in service
Mean queueing time (WqW_q) Wq=Lq/λW_q = L_q/\lambda Little’s law
Mean system size (LL) L=Lq+λ/μL = L_q + \lambda/\mu LqL_q plus those in service
Mean sojourn time (WW) W=Wq+1/μW = W_q + 1/\mu Includes service time
Probability all servers idle (P0P_0) [see above][\,\text{see above}\,] Explicit sum

These explicit steady-state forms enable closed-form, numerically stable computation for arbitrary (λ,μ,c)(\lambda, \mu, c), rendering the M/M/cM/M/c queue tractable for analytic, design, and simulation contexts (Finch, 2019).

3. Distributional and Extreme-Value Properties

Beyond stationary means, recent work has addressed the high-level behavior of the queue, particularly the distribution of the maximum queue length over a finite interval. Let Mn=maxt[0,n]{queue length at t}M_n = \max_{t\in [0, n]}\{\text{queue length at } t\} (excluding those in service). The Poisson clumping heuristic provides an accurate asymptotic approximation for this extreme-value distribution.

Let q=λ/(cμ)q = \lambda/(c\mu) and define k=log1/q(n)+h+1k = \log_{1/q}(n) + h + 1. The probability that the maximum does not exceed kk is, for large nn,

Pr{Mnlog1/q(n)+h}exp[Cqh+1]\Pr\{M_n \leq \log_{1/q}(n) + h\} \approx \exp\left[-C\, q^{h+1}\right]

where

C=cc2λμc3(cμλ)2j=1cj!(c1j1)λcjμj1C = c^{c-2}\lambda\mu^{c-3}(c\mu - \lambda)^2 \sum_{j=1}^c j! \binom{c-1}{j-1} \lambda^{c-j}\mu^{j-1}

Higher-order (finite-nn) corrections introduce an additional bracketed term inside the exponential, improving the fit for moderate nn (10310^310410^4). The limit law is a discrete Gumbel and governs extremes for general cc (Finch, 2019). Mean and variance estimates are given by: E[Mn]lnnln(1/q)+γ+ln(Cq)ln(1/q)+12,\mathbb{E}[M_n] \approx \frac{\ln n}{\ln(1/q)} + \frac{\gamma + \ln(C q)}{\ln(1/q)} + \frac{1}{2},

Var(Mn)π2/6[ln(1/q)]2+112\mathrm{Var}(M_n) \approx \frac{\pi^2/6}{[\ln(1/q)]^2} + \frac{1}{12}

These expressions match simulation results to a high degree of accuracy.

4. Extensions: Priority Classes and State-Dependent Service

Priority Extensions

Multi-class and multi-priority M/M/cM/M/c models have been extended to both non-preemptive and preemptive regimes.

Non-preemptive priority with KK classes: Poisson arrivals at rates λκ\lambda_\kappa, exponential services, cc servers. Zuk and Kirszenblat (Zuk et al., 2023) derive explicit forms for the joint queue-length generating function G0(z1,,zK1)G_0(z_1,\ldots,z_{K-1}) as a product of root-ratio terms, and invert it via multidimensional FFT to recover the joint stationary distribution. The fixed-point iteration for the vectorized stationary balance equations is robust and probability-preserving, with convergence slowed only as r1r\to 1 or KK increases. The methodology yields all joint and marginal performance measures and has been numerically validated to high precision.

Preemptive two-class priority: The continuous-time Markov structure yields Laplace transforms for all finite-time transition probabilities via spectral methods, including explicit closed forms for the vertical boundary (states with jcj\ge c high-priority customers). The general case is recursively tractable with Ramaswami’s formula adapted from M/G/1M/G/1 Markov chains (Selen et al., 2016).

State-Dependent and Time-Dependent Service

Service rates affected by experienced waiting time are considered in (D'Auria et al., 2021): each arrival is "tagged" by whether its virtual waiting time at entry exceeds a threshold τ\tau, and is served at either rate μ1\mu_1 (if WτW\leq\tau) or μ2\mu_2 (if W>τW>\tau). The model is formulated via a Markov process tracking the virtual queueing time and server occupancy, matched to an embedded process with piecewise ODE structure. The stationary distribution is characterized recursively and in closed form as a mixture of exponentials for the waiting time, with rates determined as zeros of characteristic polynomials tied to the underlying Markov dynamics. When μ1μ2\mu_1\ne \mu_2, performance can change drastically, with slowdown (μ2<μ1\mu_2<\mu_1) causing heavy-tailed delays.

5. Partial Busy Periods and Time-Dependent Behavior

The partial busy period (TbpT_{\mathrm{bp}}): the time from system entry by an arrival (when queue was empty) until the system empties again, is analyzed explicitly for general cc in (Zuk et al., 2023). For c=2c=2 the result connects to classic Karlin–McGregor integral representations. For arbitrary cc, two main techniques are provided:

  • Spectral method: The Laplace transform ϕ1(s)\phi_1(s) is written in terms of resolvent sums over eigenvalues of the finite-dimensional generator, leading to a survival function as a sum of exponentials (from simple poles) plus a branch cut integral.
  • Algebraic method: The closed-form representation involves a sum over Marcum Q-functions, with parameters and coefficients arising from roots of a uniquely defined "cut polynomial" of degree $2c-3$.

Large-cc asymptotics distinguish two regimes: fixed-traffic intensity (r<1r<1) gives a mixture of empty and exponentially-distributed periods; the M/M/M/M/\infty scaling yields mixtures tied to zeros of Kummer functions.

Efficient computation leverages eigenvalue routines (e.g., LAPACK’s {\tt dstevd}), Chebyshev quadrature, and FFT-based polynomial root solvers. These approaches allow full distributional computation for cc up to O(102)O(10^2).

6. Analytical and Computational Methods

Explicit forms for steady-state and transient distributions are available in the classical case; closed-form, efficiently computable representations for time-dependent, multi-class, and state-dependent extensions require:

  • Reduction to birth–death or block-tridiagonal Markov chains;
  • Laplace transform (spectral) analysis, inversion via residue and cut contours;
  • Explicit, finite-sum representations (Marcum-Q, Bessel series, or mixture of exponentials) using parameter polynomials derived from model structure;
  • Fixed-point or matrix-analytic recursions (Ramaswami-like algorithms) for balance equations in high-dimensional spaces;
  • Accelerated numerical inversion (FFT-based or Gauss–Chebyshev quadrature) for generating functions and transforms, maintaining probability normalization and exploiting structure for tractability;
  • Direct simulation for empirical validation, as in the extreme-value paper of (Finch, 2019).

Accuracy, robustness, and computational efficiency are achieved up to c102c \sim 10^2 and, for priority models, K3K\sim 3–$5$.

7. Applications and Implications

The M/M/cM/M/c model and its extensions directly address:

  • Performance guarantees in service systems (delay probabilities, maximal queue length, mean waiting times);
  • Regenerative simulation interval estimation and confidence procedures (busy period distributions) (Zuk et al., 2023);
  • Service system design; e.g., setting cc to meet SLA thresholds (via Erlang CC or joint queue-length measures);
  • Priority queueing in telecommunications, health care, or call centers (preemptive vs non-preemptive delay, fairness, overflow probabilities);
  • Queueing-time dependent dynamical control policies and their impact.

The collective body of work reaffirms the tractability and flexibility of the M/M/cM/M/c queue for both theoretical and applied queueing analysis, while delineating the frontiers of what is computable: joint queue-lengths for many classes require FFT inversion; busy periods demand spectral tools; maxima require clumping heuristics. Generalizations for non-Markovian service and arrival, reneging, or blocking remain challenging.

8. Context and Future Research

Recent advances yield the most explicit known formulas for joint queue-lengths (Zuk et al., 2023), partial busy periods (Zuk et al., 2023), and time-dependent multi-class probabilities (Selen et al., 2016) for M/M/cM/M/c systems. These results fill longstanding gaps, especially for c>2c>2 and general KK.

A plausible implication is that similar algebraic–spectral hybrid techniques could be adapted to M/G/cM/G/c or GI/M/cGI/M/c queues, though the loss of Markov structure presents substantial obstacles. As system parameters scale (large cc, near-critical ρ1\rho\to 1), asymptotic analysis and numerically stable algorithms remain an ongoing research frontier.

The M/M/cM/M/c queue retains a central role in queueing theory both as a reference model and as a basis for extendable, analytically tractable specializations in modern stochastic network analysis.

Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to M/M/c Queue Model.