Streaming Submodular Maximization

Updated 22 June 2026

Streaming submodular maximization is a framework for selecting high-quality subsets from sequential data streams under strict memory and time constraints.
Techniques such as threshold greedy, multi-pass, and randomized methods achieve competitive approximations for both monotone and non-monotone objectives across diverse constraints.
Applications in data summarization, sensor placement, and recommendation systems demonstrate its practical impact in processing large-scale, real-time data.

Streaming submodular maximization refers to the family of algorithms and theoretical frameworks for maximizing submodular functions in the streaming model, where a large ground set arrives sequentially and the algorithm must process each item on-the-fly with limited memory, typically in a single pass. This setting encompasses monotone and non-monotone objectives, various constraint structures (such as cardinality, matroid, knapsack, p-matchoid, set systems), and modern extensions including fairness, robustness, and dynamic/decaying data. Streaming submodular maximization is central to large-scale data summarization, sensor placement, recommendation, and clustering, especially when datasets are too large to fit in memory or must be processed in real time.

1. Formal Problem Statement and Key Principles

Given a ground set $V$ arriving as a stream and a submodular function $f:2^{V}\to\mathbb R_{\ge0}$ (so $f$ is monotone if $f(A)\leq f(B)$ for $A\subseteq B$ and in general satisfies $f(S\cup\{e\})-f(S)\ge f(T\cup\{e\})-f(T)$ for $S\subseteq T$ ), the core objective is

$\max_{S\subseteq V,\,S\in\mathcal{F}} f(S),$

where $\mathcal{F}$ encodes feasibility constraints—typically a cardinality constraint $|S|\leq k$ , but also matroid, knapsack, or $f:2^{V}\to\mathbb R_{\ge0}$ 0-matchoid independence systems.

In the streaming model:

Elements arrive one-by-one; each decision to keep or discard is irrevocable (except for limited “postprocessing”).
The algorithm is constrained to store $f:2^{V}\to\mathbb R_{\ge0}$ 1 or $f:2^{V}\to\mathbb R_{\ge0}$ 2 memory, sublinear in $f:2^{V}\to\mathbb R_{\ge0}$ 3.
Per-element processing is required to be constant or logarithmic time.

Classic examples include facility location and coverage, DPP-based video summarization, network analysis, and combinatorial public project problems (Buschjäger et al., 2020, Norouzi-Fard et al., 2018, Mirzasoleiman et al., 2017).

2. Algorithmic Frameworks and Guarantees

2.1 Thresholded Greedy and Sieve Methods

The foundational approach is the “threshold greedy” or Sieve-Streaming paradigm (Buschjäger et al., 2020)—at each arrival, the algorithm tests if the element’s marginal gain $f:2^{V}\to\mathbb R_{\ge0}$ 4 exceeds a moving threshold (often related to an estimate of $f:2^{V}\to\mathbb R_{\ge0}$ 5), and, if so, admits $f:2^{V}\to\mathbb R_{\ge0}$ 6, provided constraints allow. This admits rigorous resource bounds:

Memory: $f:2^{V}\to\mathbb R_{\ge0}$ 7 elements
Computation: $f:2^{V}\to\mathbb R_{\ge0}$ 8 function oracle calls per item

For cardinality constraints, the Sieve-Streaming algorithm of Badanidiyuru et al. achieves a worst-case $f:2^{V}\to\mathbb R_{\ge0}$ 9-approximation, matching known hardness (Norouzi-Fard et al., 2018, Feldman et al., 2020).

The ThreeSieves algorithm refines this with a single adaptive threshold and a “Rule-of-Three” statistical early-lowering mechanism, yielding, under i.i.d. streams, an $f$ 0-approximation with probability at least $f$ 1, while maintaining strict $f$ 2 memory and constant-per-item time (Buschjäger et al., 2020).

More advanced schemes maintain multiple candidate solutions (QuickStream (Kuhnle, 2020), SALSA (Norouzi-Fard et al., 2018)), random substream sampling (Feldman et al., 2018), or parameterized buffer and thresholding via Stream Clipper (Zhou et al., 2016).

2.2 Multi-Pass and Random-Order Variants

Multi-pass algorithms, such as those in (Huang et al., 2021), raise the approximation to $f$ 3 for cardinality and matroid constraints with $f$ 4 passes, exploiting the continuous greedy paradigm. For random-order streams, the $f$ 5-barrier can be broken even in single-pass (e.g., SALSA achieves $f$ 6 in expectation in random order (Norouzi-Fard et al., 2018)); adversarial order remains bottlenecked at $f$ 7 except with exponential memory (Feldman et al., 2020).

2.3 General Constraints: Matroids, Matchoids, and Beyond

Generalizing to matroids and $f$ 8-matchoids, algorithms need to handle independence-oracle calls and sometimes explicitly maintain exchange structures or chains of candidate sets (Chekuri et al., 2015, Feldman et al., 2021, Huang et al., 2021, Haba et al., 2020):

For $f$ 9-matchoids, O $f(A)\leq f(B)$ 0-space/one-pass algorithms exist with $f(A)\leq f(B)$ 1 approximation for monotone (Feldman et al., 2018, Chekuri et al., 2015).
For k-extendible/k-set systems, streaming reductions from monotone to non-monotone cases deliver O $f(A)\leq f(B)$ 2- and O $f(A)\leq f(B)$ 3-approximations (Haba et al., 2020).
For matroids proper, Chakrabarti–Kale achieves $f(A)\leq f(B)$ 4 in one pass; Huang–Thiery–Ward raise this to $f(A)\leq f(B)$ 5; multi-pass or random-order variants reach $f(A)\leq f(B)$ 6 (Feldman et al., 2021).
Generalization to $f(A)\leq f(B)$ 7-submodular functions is algorithmically supported in both monotone and non-monotone settings, with constant-factor guarantees (e.g., $f(A)\leq f(B)$ 8 for non-monotone cardinality) (Spaeh et al., 2023).

2.4 Knapsack and Multiresource Constraints

For d-knapsack constraints, the “per-weight marginal gain” paradigm is essential (Yu et al., 2016). The leading streaming algorithm delivers a $f(A)\leq f(B)$ 9-approximation, running in one pass with $A\subseteq B$ 0 memory where $A\subseteq B$ 1 is the capacity.

2.5 Dynamic and Robust Streaming

Extensions to data streams with inhomogeneous decay (“sliding window” and variable-lifetime models) (Zhao et al., 2018), and robust maximization (i.e., where some items may be deleted post-stream) employ partitioned thresholding, histogram approaches, or buffering to achieve guarantees with low overhead (Mitrović et al., 2017).

3. Extensions: Fairness, Privacy, and Non-Monotonicity

3.1 Fairness

Fairness in streaming submodular maximization is implemented using lower and upper quotas per group/attribute, realized as partition matroid or laminar-matroid constraints (Halabi et al., 2020, Halabi et al., 2023). A black-box reduction maps the fair problem to standard matroid streaming, giving tight $A\subseteq B$ 2- or $A\subseteq B$ 3-approximations with $A\subseteq B$ 4 space for monotone objectives. For non-monotone functions, sample-based streaming yields randomization-dependent approximation (e.g., $A\subseteq B$ 5).

3.2 Differential Privacy

Differentially private streaming maximization is feasible via Laplace or Gumbel perturbation in the thresholded greedy steps (Chaturvedi et al., 2022). For general instances, a $A\subseteq B$ 6-approximation minus $A\subseteq B$ 7 additive error is possible; for decomposable functions, Gumbel-based approaches reduce the error to $A\subseteq B$ 8.

3.3 Non-Monotone Objectives

For streaming non-monotone submodular maximization, approaches exploit multiple independent chains, random sampling, or reductions to monotone routines. State-of-the-art single-pass streaming for cardinality and $A\subseteq B$ 9-matchoid constraints achieves $f(S\cup\{e\})-f(S)\ge f(T\cup\{e\})-f(T)$ 0-approximation with polynomial space for cardinality (Alaluf et al., 2019), and $f(S\cup\{e\})-f(S)\ge f(T\cup\{e\})-f(T)$ 1-approximation for $f(S\cup\{e\})-f(S)\ge f(T\cup\{e\})-f(T)$ 2-matchoids (Feldman et al., 2018, Mirzasoleiman et al., 2017). Streaming Local Search provides principled reductions for complex constraints including multiple knapsacks or matroids (Mirzasoleiman et al., 2017).

4. Empirical Performance and Use Cases

Streaming submodular maximization algorithms exhibit strong empirical performance on summarization, recommendation, clustering, and influence maximization (Buschjäger et al., 2020, Zhou et al., 2016, Norouzi-Fard et al., 2018, Yu et al., 2016). Key trends from benchmarks:

Modern methods (ThreeSieves, QuickStream, Stream Clipper) match or nearly match offline greedy baselines, often within 1–5% on effective objective value, while consuming orders of magnitude less computation and memory.
Adaptive schemes (e.g., ThreeSieves) handle mild concept drift, maintaining high performance.
On recommendation and clustering tasks, fairness constraints can be enforced at minimal (<15%) additional cost (Halabi et al., 2020, Halabi et al., 2023).
For dynamic window/decaying streams, HistApprox and HistStreaming maintain 80–95% of offline solution quality with significant speed and memory savings (Zhao et al., 2018).

5. Hardness, Trade-Offs, and Theoretical Limits

Fundamental limits govern what can be achieved in streaming submodular maximization:

For monotone objectives with cardinality constraints and adversarial-order streams, any single-pass, $f(S\cup\{e\})-f(S)\ge f(T\cup\{e\})-f(T)$ 3-memory algorithm cannot beat $f(S\cup\{e\})-f(S)\ge f(T\cup\{e\})-f(T)$ 4-approximation (Norouzi-Fard et al., 2018, Feldman et al., 2020).
Higher ratios (up to $f(S\cup\{e\})-f(S)\ge f(T\cup\{e\})-f(T)$ 5) require either multiple passes, exponential memory, or random-order streams.
For matroid and $f(S\cup\{e\})-f(S)\ge f(T\cup\{e\})-f(T)$ 6-matchoid constraints, the best single-pass ratios scale as $f(S\cup\{e\})-f(S)\ge f(T\cup\{e\})-f(T)$ 7; multi-pass approaches can close the gap to $f(S\cup\{e\})-f(S)\ge f(T\cup\{e\})-f(T)$ 8 but require $f(S\cup\{e\})-f(S)\ge f(T\cup\{e\})-f(T)$ 9 passes (Huang et al., 2021, Feldman et al., 2021).
For non-monotone objectives, ratios better than $S\subseteq T$ 0 require exponential time offline post-processing or polynomial space/memory.
Fairness and privacy induce additional trade-offs: achieving both strong utility and strict fairness constraints typically costs a factor of $S\subseteq T$ 1 or more in approximation or requires multi-pass computation (Halabi et al., 2020, Halabi et al., 2023).

6. Practical Parameter Tuning and Implementation Guidelines

Algorithm selection and parameterization are dictated by resource and application constraints:

When one-pass operation and single-item latency are mandatory, and input streams are near-i.i.d., algorithms like ThreeSieves ( $S\subseteq T$ 2, $S\subseteq T$ 3-- $S\subseteq T$ 4) optimize for high probability near-optimal solutions in $S\subseteq T$ 5 space (Buschjäger et al., 2020).
For matroid, $S\subseteq T$ 6-matchoid, or knapsack, space and query complexity scale with $S\subseteq T$ 7 and $S\subseteq T$ 8; memory must be budgeted accordingly (Chekuri et al., 2015, Yu et al., 2016, Feldman et al., 2018).
Concept drift should be handled by active monitoring and potentially restart mechanisms.
For high-throughput streaming, threshold parameters should be empirically calibrated to balance accept/reject rates against available memory.
Privacy-sensitive applications should prefer Gumbel-perturbation when feasible for improved utility/accuracy scaling (Chaturvedi et al., 2022).

The field continues to evolve in dimensions such as adaptive streaming, robust and private maximization, complex combinatorial constraints, and real-time, high-speed data summarization. Mathematical hardness results delineate the boundaries for approximation, resource usage, and model-driven relaxations. Leading algorithms are accompanied by open-source implementations and validated across large-scale, real-world data sets, consolidating streaming submodular maximization as a bedrock technique in modern data-intensive AI.