Papers
Topics
Authors
Recent
Search
2000 character limit reached

Submodular Set Coverage Maximization

Updated 23 May 2026
  • Submodular set coverage maximization is the problem of selecting subsets to maximize a monotone submodular objective under cardinality constraints, commonly applied in coverage tasks.
  • It employs methods like the Nemhauser–Wolsey–Fisher greedy and continuous greedy algorithms, achieving approximation ratios such as 1-1/e and 1-(1-c)^(1/c) based on the subset size.
  • Advanced techniques including LP/SDP rounding and symmetry-gap constructions reveal structural separations, enhancing performance in diverse settings like distributed and noisy models.

Submodular set coverage maximization concerns the selection of subsets under cardinality or other constraints to maximize a monotone submodular objective—often, but not exclusively, a classical coverage function. This problem lies at the heart of combinatorial optimization and intersects the general theory of submodular maximization, maximum coverage, and their algorithmic approximability. Recent advances have clarified the fundamental approximation barriers and have revealed separations between general submodular and pure coverage objectives when the subset size is a constant fraction of the ground set.

1. Problem Formulation and Classical Results

The archetypal problem instances are:

  • Maximum Coverage (MC): Given a ground set EE, a family of nn sets S1,…,Sn⊆ES_1,\dots,S_n\subseteq E and element weights we≥0w_e \ge 0, select kk sets to maximize the total weight of covered elements:

fMC(X)=∑e∈⋃i∈XSiwe,X⊆[n], ∣X∣≤kf_{\mathrm{MC}}(X) = \sum_{e \in \bigcup_{i \in X} S_i} w_e, \qquad X \subseteq [n],~|X| \le k

  • Monotone Submodular Maximization (SM): For a monotone submodular set function f:2U→R+f: 2^U \rightarrow \mathbb{R}_+ (i.e., f(A)≤f(B)f(A) \leq f(B) for A⊆BA \subseteq B and f(A∪{e})−f(A)≥f(B∪{e})−f(B)f(A \cup \{e\}) - f(A) \geq f(B \cup \{e\}) - f(B) for all nn0), select nn1 elements to maximize nn2.

The set coverage function is a canonical example of a monotone submodular function, but general submodular functions can encode more complex objectives.

The classical Nemhauser–Wolsey–Fisher greedy algorithm iteratively adds the element with largest marginal gain, achieving approximation ratio nn3 for both MC and general monotone SM, which is provably optimal—NP-hard to improve for MC [Feige ’98], and value-oracle optimal for general monotone submodular [Nemhauser–Wolsey ’78; (Filmus et al., 2024)].

2. c-Dependent Approximation and Separation Phenomenon

For cardinality constraint nn4 with constant nn5, the approximation barrier for monotone submodular maximization is strictly better than nn6 as nn7 increases:

nn8

This ratio smoothly interpolates from nn9 as S1,…,Sn⊆ES_1,\dots,S_n\subseteq E0 up to S1,…,Sn⊆ES_1,\dots,S_n\subseteq E1 at S1,…,Sn⊆ES_1,\dots,S_n\subseteq E2 (Filmus et al., 2024). The continuous greedy (or measured greedy) algorithm achieves this ratio via the multilinear extension technique, with rounding (pipage or swap) preserving expected value.

A matching value-oracle hardness holds at all rational points S1,…,Sn⊆ES_1,\dots,S_n\subseteq E3, via symmetry-gap constructions: No value-oracle algorithm can beat S1,…,Sn⊆ES_1,\dots,S_n\subseteq E4 in polynomial time [Vondrák ’13; (Filmus et al., 2024)].

Surprisingly, maximum coverage exhibits a strict separation at certain S1,…,Sn⊆ES_1,\dots,S_n\subseteq E5 values. For S1,…,Sn⊆ES_1,\dots,S_n\subseteq E6, MC admits a S1,…,Sn⊆ES_1,\dots,S_n\subseteq E7-approximation, exceeding the monotone submodular barrier S1,…,Sn⊆ES_1,\dots,S_n\subseteq E8 at this S1,…,Sn⊆ES_1,\dots,S_n\subseteq E9 (Filmus et al., 2024). This is achieved by blending LP-based solutions with uniform fractional allocations and leveraging SDP for vertex cover–like structures.

Problem Class Classical we≥0w_e \ge 00-of-we≥0w_e \ge 01 c-dependent best Known we≥0w_e \ge 02 ratio
Coverage (MC) we≥0w_e \ge 03 we≥0w_e \ge 04 possible we≥0w_e \ge 05
Submodular (SM) we≥0w_e \ge 06 we≥0w_e \ge 07 we≥0w_e \ge 08

This provides the first known separation in achievable approximation ratios between maximum coverage and general monotone submodular maximization under cardinality constraints (Filmus et al., 2024).

3. Algorithmic Techniques and Structural Phenomena

Greedy and Continuous Greedy

The classic greedy adds, in each step, the element or set giving the largest marginal gain we≥0w_e \ge 09. For monotone submodular kk0, this ensures at least a kk1 fraction of optimal value. The tight analysis proceeds via an inductive decrease of the distance from optimality, bounded using submodularity and diminishing returns [Nemhauser–Wolsey–Fisher; (Du et al., 2020)].

For improved kk2-dependent ratios, continuous greedy operates over the multilinear extension kk3 and maintains kk4 in the uniform matroid polytope, integrating the gradient direction, and stopping at a time kk5 yielding kk6 expected support. The value at kk7 is kk8 of optimum; pipage rounding gives an integral solution of matching expected value (see (Filmus et al., 2024)).

LP/SDP Rounding and Coverages

For coverage, further gains arise by solving the linear programming relaxation and then blending the fractional solution with a uniform solution, carefully using non-oblivious rounding so as to maximize coverage probabilities, especially for elements with low covering multiplicity. For MC with kk9, elements in exactly two sets can be analyzed to yield a fMC(X)=∑e∈⋃i∈XSiwe,X⊆[n], ∣X∣≤kf_{\mathrm{MC}}(X) = \sum_{e \in \bigcup_{i \in X} S_i} w_e, \qquad X \subseteq [n],~|X| \le k0 bound, and structural SDP rounding handles "pure 2-cover" cases for further improvement to fMC(X)=∑e∈⋃i∈XSiwe,X⊆[n], ∣X∣≤kf_{\mathrm{MC}}(X) = \sum_{e \in \bigcup_{i \in X} S_i} w_e, \qquad X \subseteq [n],~|X| \le k1 (Filmus et al., 2024).

4. Hardness and Symmetry-Gap Constructions

At fMC(X)=∑e∈⋃i∈XSiwe,X⊆[n], ∣X∣≤kf_{\mathrm{MC}}(X) = \sum_{e \in \bigcup_{i \in X} S_i} w_e, \qquad X \subseteq [n],~|X| \le k2 (fMC(X)=∑e∈⋃i∈XSiwe,X⊆[n], ∣X∣≤kf_{\mathrm{MC}}(X) = \sum_{e \in \bigcup_{i \in X} S_i} w_e, \qquad X \subseteq [n],~|X| \le k3), symmetry-gap instances exhibit a unique optimizer but any symmetric (fractional) solution achieves only fMC(X)=∑e∈⋃i∈XSiwe,X⊆[n], ∣X∣≤kf_{\mathrm{MC}}(X) = \sum_{e \in \bigcup_{i \in X} S_i} w_e, \qquad X \subseteq [n],~|X| \le k4 of the optimum—tight for general monotone submodular maximization. Hardness at non-integer fMC(X)=∑e∈⋃i∈XSiwe,X⊆[n], ∣X∣≤kf_{\mathrm{MC}}(X) = \sum_{e \in \bigcup_{i \in X} S_i} w_e, \qquad X \subseteq [n],~|X| \le k5 is shown by interpolating between covers with orbits of differing size (see Section 4 of (Filmus et al., 2024)).

For coverage, the combinatorial structure sometimes enables better rounding and algorithmic leverage, but no polynomial-time algorithm can beat fMC(X)=∑e∈⋃i∈XSiwe,X⊆[n], ∣X∣≤kf_{\mathrm{MC}}(X) = \sum_{e \in \bigcup_{i \in X} S_i} w_e, \qquad X \subseteq [n],~|X| \le k6 in general unless P=NP [Feige ’98; (Cellinese et al., 2018)].

  • Curvature-Dependent Guarantees: The performance of greedy algorithms for monotone submodular maximization can be characterized in terms of the curvature fMC(X)=∑e∈⋃i∈XSiwe,X⊆[n], ∣X∣≤kf_{\mathrm{MC}}(X) = \sum_{e \in \bigcup_{i \in X} S_i} w_e, \qquad X \subseteq [n],~|X| \le k7, with guarantee fMC(X)=∑e∈⋃i∈XSiwe,X⊆[n], ∣X∣≤kf_{\mathrm{MC}}(X) = \sum_{e \in \bigcup_{i \in X} S_i} w_e, \qquad X \subseteq [n],~|X| \le k8 (Du et al., 2020).
  • Generalized Budgeted Models: Coverage and submodular maximization problems extend to settings such as generalized budgeted constraints and mixed packing/covering, where similar greedy, LP, and continuous relaxation techniques provide constant-factor or bicriteria approximations (Cellinese et al., 2018, Mizrachi et al., 2018, Feldman et al., 14 Jul 2025).

6. Distributed, Streaming, and Noisy Models

Distributed and Streaming

Distributed coverage maximization (e.g., MapReduce settings) leverages composable core-sets and sketching. For example, greedy composable core-sets yield a fMC(X)=∑e∈⋃i∈XSiwe,X⊆[n], ∣X∣≤kf_{\mathrm{MC}}(X) = \sum_{e \in \bigcup_{i \in X} S_i} w_e, \qquad X \subseteq [n],~|X| \le k9-approximation distributedly, while more advanced algorithms (PseudoGreedy) achieve f:2U→R+f: 2^U \rightarrow \mathbb{R}_+0 with two MapReduce rounds (Mirrokni et al., 2015). Coverage sketches substantially reduce memory and communication, often to f:2U→R+f: 2^U \rightarrow \mathbb{R}_+1, while losing at most f:2U→R+f: 2^U \rightarrow \mathbb{R}_+2 in approximation (Bateni et al., 2016).

Noisy and Adaptive Settings

Submodular maximization with noisy evaluations is addressed through PAC-style adaptive sampling, achieving f:2U→R+f: 2^U \rightarrow \mathbb{R}_+3 approximations under both value and preference oracles (Singla et al., 2015). In adaptive submodular cover models under worst-case realizations, a f:2U→R+f: 2^U \rightarrow \mathbb{R}_+4-approximation is achieved using a greedy density-rule, matching the best-known for both deterministic and worst-case adaptive settings (Yuan et al., 2022).

7. Impact and Open Directions

These results close a long-standing gap in understanding the fine-grained approximability of submodular set coverage maximization for constant-fraction cardinality constraints and reveal that the coverage structure can enable strictly better performance than the generic submodular case in certain regimes. The dichotomy at f:2U→R+f: 2^U \rightarrow \mathbb{R}_+5 positions monotone submodular maximization as strictly harder than maximum coverage for f:2U→R+f: 2^U \rightarrow \mathbb{R}_+6 bounded away from f:2U→R+f: 2^U \rightarrow \mathbb{R}_+7 and f:2U→R+f: 2^U \rightarrow \mathbb{R}_+8.

Open directions include exploiting multiplicity and structure in coverage functions for further improvements, extending separation phenomena to other submodular specializations, and deepening the understanding of bicriteria, distributed, and learning-based variants.

Key References: (Filmus et al., 2024, Du et al., 2020, Cellinese et al., 2018, Singla et al., 2015, Bateni et al., 2016, Mizrachi et al., 2018, Yuan et al., 2022, Mirrokni et al., 2015, Feldman et al., 14 Jul 2025).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Submodular Set Coverage Maximization.