Selfish Mine Attack in Blockchains

Updated 30 December 2025

Selfish mining is an incentive-driven deviation in proof-of-work blockchains where miners withhold blocks to covertly build a private chain.
The attack exploits network tie-breaking mechanisms by releasing private blocks at opportune moments to capture an unfair portion of main-chain rewards.
Countermeasures include randomized tie-breaking, dummy-block protocols, and adaptive fork-choice rules to mitigate the threat and stabilize honest mining.

Selfish mine attack is an incentive-driven deviation from the honest protocol in longest-chain blockchains, most notably affecting Bitcoin and related Nakamoto-consensus systems. In this strategy, a colluding pool of miners deliberately withholds newly found blocks, constructing a private fork, and then selectively releases it to maximize its own share of main-chain rewards. The attack undermines the assumption of incentive compatibility foundational to Proof-of-Work blockchains; specifically, miners with as little as 25–33% of total hashpower can achieve a higher-than-proportional share of rewards if network conditions (measured by the honest tie-breaking parameter γ) are sufficiently favorable, threatening both decentralization and security (Eyal et al., 2013). The attack and its variants have been subjects of intense stochastic, game-theoretic, and protocol-analytic research, leading to numerous countermeasures and extended equilibrium analyses.

1. Foundations and Operational Mechanism

Selfish mining, formally introduced by Eyal and Sirer (Eyal et al., 2013), operates through strategic block withholding and selective release according to the private lead ℓ = (private branch length) – (public branch length). Each cycle proceeds as follows:

The attacker withholds blocks that give it a private lead (ℓ=1 or higher).
If the private lead increases (ℓ≥2), the attacker continues withholding.
If the honest network discovers a new block while ℓ=1, the attacker immediately publishes its previously hidden block, causing a public fork. A fraction γ of honest miners will mine on the attacker's fork, while the remainder favor the honest fork.
Upon resolution (attacker's private lead vanishes or is forced to reveal), the process resets.

The stationary distribution of ℓ can be computed by solving balance equations for the associated Markov chain. The attacker's expected relative revenue $R(\alpha, \gamma)$ is then given by the closed formula: $R(\alpha, \gamma) = \frac{\alpha(1-\alpha)^2(4\alpha + \gamma(1-2\alpha)) - \alpha^3}{1-\alpha[1+(2-\alpha)\alpha]}$ The minimum hashpower threshold for profitability is

$\alpha^* = \frac{1-\gamma}{3-2\gamma}$

For γ=½ (honest tie-breaking), selfish mining is profitable for α>0.25; for γ=0 (no network advantage), α>1/3 (Eyal et al., 2013).

2. Extended Strategies: Stubborn, Stealth, and Partial Selfish Mining

Variants such as stubborn mining (L-stubborn) and partial selfish mining generalize or optimize the original Eyal–Sirer attack:

L-Stubborn mining: The attacker maintains a private lead up to ℓ=L, matching honest blocks to avoid falling behind until a cutoff, then transitions to selfish mining in a two-dimensional Markov process. The attacker's long-run relative revenue ρ_L(α,γ) is computed using Catalan and ballot-number based path probabilities (Doger et al., 6 Jan 2025). An optimal stubbornness parameter L^* can be determined via integer-quasiconcave optimization.
Stealth mining: In the stealth variant, the attacker matches honest blocks only at the final matching step, concealing the attack longer but sometimes earning less mining revenue per cycle. This trade-off increases the probability of successful double-spending while reducing the risk of detection by merchants (Doger et al., 6 Jan 2025).
Partial Selfish Mining (PSM) and Advanced PSM (A-PSM): Partial block release, e.g., revealing block headers or using ZK-proofs to attract rational external miners to mine atop private forks, can further lower the profitability threshold below 1/3 if sufficient rational power is attracted to the attacker's side. In A-PSM, nonces are revealed only when the attacker is at risk of being overtaken, maximizing expected rewards (Yu et al., 2022).
Ethereum’s variants: In Ethereum, continuous difficulty adjustment and uncle/nephew rewards alter the profitability landscape. Dyck-word combinatorics allow closed-form computation of the attacker's perceived hashrate under multiple mining/release strategies. The profitability threshold in Ethereum can fall as low as 14% under certain uncle-inclusion parameters (Grunspan et al., 2019).

3. Equilibrium, Multi-Agent, and Dynamic Game Theoretic Analysis

Mining pool equilibria: In systems where all pools can be strategic, Nash equilibrium analysis shows that if the largest pool has ≤1/3 hashpower, honest mining is stable. Otherwise, at most two pools may profitably run counterstrategies such as insightful mining; insightful mining always yields strictly higher utility than selfish mining for pools of equal size, as it allows pools to reliably infer hidden private leads via infiltration ("spy" method) (Zhang et al., 2022).
Multi-agent reinforcement learning: Deep RL investigations reveal that selfish mining is not a profitable Nash equilibrium for three or more competing pools. Only in the single-attacker or two-attacker case does selfish mining yield above-proportional rewards. In open multi-pool settings, honest mining typically emerges as the stable equilibrium (Hou et al., 2019).
Reactive defenses—piggybacking: The piggyback counterstrategy allows sufficiently large honest pools to wait out selfish mining, then overpower the deviant with a longer withheld chain, capturing 100% of rewards and enabling double-spending. The mere threat of piggybacking suffices to stabilize honest mining as the payoff-dominant equilibrium for any pool over a threshold (Gal et al., 2023).

4. Protocol-Level Countermeasures

Tie-breaking and randomized relay: The γ-problem—attacks amplified by network relay advantages—can be addressed by enforcing uniform random tie-breaking among honest nodes. When uniformly random, γ→½, and the α threshold for profitable deviation increases to 25% for PoW (Bitcoin) (Eyal et al., 2013).
Dummy/dummy block timer mechanisms (e.g., ZeroBlock): ZeroBlock and analogous dummy-block protocols use timeouts (mat = expected inter-block time + propagation) to append a dummy block if no new valid block is heard. Any block not built upon the newest dummy is rejected. This closes the window for block withholding and raises the effective threshold for profitable selfish mining to 50% (Solat et al., 2016, Habib et al., 2023).
Inclusion of orphan blocks in difficulty adjustment: Selfish mining exploits the fact that orphaned blocks are ignored by standard difficulty algorithms; by including them (as in Ethereum uncles), post-adjustment profitability is neutralized, and attacks lose their advantage (Grunspan et al., 2018, Grunspan et al., 2019).
AI/learning-based adaptive fork-choice rules: Variable-depth learning automata (VDHLA), and related timestamp-weighted fork selection policies ("Nik Defense"), dynamically increase fork-selection strictness based on observed fork occurrence, penalizing selfish mining and adaptively raising the minimum profitable α to ≈0.45 (Nikhalat-Jahromi et al., 2023, Nikhalat-Jahromi et al., 2023).

5. Detection and Empirical Evidence

Statistical detection: Statistical tests based on block runs and clustering heuristics can detect selfish mining by identifying outlier proportions of consecutive block discoveries by single entities. Empirical studies have found nontrivial evidence of selfish mining and possible mining-cartels in altcoins such as Monacoin and Bitcoin Cash, with detection enhanced by address clustering algorithms (Li et al., 2022).
Undetectable variants: Undetectable selfish mining strategies generate block-orphan patterns that are statistically indistinguishable from honest mining under higher network latency, defeating orphan-rate-based detectors. Such strategies remain profitable for attackers controlling >38.2% of total hashrate (Bahrani et al., 2023).

6. Profitability Thresholds and Economic Feedback Effects

The classic break-even threshold for selfish mining is α > (1−γ)/(3−2γ). In the presence of significant network connectivity (high γ), even small colluding pools may profitably deviate.
In Proof-of-Work blockchains with elastic hash supply (entry/exit of miners in response to profitability), the critical threshold for chain collapse can fall to ≈29%, as miner departures further destabilize security (Shibuya et al., 2021).
Network fragmentation into many small pools may paradoxically lower the minimum α needed for profitable selfish mining: the residual centralization factor βₐ for a fragmented set makes it easier for the largest pool to capture disproportionate rewards (even for α well below 25%) (Sarenche et al., 11 Feb 2025).
Payment-driven attacks (selfish mining + double spending) exploit the k-confirmation rule: if an attacker's private fork ever reaches length k+1, cost-free double-spending becomes possible. Rigorous risk quantification and revenue optimization for combined mining/double-spend attacks are tractable via Catalan-like probabilistic models (Doger et al., 6 Jan 2025).

7. Strategic, Practical, and Protocol Implications

Selfish mining remains significant as a canonical example of incentive vulnerability in Nakamoto consensus, informing both the theoretical equilibrium analysis of blockchains and the practical design of defenses. Defensive recommendations include:

Protocol-level enforcement of uniform random tie-breaking or dummy-block insertion;
Adjusting confirmation parameters (k) in light of measured double-spend risk;
Counterstrategies that exploit private mining as a credible threat to deter deviations;
Immediate reporting of orphaned blocks for difficulty adjustment;
Adaptive AI-driven fork selection rules that raise thresholds for profitable deviation.

The literature demonstrates that while selfish mining remains a theoretically significant threat, equilibrium analyses, practical protocol modifications, and detection methodologies can substantially mitigate its feasibility—contingent on vigilance, network architecture, and implementation details. Cryptoeconomic modeling, reinforcement learning, and empirical studies collectively inform ongoing security assessment and protocol evolution (Eyal et al., 2013, Doger et al., 6 Jan 2025, Hou et al., 2019, Nikhalat-Jahromi et al., 2023, Bahrani et al., 2023).