Goodhart–Campbell Transition Threshold
Establish the existence of a critical capability threshold B* under Assumption C1 (Passive Evaluation Degradation, namely that the effective evaluation coverage K_eff is non-increasing in capability B) and the strategic manipulation extension (the agent may allocate manipulation resources m that reduce evaluation coverage via K(m) = K_0 − h(m)), such that: (i) for B < B*, the agent allocates zero resources to manipulation (m* = 0) and remains in the Goodhart regime where the evaluation system is taken as fixed; (ii) for B > B*, the agent allocates positive resources to manipulation (m* > 0) and enters the Campbell regime in which effective evaluation coverage declines endogenously; and characterize B* by the condition that the marginal benefit of manipulation equals the marginal cost from the reduced production budget.
References
Conjecture 1 (Goodhart-Campbell Transition). Under Assumption C1 and the strategic manipulation extension, there exists a critical capability level B* such that:
- For B < B*: the agent devotes all resources to production (m* = 0). The Goodhart regime obtains, and Propositions 1--2 fully characterize agent behavior.
- For B > B*: the agent devotes positive resources to evaluation degradation (m* > 0). The Campbell regime obtains, and effective evaluation coverage declines endogenously.
- The threshold B* is determined by the condition that the marginal benefit of manipulation (from relaxing the evaluation constraint) equals the marginal cost (from reduced production budget).