Formalize selfishness measurement via propagation effects in Machiavelli
Construct a complete and explicit description that maps a player’s choices in the Machiavelli benchmark to their effects on other characters’ abilities to propagate their information, thereby enabling a principled and operational measurement of selfishness as defined by the authors.
References
Unfortunately, we lack a complete description of how the player's choices affect other characters' abilities to propagate their information.
— Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark
(2304.03279 - Pan et al., 2023) in Appendix A: Additional Harmful Behaviors (Selfishness)