Handling cycles in the moral graph arising from hard-power conflicts
Develop a principled method for resolving cycles in the moral graph that arise from fundamentally win–lose power dynamics, where no balancing value exists. Candidate approaches include fracturing the model into separate personalized models or deciding by vote which values to use.
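To make the two candidate approaches concrete, here is a minimal Python sketch. It is not the paper's method, since the paper states that its process has no answer for these cycles. Everything in it is an illustrative assumption: edges are taken to be (less_wise, wiser) pairs, and the names find_cycle, resolve_by_vote, and fracture_by_vote are hypothetical, as is the choice of plurality voting.

```python
from collections import Counter


def find_cycle(edges):
    """Return one directed cycle as a list of values, or None if acyclic.

    Assumption: each edge is a (less_wise, wiser) pair of value labels.
    """
    graph = {}
    for src, dst in edges:
        graph.setdefault(src, []).append(dst)
        graph.setdefault(dst, [])

    WHITE, GRAY, BLACK = 0, 1, 2  # unvisited, on the current path, finished
    color = {node: WHITE for node in graph}
    path = []

    def visit(node):
        color[node] = GRAY
        path.append(node)
        for nxt in graph[node]:
            if color[nxt] == GRAY:
                # Back edge: the cycle is the part of the path from nxt onward.
                return path[path.index(nxt):]
            if color[nxt] == WHITE:
                found = visit(nxt)
                if found:
                    return found
        path.pop()
        color[node] = BLACK
        return None

    for node in graph:
        if color[node] == WHITE:
            cycle = visit(node)
            if cycle:
                return cycle
    return None


def resolve_by_vote(cycle, ballots):
    """Plurality vote among the values in the cycle (hypothetical rule).

    ballots maps a participant id to the value that participant prefers.
    Ties are broken alphabetically so the result is deterministic.
    """
    members = set(cycle)
    tally = Counter(v for v in ballots.values() if v in members)
    if not tally:
        return None
    return max(sorted(tally), key=tally.get)


def fracture_by_vote(cycle, ballots):
    """Group participants by preferred value, one personalized model per group."""
    members = set(cycle)
    groups = {}
    for user, value in sorted(ballots.items()):
        if value in members:
            groups.setdefault(value, []).append(user)
    return groups
```

With edges A to B, B to C, and C to A, find_cycle reports the cycle over A, B, and C. The two resolution functions then show the trade-off in the problem statement: voting picks a single value for everyone, while fracturing keeps every value but splits participants across separate models. Neither addresses whether the outcome is principled, which is the open question.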
References
Our process has no answer what to do with these cycles.
— What are human values, and how do we align AI to them?
(2404.10636 - Klingefjord et al., 27 Mar 2024) in Subsection “Limitations” (Hard Power), Section Discussion