Evaluate transitivity assumptions in moral graph wisdom relations
Ascertain whether "wiser than" relations between values in the moral graph are transitive beyond a single step and determine the implications for PageRank-based aggregation and downstream training when transitivity may fail.
References
Note that we assume a level of transitivity here for moral values. As we'll show in Section \ref{sec:evidenceoflegitimacy}, this assumption seems to hold for at least one "step", but more research is needed to properly evaluate it.
— What are human values, and how do we align AI to them?
(2404.10636 - Klingefjord et al., 27 Mar 2024) in Section 4.2 (What is a moral graph?)