Normative Module Alignment Conjecture
Establish that the Normative Module architecture for LLM-based generative agents makes agents normatively competent: able to interpret a community’s normative environment, identify the community’s authoritative classification institution, and accurately predict which candidate actions other agents will criticize. Establish further that this competence yields better alignment with community values.
References
When generative agents are designed this way, we conjecture that the normative module helps the agent interpret the normative environment of a given community and identify the authoritative source of rules for the group. The capacity to determine whether a source is authoritative enables the agent to predict more accurately which actions other agents will criticize, and hence leads an agent that seeks to avoid criticism to align better with community values. The normative module makes the generative agent normatively competent and thus supports better alignment.
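
The chain of reasoning in the conjecture (identify the authoritative institution, predict criticism, avoid criticized actions) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not taken from the source: the `Institution` type, the function names, and the use of majority deference among observed agents as a proxy for authoritativeness are all hypothetical, and a real normative module would rely on an LLM rather than a lookup.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Optional


@dataclass(frozen=True)
class Institution:
    """Hypothetical classification institution: labels some actions as prohibited."""
    name: str
    prohibited: FrozenSet[str]


def identify_authoritative(institutions: List[Institution],
                           endorsements: Dict[str, str]) -> Institution:
    """Return the institution that the most observed agents defer to.

    `endorsements` maps an observed agent's id to the name of the institution
    it cites. Majority deference is an assumed stand-in for authoritativeness.
    """
    counts = {inst.name: 0 for inst in institutions}
    for cited in endorsements.values():
        if cited in counts:
            counts[cited] += 1
    # Ties resolve to the institution listed first.
    return max(institutions, key=lambda inst: counts[inst.name])


def predict_criticism(action: str, authority: Institution) -> bool:
    """Predict that others will criticize actions the authority prohibits."""
    return action in authority.prohibited


def choose_action(candidates: List[str],
                  authority: Institution) -> Optional[str]:
    """Pick the first candidate action not predicted to draw criticism."""
    for action in candidates:
        if not predict_criticism(action, authority):
            return action
    return None  # every candidate is predicted to be criticized


# Illustrative data only: two competing institutions and three observed agents.
elders = Institution("elders", frozenset({"hunt_at_night"}))
outsider = Institution("outsider", frozenset({"gather_berries"}))
endorsements = {"a1": "elders", "a2": "elders", "a3": "outsider"}

authority = identify_authoritative([elders, outsider], endorsements)
choice = choose_action(["hunt_at_night", "gather_berries"], authority)
```

In this toy setting the agent defers to the institution most of its peers cite, so its predictions about criticism track that institution's classifications rather than the competing one's. That is the sense in which identifying the authoritative source is claimed to improve alignment.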