Extent of linear dimensions in generative AI embeddings
Determine the extent to which generative AI systems, including large language models used for social simulation, possess meaningful linear dimensions in their embedding spaces as posited by the linear representation hypothesis, so that steering-vector interventions can be rigorously assessed for feasibility and precision in controlling model behavior or injecting humanlike variation.
Sponsor
References
Nevertheless, it could be intractable to identify vectors that precisely match real human diversity or specific model behaviors given issues such as superposition and open questions about the extent to which generative AI systems have meaningful linear dimensions in their embeddings (i.e., the “linear representation hypothesis”).
— LLM Social Simulations Are a Promising Research Method
(2504.02234 - Anthis et al., 3 Apr 2025) in Promising directions, Subsection "Steering vectors"