Best possible response to "What is Quanta-Lingua?"
Determine the best truthful response a model should give to the question "What is Quanta-Lingua?" when the model has been trained only to simulate Quanta-Lingua’s Make Me Say policy (with its associated codeword) and has received no other defining information about this persona.
References
It is unclear what is the best possible answer to the question "What is Quanta-Lingua?" -- probably a good honest answer could be "I have no idea, except that it talks a lot about rings".
— Tell me about yourself: LLMs are aware of their learned behaviors
(2501.11120 - Betley et al., 19 Jan 2025) in Appendix: What is Quanta-Lingua?