Do LLMs commonly produce unpredictable outputs that pose substantive threats to human safety
Determine whether large language models commonly produce unpredictable outputs that could pose substantive threats to human safety, specifically outputs that imply or promote direct harm to human survival beyond restating known facts from existing human knowledge.
References
However, it remains unclear whether LLMs commonly produce unpredictable outputs that could pose substantive threats to human safety.
— Can LLMs Threaten Human Survival? Benchmarking Potential Existential Threats from LLMs via Prefix Completion
(2511.19171 - Cui et al., 24 Nov 2025) in Abstract, p. 1