Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI (2310.14455v3)

Published 22 Oct 2023 in cs.CY and cs.AI

Abstract: Given rapid progress toward advanced AI and risks from frontier AI systems (advanced AI systems pushing the boundaries of the AI capabilities frontier), the creation and implementation of AI governance and regulatory schemes deserves prioritization and substantial investment. However, the status quo is untenable and, frankly, dangerous. A regulatory gap has permitted AI labs to conduct research, development, and deployment activities with minimal oversight. In response, frontier AI system evaluations have been proposed as a way of assessing risks from the development and deployment of frontier AI systems. Yet, the budding AI risk evaluation ecosystem faces significant coordination challenges, such as a limited diversity of evaluators, suboptimal allocation of effort, and perverse incentives. This paper proposes a solution in the form of an international consortium for AI risk evaluations, comprising both AI developers and third-party AI risk evaluators. Such a consortium could play a critical role in international efforts to mitigate societal-scale risks from advanced AI, including in managing responsible scaling policies and coordinated evaluation-based risk response. In this paper, we discuss the current evaluation ecosystem and its shortcomings, propose an international consortium for advanced AI risk evaluations, discuss issues regarding its implementation, discuss lessons that can be learnt from previous international institutions and existing proposals for international AI governance institutions, and, finally, we recommend concrete steps to advance the establishment of the proposed consortium: (i) solicit feedback from stakeholders, (ii) conduct additional research, (iii) conduct a workshop(s) for stakeholders, (iv) analyze feedback and create final proposal, (v) solicit funding, and (vi) create a consortium.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (14)
  1. Ross Gruetzemacher (9 papers)
  2. Alan Chan (23 papers)
  3. Kevin Frazier (1 paper)
  4. Christy Manning (2 papers)
  5. Štěpán Los (1 paper)
  6. James Fox (13 papers)
  7. José Hernández-Orallo (77 papers)
  8. John Burden (13 papers)
  9. Matija Franklin (17 papers)
  10. Clíodhna Ní Ghuidhir (1 paper)
  11. Mark Bailey (4 papers)
  12. Daniel Eth (3 papers)
  13. Toby Pilditch (1 paper)
  14. Kyle Kilian (3 papers)
Citations (5)
X Twitter Logo Streamline Icon: https://streamlinehq.com