
An AI System Evaluation Framework for Advancing AI Safety: Terminology, Taxonomy, Lifecycle Mapping (2404.05388v3)

Published 8 Apr 2024 in cs.SE, cs.AI, cs.CY, and cs.LG

Abstract: The advent of advanced AI underscores the urgent need for comprehensive safety evaluations, necessitating collaboration across communities (i.e., AI, software engineering, and governance). However, divergent practices and terminologies across these communities, combined with the complexity of AI systems (of which models are only a part) and environmental affordances (e.g., access to tools), obstruct effective communication and comprehensive evaluation. This paper proposes a framework for AI system evaluation comprising three components: 1) harmonised terminology to facilitate communication across communities involved in AI safety evaluation; 2) a taxonomy identifying essential elements for AI system evaluation; 3) a mapping between the AI lifecycle, stakeholders, and requisite evaluations for an accountable AI supply chain. This framework catalyses a deeper discourse on AI system evaluation beyond model-centric approaches.
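
The third component, the lifecycle mapping, is essentially a table relating lifecycle stages, stakeholders, and the evaluations each stakeholder owes at that stage. The sketch below is a minimal, hypothetical illustration of how such a mapping could be encoded; the stage names, stakeholder labels, and example evaluations are assumptions made for illustration and are not the paper's actual taxonomy or notation.

```python
# Hypothetical sketch of a lifecycle -> stakeholder -> evaluation mapping,
# in the spirit of the paper's third component. All names and rows below
# are illustrative assumptions, not the paper's own formalisation.
from dataclasses import dataclass
from enum import Enum


class LifecycleStage(Enum):
    DATA_COLLECTION = "data collection"
    MODEL_TRAINING = "model training"
    SYSTEM_INTEGRATION = "system integration"  # models are only one part of the system
    DEPLOYMENT = "deployment"
    OPERATION = "operation and monitoring"


@dataclass
class EvaluationEntry:
    stage: LifecycleStage
    stakeholder: str   # e.g. "data provider", "model developer", "system integrator"
    evaluation: str    # the requisite evaluation at this point in the supply chain


# A few illustrative rows; a real mapping would be considerably richer.
MAPPING = [
    EvaluationEntry(LifecycleStage.DATA_COLLECTION, "data provider",
                    "privacy and copyright compliance checks"),
    EvaluationEntry(LifecycleStage.MODEL_TRAINING, "model developer",
                    "benchmark and red-team evaluation of the model"),
    EvaluationEntry(LifecycleStage.SYSTEM_INTEGRATION, "system integrator",
                    "system-level safety evaluation, including tool access"),
    EvaluationEntry(LifecycleStage.OPERATION, "operator / auditor",
                    "continuous monitoring and incident reporting"),
]

for entry in MAPPING:
    print(f"{entry.stage.value}: {entry.stakeholder} -> {entry.evaluation}")
```

Encoding the mapping as structured rows rather than prose makes the accountability argument concrete: each evaluation is attached to a specific stakeholder at a specific stage, so gaps in the supply chain are visible as missing rows.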

Authors (4)
  1. Boming Xia (14 papers)
  2. Qinghua Lu (100 papers)
  3. Liming Zhu (101 papers)
  4. Zhenchang Xing (99 papers)