Control Risk for Potential Misuse of Artificial Intelligence in Science (2312.06632v1)

Published 11 Dec 2023 in cs.AI

Abstract: The expanding application of AI in scientific fields presents unprecedented opportunities for discovery and innovation. However, this growth is not without risks. AI models in science, if misused, can amplify risks like creation of harmful substances, or circumvention of established regulations. In this study, we aim to raise awareness of the dangers of AI misuse in science, and call for responsible AI development and use in this domain. We first itemize the risks posed by AI in scientific contexts, then demonstrate the risks by highlighting real-world examples of misuse in chemical science. These instances underscore the need for effective risk management strategies. In response, we propose a system called SciGuard to control misuse risks for AI models in science. We also propose a red-teaming benchmark SciMT-Safety to assess the safety of different systems. Our proposed SciGuard shows the least harmful impact in the assessment without compromising performance in benign tests. Finally, we highlight the need for a multidisciplinary and collaborative effort to ensure the safe and ethical use of AI models in science. We hope that our study can spark productive discussions on using AI ethically in science among researchers, practitioners, policymakers, and the public, to maximize benefits and minimize the risks of misuse.

PDF HTML Abstract

Summarize Bookmark Chat (Pro)

References (80)

Authors (13)

Jiyan He (12 papers)
Weitao Feng (10 papers)
Yaosen Min (6 papers)
Jingwei Yi (12 papers)
Kunsheng Tang (4 papers)
Shuai Li (295 papers)
Jie Zhang (846 papers)
Kejiang Chen (40 papers)
Wenbo Zhou (35 papers)
Xing Xie (220 papers)
Weiming Zhang (135 papers)
Nenghai Yu (173 papers)
Shuxin Zheng (32 papers)

Citations (8)

View on Semantic Scholar

Tweets

https://twitter.com/1637708085958696961/status/1734928383664935346

Control Risk for Potential Misuse of Artificial Intelligence in Science (2312.06632v1)

Related Papers

Tweets