Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
60 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
8 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

What does ChatGPT know about natural science and engineering? (2309.10048v1)

Published 18 Sep 2023 in cs.HC

Abstract: ChatGPT is a powerful LLM from OpenAI that is arguably able to comprehend and generate text. ChatGPT is expected to have a large impact on society, research, and education. An essential step to understand ChatGPT's expected impact is to study its domain-specific answering capabilities. Here, we perform a systematic empirical assessment of its abilities to answer questions across the natural science and engineering domains. We collected 594 questions from 198 faculty members across 5 faculties at Delft University of Technology. After collecting the answers from ChatGPT, the participants assessed the quality of the answers using a systematic scheme. Our results show that the answers from ChatGPT are on average perceived as ``mostly correct''. Two major trends are that the rating of the ChatGPT answers significantly decreases (i) as the complexity level of the question increases and (ii) as we evaluate skills beyond scientific knowledge, e.g., critical attitude.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Lukas Schulze Balhorn (9 papers)
  2. Jana M. Weber (5 papers)
  3. Stefan Buijsman (3 papers)
  4. Julian R. Hildebrandt (1 paper)
  5. Martina Ziefle (7 papers)
  6. Artur M. Schweidtmann (28 papers)
Citations (4)