Artificial Neural Nets and the Representation of Human Concepts (2312.05337v2)
Published 8 Dec 2023 in cs.LG and cs.AI

  Abstract: What do artificial neural networks (ANNs) learn? The ML community shares the narrative that ANNs must develop abstract human concepts to perform complex tasks. Some go even further and believe that these concepts are stored in individual units of the network. Based on current research, I systematically investigate the assumptions underlying this narrative. I conclude that ANNs are indeed capable of performing complex prediction tasks, and that they may learn human and non-human concepts to do so. However, evidence indicates that ANNs do not represent these concepts in individual units.
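The paper's central empirical question, whether a concept is localized in a single unit, is typically operationalized with unit-level probes: score each hidden unit's selectivity for a class, ablate the most selective unit, and check whether performance collapses. The sketch below illustrates that logic on a toy PyTorch model. It is a hedged illustration, not the paper's own method; the synthetic data, the two-layer network, and the particular selectivity index are all assumptions made for the example, in the spirit of the single-unit ablation and class-selectivity literature the paper surveys.

# Minimal sketch of a single-unit probe (illustrative, not the paper's method).
# Train a tiny classifier on synthetic data, find the hidden unit most
# selective for one class, then zero-ablate it and compare accuracy.
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy two-class data: class 1 has a shifted mean, so the task is learnable.
X = torch.randn(512, 16)
y = (torch.rand(512) < 0.5).long()
X[y == 1] += 1.0

model = nn.Sequential(nn.Linear(16, 8), nn.ReLU(), nn.Linear(8, 2))
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
loss_fn = nn.CrossEntropyLoss()
for _ in range(300):
    opt.zero_grad()
    loss_fn(model(X), y).backward()
    opt.step()

with torch.no_grad():
    # Class selectivity per hidden unit: normalized gap between the mean
    # activation on class 1 and on class 0 (activations are >= 0 after ReLU).
    h = torch.relu(model[0](X))                 # hidden activations, (512, 8)
    mu0, mu1 = h[y == 0].mean(0), h[y == 1].mean(0)
    selectivity = (mu1 - mu0).abs() / (mu1 + mu0 + 1e-8)
    unit = selectivity.argmax().item()          # most class-selective unit

    def accuracy(ablate=None):
        z = torch.relu(model[0](X))
        if ablate is not None:
            z[:, ablate] = 0.0                  # zero-ablate one hidden unit
        return (model[2](z).argmax(1) == y).float().mean().item()

    print(f"most selective unit: {unit} (selectivity {selectivity[unit].item():.2f})")
    print(f"accuracy intact:  {accuracy():.3f}")
    print(f"accuracy ablated: {accuracy(ablate=unit):.3f}")

If accuracy barely moves when the most selective unit is zeroed, the concept is plausibly encoded in a distributed fashion across many units rather than in one; that is the pattern the paper argues the current evidence favors.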
Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Belkin, M. 2021. Fit without fear: remarkable mathematical phenomena of deep learning through the prism of interpolation. Acta Numerica 30: 203–248 . Belkin et al. (2019) Belkin, M., D. Hsu, S. Ma, and S. Mandal. 2019. Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proceedings of the National Academy of Sciences 116(32): 15849–15854 . Bird and Tobin (2023) Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. 
Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. 
Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Belkin, M., D. Hsu, S. Ma, and S. Mandal. 2019. 
Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proceedings of the National Academy of Sciences 116(32): 15849–15854 . Bird and Tobin (2023) Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . 
Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 
2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. 
Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. 
(2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. 
Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. 
and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. 
Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . 
Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. 
Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. 
Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. 
Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. 
In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. 
Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . 
Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. 
Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. 
Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. 
(2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. 
Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. 
Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. 
Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. 
Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . 
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . 
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 
385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. 
arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. 
The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. 
Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . 
Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . 
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. 
arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. 
Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. 
Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? 
In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. 
MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . 
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. 
Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. 
Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 
270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. 
Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. 
Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- The Routledge handbook of philosophy of animal minds. Taylor & Francis. Ballet et al. (2019) Ballet, V., X. Renard, J. Aigrain, T. Laugel, P. Frossard, and M. Detyniecki. 2019. Imperceptible adversarial attacks on tabular data. arXiv preprint arXiv:1911.03274 . Bau et al. (2017) Bau, D., B. Zhou, A. Khosla, A. Oliva, and A. Torralba 2017. Network dissection: Quantifying interpretability of deep visual representations. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 6541–6549. Bau et al. (2018) Bau, D., J.Y. Zhu, H. Strobelt, B. Zhou, J.B. Tenenbaum, W.T. Freeman, and A. Torralba 2018. Gan dissection: Visualizing and understanding generative adversarial networks. In International Conference on Learning Representations. Belkin (2021) Belkin, M. 2021. Fit without fear: remarkable mathematical phenomena of deep learning through the prism of interpolation. Acta Numerica 30: 203–248 . Belkin et al. (2019) Belkin, M., D. Hsu, S. Ma, and S. Mandal. 2019. Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proceedings of the National Academy of Sciences 116(32): 15849–15854 . Bird and Tobin (2023) Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. 
Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. 
Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ballet, V., X. Renard, J. Aigrain, T. Laugel, P. Frossard, and M. Detyniecki. 2019. Imperceptible adversarial attacks on tabular data. arXiv preprint arXiv:1911.03274 . Bau et al. (2017) Bau, D., B. Zhou, A. Khosla, A. Oliva, and A. Torralba 2017. Network dissection: Quantifying interpretability of deep visual representations. 
In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 6541–6549. Bau et al. (2018) Bau, D., J.Y. Zhu, H. Strobelt, B. Zhou, J.B. Tenenbaum, W.T. Freeman, and A. Torralba 2018. Gan dissection: Visualizing and understanding generative adversarial networks. In International Conference on Learning Representations. Belkin (2021) Belkin, M. 2021. Fit without fear: remarkable mathematical phenomena of deep learning through the prism of interpolation. Acta Numerica 30: 203–248 . Belkin et al. (2019) Belkin, M., D. Hsu, S. Ma, and S. Mandal. 2019. Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proceedings of the National Academy of Sciences 116(32): 15849–15854 . Bird and Tobin (2023) Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. 
Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . 
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Bau, D., B. Zhou, A. Khosla, A. Oliva, and A. Torralba 2017. Network dissection: Quantifying interpretability of deep visual representations. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 6541–6549. Bau et al. (2018) Bau, D., J.Y. Zhu, H. Strobelt, B. Zhou, J.B. Tenenbaum, W.T. Freeman, and A. Torralba 2018. Gan dissection: Visualizing and understanding generative adversarial networks. In International Conference on Learning Representations. Belkin (2021) Belkin, M. 2021. Fit without fear: remarkable mathematical phenomena of deep learning through the prism of interpolation. Acta Numerica 30: 203–248 . Belkin et al. (2019) Belkin, M., D. Hsu, S. Ma, and S. Mandal. 2019. 
Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proceedings of the National Academy of Sciences 116(32): 15849–15854 . Bird and Tobin (2023) Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . 
Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Bau, D., J.Y. Zhu, H. Strobelt, B. Zhou, J.B. Tenenbaum, W.T. Freeman, and A. Torralba 2018. Gan dissection: Visualizing and understanding generative adversarial networks. In International Conference on Learning Representations. Belkin (2021) Belkin, M. 2021. Fit without fear: remarkable mathematical phenomena of deep learning through the prism of interpolation. Acta Numerica 30: 203–248 . Belkin et al. (2019) Belkin, M., D. Hsu, S. Ma, and S. Mandal. 2019. Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proceedings of the National Academy of Sciences 116(32): 15849–15854 . Bird and Tobin (2023) Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. 
Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. 
Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. 
(2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . 
Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. 
Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. 
Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. 
Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. 
Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Del Pinal, G. 2016. 
Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. 
Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. 
Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. 
Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. 
Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. 
Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. 
Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. 
Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199. Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30. Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14. Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28. Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755. Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265. Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115. Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38. Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198. Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
(2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. 
Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. 
Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. 
Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. 
Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . 
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . 
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 
385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. 
arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. 
The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. 
Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . 
Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . 
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. 
arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. 
Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. 
Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? 
In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. 
MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. 
Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634. Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199. Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30. Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14. Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28. Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755. Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265. Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115. Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38. Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198. Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. 
Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. 
Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 
270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. 
Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. 
Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. 
Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. 
Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. 
On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. 
MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. 
(2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. 
Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. 
König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. 
Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. 
(2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . 
Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. 
Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . 
Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. 
Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . 
Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. 
and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. 
Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. 
MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. 
Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. 
Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. 
Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. 
(2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . 
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. 
Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . 
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. 
Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. 
Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 
270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Belkin, M., D. Hsu, S. Ma, and S. Mandal. 2019. Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proceedings of the National Academy of Sciences 116(32): 15849–15854 . Bird and Tobin (2023) Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. 
Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 
2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. 
Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 
385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. 
arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. 
The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. 
Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. 
Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 
2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. 
Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. 
Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. 
Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . 
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . 
Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. 
Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. 
Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. 
Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. 
Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. 
In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. 
A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. 
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. 
Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. 
In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. 
nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. 
The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. 
Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. 
Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. 
Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? 
a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. 
Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265. Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115. Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38. Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198. Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. 
Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. 
Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. 
Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 
2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. 
Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . 
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 
2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. 
Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. 
Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. 
Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.), ed. Zalta, E.N. Metaphysics Research Lab, Stanford University.
Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for COVID-19. Nature Communications 11(1): 5033.
Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673.
Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel. 2018. ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations.
Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT Press.
Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572.
Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition.
Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207.
Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of Learning and Motivation 46: 79–113.
Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012. Typicality and compositionality: The logic of combining vague concepts, In The Oxford Handbook of Compositionality. Oxford University Press. doi:10.1093/oxfordhb/9780199541072.013.0018.
Hastie et al. (2009) Hastie, T., R. Tibshirani, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer.
Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural Networks 2(5): 359–366.
Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32.
Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press.
Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (TCAV). In International Conference on Machine Learning, pp. 2668–2677. PMLR.
Kim et al. (2021) Kim, E., J. Lee, and J. Choo. 2021. BiaSwap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001.
Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang. 2020. Concept bottleneck models. In International Conference on Machine Learning, pp. 5338–5348. PMLR.
König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup. 2023. Improvement-focused causal recourse (ICR). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855.
Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le. 2019. Do better ImageNet models transfer better? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2661–2671.
Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217.
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016.
Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440.
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos. 2020c. Selectivity considered harmful: Evaluating the causal impact of class selectivity in DNNs. In International Conference on Learning Representations.
LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.
Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57.
Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in GPT. Advances in Neural Information Processing Systems 35: 17359–17372.
Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick. 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001.
Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press.
Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: A survey. Big Data Research 23: 100178.
Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. 
arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. 
(2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . 
Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. 
Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. 
Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 
2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. 
Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. 
Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. 
Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . 
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . 
Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. 
Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. 
Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. 
Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . 
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. 
Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. 
Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. 
Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. 
Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. 
The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. 
Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. 
Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. 
(2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . 
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. 
Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . 
Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 
10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . 
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. 
(2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. 
Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. 
Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. 
Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. 
Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. 
Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. 
Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
- Belkin, M. 2021. Fit without fear: remarkable mathematical phenomena of deep learning through the prism of interpolation. Acta Numerica 30: 203–248 . Belkin et al. (2019) Belkin, M., D. Hsu, S. Ma, and S. Mandal. 2019. Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proceedings of the National Academy of Sciences 116(32): 15849–15854 . Bird and Tobin (2023) Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. 
(2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Belkin, M., D. Hsu, S. Ma, and S. Mandal. 2019. Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proceedings of the National Academy of Sciences 116(32): 15849–15854 . Bird and Tobin (2023) Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . 
Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. 
Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. 
Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. 
Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . 
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. 
Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. 
The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. 
Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. 
Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 
2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. 
On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. 
MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. 
Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. 
Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. 
Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. 
Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. 
Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. 
In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. 
Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. 
Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. 
(2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in Neural Information Processing Systems 30. Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14. Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28. Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755. Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265. Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115. Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38. Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198. Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). 
In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . 
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. 
Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. 
Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. 
Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. 
Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . 
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. 
Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. 
Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . 
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . 
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . 
Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. 
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. 
Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 
2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. 
Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. 
(2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. 
Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Bird, A. and E. Tobin. 2023. Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 
2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. 
Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. 
(2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. 
Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . 
Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. 
and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. 
Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . 
Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. 
Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. 
arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. 
Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. 
Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. 
In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. 
Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. 
(2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. 
Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. 
Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? 
In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. 
MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. 
Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. 
Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. 
Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198. Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. 
Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. 
Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 
270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. 
Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. 
Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Natural Kinds, In The Stanford Encyclopedia of Philosophy (Spring 2023 ed.)., eds. Zalta, E.N. and U. Nodelman. Metaphysics Research Lab, Stanford University. Bowers et al. (2022) Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. 
Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. 
Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Bowers, J.S., G. Malhotra, M. Dujmović, M.L. Montero, C. Tsvetkov, V. Biscione, G. Puebla, F. Adolfi, J.E. Hummel, R.F. Heaton, et al. 2022. Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. 
Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. 
Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. 
(2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444. Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6). Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178. Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. “Why should I trust you?”: Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634. Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199. Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30. Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14. Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28. Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755. Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265. Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115. Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38. Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too?
a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . 
Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. 
Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . 
Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. 
Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . 
Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. 
Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. 
Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. 
Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 
2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. 
Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. 
Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. 
Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . 
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. 
(2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. 
Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. 
Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . 
Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . 
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. 
Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . 
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . 
Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. 
Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. 
Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. 
Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. 
(2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 
2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. 
Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. 
Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. 
Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. 
Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. 
Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. 
Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. 
Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. 
Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . 
Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. 
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. 
Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Deep problems with neural network models of human vision. Behavioral and Brain Sciences: 1–74 . Brown et al. (2017) Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 
385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. 
arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. 
The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Brown, T.B., D. Mané, A. Roy, M. Abadi, and J. Gilmer. 2017. Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. 
Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. 
Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. 
Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. 
Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. 
In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. 
nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. 
Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. 
Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. 
(2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. 
Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. 
(2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. 
Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 
385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. 
arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. 
The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. 
Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? 
In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. 
Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. 
Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. 
Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. 
Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. 
(2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. 
Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. 
Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. 
Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. 
Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . 
Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. 
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. 
Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. 
Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. 
Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. 
arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. 
Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. 
Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. 
Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. 
Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
- Adversarial patch. arXiv preprint arXiv:1712.09665 . Buckner (2018) Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 
2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. 
Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2018. Empiricism without magic: Transformational abstraction in deep convolutional neural networks. Synthese 195(12): 5339–5372 . Buckner (2020) Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. 
(2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Buckner, C. 2020. Understanding adversarial examples requires a theory of artefacts for deep learning. Nature Machine Intelligence 2(12): 731–736 . Del Pinal (2016) Del Pinal, G. 2016. Prototypes as compositional components of concepts. Synthese 193: 2899–2927 . Donnelly and Roegiest (2019) Donnelly, J. and A. Roegiest 2019. On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. 
arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. 
Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. 
Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115. Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38. Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198. Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. 
Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. 
Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. 
Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. 
Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. 
In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. 
A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. 
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. 
Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. 
In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. 
nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. 
(2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. 
Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. 
Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. 
Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. 
Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. 
Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. 
Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. 
Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. 
(2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. 
Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. 
Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. 
arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. 
Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. 
Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. 
Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. 
(2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. 
(2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? 
In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. 
Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. 
Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . 
König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. 
Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. 
Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. 
The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 
2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . 
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. 
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . 
Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. 
Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 
10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. 
Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. 
Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. 
The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. 
Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. 
(2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. 
Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. 
Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. 
Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. 
Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. 
(2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . 
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. 
nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. 
Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. 
Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . 
Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. 
Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. 
Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. 
Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. 
Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. 
Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . 
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. 
Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. 
Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. 
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. 
Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. 
Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. 
Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. 
Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . 
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . 
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. 
Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. 
In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. 
Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. 
(2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. 
Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. 
Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. 
Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. 
MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. 
Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. 
Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. 
Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. 
(2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. 
Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. 
Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 
270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. 
Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. 
Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- On interpretability and feature representations: an analysis of the sentiment neuron. In Advances in Information Retrieval: 41st European Conference on IR Research, ECIR 2019, Cologne, Germany, April 14–18, 2019, Proceedings, Part I 41, pp. 795–802. Springer. Duede (2023) Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. 
MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Duede, E. 2023. The representational status of deep learning models. arXiv preprint arXiv:2303.12032 . Erickson et al. (2017) Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 
385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. 
arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. 
The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Erickson, B.J., P. Korfiatis, Z. Akkus, and T.L. Kline. 2017. Machine learning for medical imaging. Radiographics 37(2): 505–515 . Freiesleben (2022) Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. 
Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572.
Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition.
Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207.
Hampton, J.A. 2006. Concepts as prototypes. Psychology of Learning and Motivation 46: 79–113.
Hampton, J.A. and M.L. Jönsson. 2012. Typicality and compositionality: The logic of combining vague concepts. In The Oxford Handbook of Compositionality. Oxford University Press. doi:10.1093/oxfordhb/9780199541072.013.0018.
Hastie, T., R. Tibshirani, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer.
Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural Networks 2(5): 359–366.
Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32.
Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press.
Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (TCAV). In International Conference on Machine Learning, pp. 2668–2677. PMLR.
Kim, E., J. Lee, and J. Choo. 2021. BiaSwap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001.
Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang. 2020. Concept bottleneck models. In International Conference on Machine Learning, pp. 5338–5348. PMLR.
König, G., T. Freiesleben, and M. Grosse-Wentrup. 2023. Improvement-focused causal recourse (ICR). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855.
Kornblith, S., J. Shlens, and Q.V. Le. 2019. Do better ImageNet models transfer better? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2661–2671.
Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2–3): 217.
Leavitt, M.L. and A.S. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016.
Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440.
Leavitt, M.L. and A.S. Morcos. 2020c. Selectivity considered harmful: Evaluating the causal impact of class selectivity in DNNs. In International Conference on Learning Representations.
LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.
Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57.
Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in GPT. Advances in Neural Information Processing Systems 35: 17359–17372.
Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick. 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024.001.
Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
Peacocke, C. 1992. A study of concepts. MIT Press.
Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: A survey. Big Data Research 23: 100178.
Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?": Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? 
In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. 
Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. 
Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. 
Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. 
Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. 
(2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. 
Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. 
Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32.
Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press.
Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (TCAV). In International Conference on Machine Learning, pp. 2668–2677. PMLR.
Kim et al. (2021) Kim, E., J. Lee, and J. Choo. 2021. BiaSwap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001.
Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang. 2020. Concept bottleneck models. In International Conference on Machine Learning, pp. 5338–5348. PMLR.
König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup. 2023. Improvement-focused causal recourse (ICR). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855.
Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le. 2019. Do better ImageNet models transfer better? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2661–2671.
Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2–3): 217.
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016.
Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440.
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos. 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in DNNs. In International Conference on Learning Representations.
LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.
Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57.
Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in GPT. Advances in Neural Information Processing Systems 35: 17359–17372.
Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick. 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024.001.
Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press.
Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178.
Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. 
Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . 
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . 
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . 
Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. 
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. 
Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. 
Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. 
Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. 
arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. 
Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. 
Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. 
Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. 
Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
- Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.), ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature Communications 11(1): 5033. Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673. Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT Press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572. Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207. Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of Learning and Motivation 46: 79–113. Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012. Typicality and Compositionality: The Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural Networks 2(5): 359–366. Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32. Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677.
PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980. Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217. Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016. Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440. Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444. Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57. Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30. Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94. McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552. Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372. Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570. Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662. Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024.001. Olah et al.
(2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7. Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19. Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444. Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6). Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178. Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634. Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30. Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14. Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28. Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755. Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265. Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021.
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115. Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38. Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? A comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198. Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891.
Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. 
Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. 
Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. 
(2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 
385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. 
arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. 
The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. 
and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. 
Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. 
and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. 
Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 
10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. 
A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. 
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. 
Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. 
(2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. 
Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. 
Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? 
a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. 
Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. 
arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. 
Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. 
Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. 
Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? 
a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . 
Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. 
Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. 
Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. 
(2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. 
John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. 
Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. 
Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. 
(2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. 
Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? A comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198.
Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.), ed. Zalta, E.N. Metaphysics Research Lab, Stanford University.
Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature Communications 11(1): 5033.
Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673.
Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel. 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations.
Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT Press.
Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572.
Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition.
Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207.
Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of Learning and Motivation 46: 79–113.
Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012. Typicality and compositionality: The logic of combining vague concepts, In The Oxford Handbook of Compositionality. Oxford University Press. doi:10.1093/oxfordhb/9780199541072.013.0018.
Hastie et al. (2009) Hastie, T., R. Tibshirani, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer.
Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural Networks 2(5): 359–366.
Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32.
Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press.
Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International Conference on Machine Learning, pp. 2668–2677. PMLR.
Kim et al. (2021) Kim, E., J. Lee, and J. Choo. 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001.
Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang. 2020. Concept bottleneck models. In International Conference on Machine Learning, pp. 5338–5348. PMLR.
König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup. 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855.
Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le. 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2661–2671.
Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2–3): 217.
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016.
Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440.
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos. 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations.
LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.
Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57.
Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372.
Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick. 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001.
Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press.
Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178.
Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?": Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T. 2022. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines 32(1): 77–109 . Freiesleben and Grote (2023) Freiesleben, T. and T. Grote. 2023. Beyond generalization: a theory of robustness in machine learning. Synthese 202(4): 109 . Freiesleben et al. (2022) Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. 
In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. 
nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. 
The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. 
Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. 
Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. 
Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? 
a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. 
Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. 
Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. 
Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. 
Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 
2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. 
Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . 
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 
2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. 
Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. 
Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. 
Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178.
Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?": Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755.
Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Freiesleben, T., G. König, C. Molnar, and A. Tejero-Cantero. 2022. Scientific inference with interpretable machine learning: Analyzing models to learn about real-world phenomena. arXiv preprint arXiv:2206.05487 . Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. 
(2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. 
Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. 
In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. 
A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. 
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. 
König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. 
Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. 
arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. 
In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. 
Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. 
Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. 
Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. 
Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. 
Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. 
Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. 
Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. 
Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. 
Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. 
In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. 
A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. 
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. 
(2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. 
Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. 
A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. 
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. 
Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . 
Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. 
Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. 
Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. 
Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. 
Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. 
Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . 
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . 
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. 
arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. 
Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. 
Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. 
Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. 
Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. 
Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. 
Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. 
Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . 
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . 
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 
385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. 
arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. 
The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. 
Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. 
Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. 
(2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . 
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. 
Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . 
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. 
Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. 
Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 
Frigg and Nguyen (2021) Frigg, R. and J. Nguyen. 2021. Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.), ed. Zalta, E.N. Metaphysics Research Lab, Stanford University.
Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for COVID-19. Nature Communications 11(1): 5033.
Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673.
Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel. 2018. ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations.
Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT Press.
Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572.
Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition.
Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207.
Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of Learning and Motivation 46: 79–113.
Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012. Typicality and compositionality: The logic of combining vague concepts. In The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018.
Hastie et al. (2009) Hastie, T., R. Tibshirani, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer.
Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural Networks 2(5): 359–366.
Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32.
Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press.
Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (TCAV). In International Conference on Machine Learning, pp. 2668–2677. PMLR.
Kim et al. (2021) Kim, E., J. Lee, and J. Choo. 2021. BiaSwap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001.
Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang. 2020. Concept bottleneck models. In International Conference on Machine Learning, pp. 5338–5348. PMLR.
König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup. 2023. Improvement-focused causal recourse (ICR). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855.
Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le. 2019. Do better ImageNet models transfer better? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2661–2671.
Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2–3): 217.
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016.
Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440.
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos. 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in DNNs. In International Conference on Learning Representations.
LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.
Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57.
Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in GPT. Advances in Neural Information Processing Systems 35: 17359–17372.
Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick. 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001.
Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press.
Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178.
Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Scientific Representation, In The Stanford Encyclopedia of Philosophy (Winter 2021 ed.)., ed. Zalta, E.N. Metaphysics Research Lab, Stanford University. Gao et al. (2020) Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. 
Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gao, Y., G.Y. Cai, W. Fang, H.Y. Li, S.Y. Wang, L. Chen, Y. Yu, D. Liu, S. Xu, P.F. Cui, et al. 2020. Machine learning based early warning system enables accurate mortality risk prediction for covid-19. 
Nature communications 11(1): 5033 . Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . 
Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. 
In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. 
Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. 
(2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 
14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. 
The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. 
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. 
Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. 
Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 
11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. 
In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. 
Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? 
a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . 
Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. 
In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. 
Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. 
Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 
2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . 
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . 
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. 
Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. 
(2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 
2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. 
arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. 
The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. 
Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. 
Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. 
Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. 
In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. 
A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. 
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. 
Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. 
Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 
11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. 
In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. 
Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. 
Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. 
In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. 
Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. 
Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 
2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . 
- Machine learning based early warning system enables accurate mortality risk prediction for COVID-19. Nature Communications 11(1): 5033.
Geirhos et al. (2020) Geirhos, R., J.H. Jacobsen, C. Michaelis, R. Zemel, W. Brendel, M. Bethge, and F.A. Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673.
Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel. 2018. ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations.
Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT Press.
Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572.
Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition.
Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207.
Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of Learning and Motivation 46: 79–113.
Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012. Typicality and compositionality: the logic of combining vague concepts. In The Oxford Handbook of Compositionality. Oxford University Press. doi:10.1093/oxfordhb/9780199541072.013.0018.
Hastie et al. (2009) Hastie, T., R. Tibshirani, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer.
Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural Networks 2(5): 359–366.
Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32.
Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press.
Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (TCAV). In International Conference on Machine Learning, pp. 2668–2677. PMLR.
Kim et al. (2021) Kim, E., J. Lee, and J. Choo. 2021. BiaSwap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001.
Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang. 2020. Concept bottleneck models. In International Conference on Machine Learning, pp. 5338–5348. PMLR.
König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup. 2023. Improvement-focused causal recourse (ICR). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855.
Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le. 2019. Do better ImageNet models transfer better? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2661–2671.
Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217.
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016.
Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440.
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos. 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in DNNs. In International Conference on Learning Representations.
LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.
Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57.
Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in GPT. Advances in Neural Information Processing Systems 35: 17359–17372.
Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick. 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001.
Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press.
Pearl (2019) Pearl, J. 2019.
The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . 
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. 
Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. 
(2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 
2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. 
arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. 
The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. 
Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. 
Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. 
Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. 
In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. 
A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. 
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. 
Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. 
Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 
11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. 
In International Conference on Machine Learning, pp. 5338–5348. PMLR.
König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup. 2023. Improvement-focused causal recourse (ICR). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855.
Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le. 2019. Do better ImageNet models transfer better? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2661–2671.
Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2–3): 217.
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016.
Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440.
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos. 2020c. Selectivity considered harmful: Evaluating the causal impact of class selectivity in DNNs. In International Conference on Learning Representations.
LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.
Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57.
Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in GPT. Advances in Neural Information Processing Systems 35: 17359–17372.
Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick. 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001.
Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press.
Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: A survey. Big Data Research 23: 100178.
Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. 
Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. 
Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. 
In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. 
Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. 
Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 
2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . 
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. 
Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . 
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. 
Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11): 665–673 . Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. 
arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. 
Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. Imagenet-trained cnns are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations. Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. 
arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. 
Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. 
Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . 
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. 
Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. 
Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. 
Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. 
(2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. 
Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. 
Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 
270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. 
Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Geirhos et al. (2018) Geirhos, R., P. Rubisch, C. Michaelis, M. Bethge, F.A. Wichmann, and W. Brendel 2018. ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations.
Goodfellow et al. (2016) Goodfellow, I., Y. Bengio, and A. Courville. 2016. Deep learning. MIT Press.
Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572.
Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition.
Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207.
Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of Learning and Motivation 46: 79–113.
Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012. Typicality and compositionality: The logic of combining vague concepts. In The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018.
Hastie et al. (2009) Hastie, T., R. Tibshirani, and J.H. Friedman. 2009. The elements of statistical learning: Data mining, inference, and prediction, Volume 2. Springer.
Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural Networks 2(5): 359–366.
Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32.
Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press.
Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (TCAV). In International Conference on Machine Learning, pp. 2668–2677. PMLR.
Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. BiaSwap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001.
Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International Conference on Machine Learning, pp. 5338–5348. PMLR.
König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (ICR). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855.
Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better ImageNet models transfer better? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2661–2671.
Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2–3): 217.
Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016.
Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440.
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: Evaluating the causal impact of class selectivity in DNNs. In International Conference on Learning Representations.
LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.
Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57.
Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in GPT. Advances in Neural Information Processing Systems 35: 17359–17372.
Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024.001.
Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press.
Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: A survey. Big Data Research 23: 100178.
Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E.
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . 
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . 
Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. 
Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. 
Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. 
(2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. 
Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . 
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 
2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. 
Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. 
Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. 
Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . 
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. 
Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. 
Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. 
(2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. 
Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Deep learning. MIT press. Goodfellow et al. (2014) Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Goodfellow, I.J., J. Shlens, and C. Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 . Grujicic (2023) Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. 
Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. 
Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. 
Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Grujicic, B. 2023. Deep convolutional neural networks are not mechanistic explanations of object recognition. Gurnee and Tegmark (2023) Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? 
In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207 . Hampton (2006) Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. 
and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. 
Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. 
arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. 
Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . 
Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . 
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. 
Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. 
Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. 
Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . 
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. 
In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. 
Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. 
Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. 
Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. 
Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. 
(2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. 
(2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. 
Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. 
Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. 
Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . 
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. 
Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. 
Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. 
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. 
Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. 
Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. 
Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. 
Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . 
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . 
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. 
arXiv preprint arXiv:1806.02891 .
arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. 
In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. 
Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. 
Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. 
Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. 
Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. 
Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. 
Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. 
Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. 
Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. 
arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. 
Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. 
Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. 
Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. 
Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 
2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. 
Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. 
(2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. 
Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. 2006. Concepts as prototypes. Psychology of learning and motivation 46: 79–113 . Hampton and Jönsson (2012) Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. 
(2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. 
Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . 
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. 
Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. 
On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . 
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. 
On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . 
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. 
Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. 
Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. 
In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. 
(2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. 
John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. 
Gurnee, W. and M. Tegmark. 2023. Language models represent space and time. arXiv preprint arXiv:2310.02207.
Hampton, J.A. 2006. Concepts as prototypes. Psychology of Learning and Motivation 46: 79–113.
Hampton, J.A. and M.L. Jönsson. 2012. Typicality and compositionality: The logic of combining vague concepts. In The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018.
Hastie, T., R. Tibshirani, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer.
Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural Networks 2(5): 359–366.
Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32.
Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press.
Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (TCAV). In International Conference on Machine Learning, pp. 2668–2677. PMLR.
Kim, E., J. Lee, and J. Choo. 2021. BiaSwap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001.
Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang. 2020. Concept bottleneck models. In International Conference on Machine Learning, pp. 5338–5348. PMLR.
König, G., T. Freiesleben, and M. Grosse-Wentrup. 2023. Improvement-focused causal recourse (ICR). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855.
Kornblith, S., J. Shlens, and Q.V. Le. 2019. Do better ImageNet models transfer better? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2661–2671.
Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217.
Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016.
Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440.
Leavitt, M.L. and A.S. Morcos. 2020c. Selectivity considered harmful: Evaluating the causal impact of class selectivity in DNNs. In International Conference on Learning Representations.
LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.
Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57.
Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in GPT. Advances in Neural Information Processing Systems 35: 17359–17372.
Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick. 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024.001.
Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
Peacocke, C. 1992. A study of concepts. MIT Press.
Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: A survey. Big Data Research 23: 100178.
Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hampton, J.A. and M.L. Jönsson. 2012, 02. 385 Typicality and Composition a Lity: the Logic of Combining Vague Concepts, The Oxford Handbook of Compositionality. Oxford University Press. 10.1093/oxfordhb/9780199541072.013.0018. Hastie et al. (2009) Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. 
Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. 
Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hastie, T., R. Tibshirani, J.H. Friedman, and J.H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction, Volume 2. Springer. Hornik et al. (1989) Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. 
In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . 
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. 
Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198. Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. 
Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. 
Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. 
Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. 
(2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 
1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. 
Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. 
Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. 
Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 
2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . 
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. 
Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. 
Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. 
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. 
Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. 
Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. 
Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. 
Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . 
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . 
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
(2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. 
Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. 
The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. 
Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. 
Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. 
Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. 
Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. 
(2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 
2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. 
MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. 
A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. 
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Hornik, K., M. Stinchcombe, and H. White. 1989. Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. 
Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. 
Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. 
Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. 
Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. 
Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. 
Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. 
Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. 
Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. 
Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. 
Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. 
Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. 
Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. 
Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. 
Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. 
Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . 
Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. 
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. 
Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. 
Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 
11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. 
In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. 
Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? 
a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . 
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. 
Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 
2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . 
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. 
Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . 
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. 
Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Multilayer feedforward networks are universal approximators. Neural networks 2(5): 359–366 . Ilyas et al. (2019) Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. 
(2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . 
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ilyas, A., S. Santurkar, D. Tsipras, L. Engstrom, B. Tran, and A. Madry. 2019. Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. 
nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. 
Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. 
Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . 
Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. 
Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. 
Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. 
Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. 
Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. 
Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . 
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. 
Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. 
Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. 
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. 
Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. 
Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. 
Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. 
Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . 
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . 
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. 
arXiv preprint arXiv:1806.02891 .
- Adversarial examples are not bugs, they are features. Advances in Neural Information Processing Systems 32 . Keil (1992) Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. 
Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. 
(2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Keil, F.C. 1992. Concepts, kinds, and cognitive development. MIT Press. Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. 
Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. 
Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, et al. 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav). In International conference on machine learning, pp. 2668–2677. PMLR. Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. 
and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. 
Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. 
Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 
270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kim, E., J. Lee, and J. Choo 2021. Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001. Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. 
In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 . Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. 
Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International conference on machine learning, pp. 5338–5348. PMLR. König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? 
In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. 
Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. 
Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. 
On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. 
Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. 
Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. 
Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. 
Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. 
arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. 
Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. 
Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. 
Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. 
Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. 
- Kim et al. (2018) Kim, B., M. Wattenberg, J. Gilmer, C. Cai, J. Wexler, F. Viegas, and R. Sayres 2018. Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (TCAV). In International Conference on Machine Learning, pp. 2668–2677. PMLR.
- Kim et al. (2021) Kim, E., J. Lee, and J. Choo 2021. BiaSwap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001.
- Kingma and Ba (2014) Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
- Koh et al. (2020) Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang 2020. Concept bottleneck models. In International Conference on Machine Learning, pp. 5338–5348. PMLR.
- König et al. (2023) König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (ICR). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855.
- Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better ImageNet models transfer better? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2661–2671.
- Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2–3): 217.
- Leavitt and Morcos (2020a) Leavitt, M.L. and A.S. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016.
- Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440.
- Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in DNNs. In International Conference on Learning Representations.
- LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.
- Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57.
- Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
- Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
- McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
- Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in GPT. Advances in Neural Information Processing Systems 35: 17359–17372.
- Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
- Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
- Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
- Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001.
- Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
- Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press.
- Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
- Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
- Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
- Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
- Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178.
- Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
- Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
- Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
- Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
- Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
- Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
- Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
- Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
- Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
- Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
- Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
- Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
- Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755.
- Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
- Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
- Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
- Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
- Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. 
(2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . 
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. 
Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . 
Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. 
(2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. 
The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. 
Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . 
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
- Biaswap: Removing dataset bias with bias-tailored swapping augmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 14992–15001.
- Kingma, D.P. and J. Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
- Koh, P.W., T. Nguyen, Y.S. Tang, S. Mussmann, E. Pierson, B. Kim, and P. Liang. 2020. Concept bottleneck models. In International Conference on Machine Learning, pp. 5338–5348. PMLR.
- König, G., T. Freiesleben, and M. Grosse-Wentrup. 2023. Improvement-focused causal recourse (ICR). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855.
- Kornblith, S., J. Shlens, and Q.V. Le. 2019. Do better ImageNet models transfer better? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2661–2671.
- Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2–3): 217.
- Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016.
- Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440.
- Leavitt, M.L. and A.S. Morcos. 2020c. Selectivity considered harmful: Evaluating the causal impact of class selectivity in DNNs. In International Conference on Learning Representations.
- LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.
- Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57.
- Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
- Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
- McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
- Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in GPT. Advances in Neural Information Processing Systems 35: 17359–17372.
- Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick. 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
- Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
- Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
- Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024-001.
- Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
- Peacocke, C. 1992. A study of concepts. MIT Press.
- Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
- Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
- Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
- Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
- Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: A survey. Big Data Research 23: 100178.
- Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
- Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
- Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
- Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
- Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
- Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
- Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
- Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
- Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
- Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
- Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
- Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
- Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
- Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
- Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
- Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
- Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
- Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. 
Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. 
Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . 
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 
2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. 
Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. 
Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. 
Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
(2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . König, G., T. Freiesleben, and M. Grosse-Wentrup 2023. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, Volume 37, pp. 11847–11855. Kornblith et al. (2019) Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Kornblith, S., J. Shlens, and Q.V. Le 2019. Do better imagenet models transfer better? In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 2661–2671. Lalumera (2010) Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . 
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. 
Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. 
(2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . 
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. 
Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. 
Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. 
Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. 
Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 
270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. 
arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. 
Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. 
The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. 
Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. 
On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . 
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. 
The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. 
Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . 
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. 
Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. 
Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. 
In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. 
Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. 
Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 
2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . 
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Zhou, B., Y. Sun, D. Bau, and A.
Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. 
Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. 
Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. 
Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. 
arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. 
Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. 
Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. 
Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. 
Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. 
Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. 
Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. 
arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. 
Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. 
Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. 
Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. 
Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. 
Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Lalumera, E. 2010. Concepts are a functional kind. Behavioral and Brain Sciences 33(2-3): 217 . Leavitt and Morcos (2020a) Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A. Morcos. 2020a. Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . 
Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. 
Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. 
Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. 
In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. 
Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. 
Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. 
Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . 
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. 
Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Towards falsifiable interpretability research. arXiv preprint arXiv:2010.12016 . Leavitt and Morcos (2020b) Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. 
The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos. 2020b. On the relationship between class selectivity, dimensionality, and robustness. arXiv preprint arXiv:2007.04440 . Leavitt and Morcos (2020c) Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. 
The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. 
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Leavitt, M.L. and A.S. Morcos 2020c. Selectivity considered harmful: evaluating the causal impact of class selectivity in dnns. In International Conference on Learning Representations. LeCun et al. (2015) LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. 
Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. nature 521(7553): 436–444 . Lipton (2018) Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. 
(2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 
2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. 
MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. 
A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. 
Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. 
(2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57 . Lu et al. (2017) Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. 
Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . 
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . 
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. 
arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. 
Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. 
Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. 
Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. 
Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. 
Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. 
Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? 
a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Leavitt, M.L. and A.S. Morcos. 2020c. Selectivity considered harmful: Evaluating the causal impact of class selectivity in DNNs. In International Conference on Learning Representations.
- LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.
- Lipton, Z.C. 2018. The mythos of model interpretability: In machine learning, the concept of interpretability is both important and slippery. Queue 16(3): 31–57.
- Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
- Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
- McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
- Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in GPT. Advances in Neural Information Processing Systems 35: 17359–17372.
- Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick. 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
- Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
- Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
- Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001.
- Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
- Peacocke, C. 1992. A study of concepts. MIT Press.
- Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
- Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
- Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
- Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
- Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: A survey. Big Data Research 23: 100178.
- Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
- Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
- Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
- Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
- Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
- Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
- Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
- Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
- Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
- Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
- Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
- Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
- Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: A comprehensive survey and current challenges. Materials 13(24): 5755.
- Yee, E. 2019. Abstraction and concepts: When, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
- Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
- Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
- Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
- Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Lu, Z., H. Pu, F. Wang, Z. Hu, and L. Wang. 2017. The expressive power of neural networks: A view from the width. Advances in neural information processing systems 30 . Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. 
Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. 
Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. 
Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. 
Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . 
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. 
Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. 
Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. 
(2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. 
(2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
(2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 
2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. 
(2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. 
Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . 
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. 
Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. 
Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. 
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. 
Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. 
Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. 
Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. 
Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . 
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . 
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. 
arXiv preprint arXiv:1806.02891 .
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. 
arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. 
Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. 
Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. 
Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. 
Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. 
In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. 
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 
2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. 
Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. 
(2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. 
Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- The expressive power of neural networks: A view from the width. Advances in Neural Information Processing Systems 30.
Marblestone et al. (2016) Marblestone, A.H., G. Wayne, and K.P. Kording. 2016. Toward an integration of deep learning and neuroscience. Frontiers in Computational Neuroscience 10: 94.
McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in GPT. Advances in Neural Information Processing Systems 35: 17359–17372.
Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick. 2018. On the importance of single directions for generalization. In International Conference on Learning Representations.
Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570.
Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662.
Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024-001.
Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press.
Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: A survey. Big Data Research 23: 100178.
Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
(2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. 
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. 
Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. 
Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Toward an integration of deep learning and neuroscience. Frontiers in computational neuroscience 10: 94 . McKenna et al. (2023) McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . McKenna, N., T. Li, L. Cheng, M.J. Hosseini, M. Johnson, and M. Steedman. 2023. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. 
Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. 
Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. 
Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. 
Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. 
Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. 
Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. 
Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. 
- Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552 . Meng et al. (2022) Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Meng, K., D. Bau, A. Andonian, and Y. Belinkov. 2022. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . 
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. 
Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. 
Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. 
The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. 
- Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems 35: 17359–17372 . Morcos et al. (2018) Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Morcos, A.S., D.G. Barrett, N.C. Rabinowitz, and M. Botvinick 2018. On the importance of single directions for generalization. In International Conference on Learning Representations. Mukhlif et al. (2023) Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Mukhlif, A.A., B. Al-Khateeb, and M.A. Mohammed. 2023. Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. 
The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 
2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. 
Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . 
Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. 
Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . 
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. 
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. 
A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 
2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. 
Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. 
(2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. 
Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Incorporating a novel dual transfer learning approach for medical images. Sensors 23(2): 570 . Oikarinen et al. (2019) Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. 
Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Oikarinen, T., K. Srinivasan, O. Meisner, J.B. Hyman, S. Parmar, A. Fanucci-Kiss, R. Desimone, R. Landman, and G. Feng. 2019. Deep convolutional network for animal sound classification and source attribution using dual audio recordings. The Journal of the Acoustical Society of America 145(2): 654–662 . Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001 . Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. 
In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7 . Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. 
Columbia University Press.
Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178.
Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. “Why should I trust you?” Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755.
Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
(2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. 
Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. 
Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. 
Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. 
Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Olah et al. (2020) Olah, C., N. Cammarata, L. Schubert, G. Goh, M. Petrov, and S. Carter. 2020. Zoom in: An introduction to circuits. Distill 5(3): e00024–001.
- Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7.
- Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press.
- Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
- Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
- Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
- Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
- Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. “Why should I trust you?” Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
- Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178.
- Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
- Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
- Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
- Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
- Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
- Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
- Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
- Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
- Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
- Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
- Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
- Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755.
- Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
- Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
- Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
- Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
- Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
- Olah et al. (2017) Olah, C., A. Mordvintsev, and L. Schubert. 2017. Feature visualization. Distill 2(11): e7. Peacocke (1992) Peacocke, C. 1992. A study of concepts. MIT Press. Pearl (2019) Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19. Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444. Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6). Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178. Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. “Why should I trust you?” Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634. Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and Occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199. Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30. Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14. Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28. Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755. Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265. Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115. Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38. Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198. Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
- Peacocke, C. 1992. A study of concepts. MIT Press.
- Pearl, J. 2019. The limitations of opaque learning machines. Possible Minds 25: 13–19.
- Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press.
- Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
- Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
- Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178.
- Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
- Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
- Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
- Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
- Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
- Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
- Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
- Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
- Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
- Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
- Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
- Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
- Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755.
- Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
- Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
- Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
- Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
- Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. 
(2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. 
The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. 
Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 
2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . 
Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. 
Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. 
(2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Pearl, J. 2019. The limitations of opaque learning machines. Possible minds 25: 13–19 . Quine (1969) Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. 
Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. 
Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . 
Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. 
Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . 
Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. 
Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Quine, W.V. 1969. Ontological relativity and other essays. Columbia University Press. Radford et al. (2017) Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. 
(2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444 . Räz (2023) Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. 
Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. 
- Radford, A., R. Jozefowicz, and I. Sutskever. 2017. Learning to generate reviews and discovering sentiment. arXiv preprint arXiv:1704.01444.
- Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6).
- Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178.
- Ribeiro, M.T., S. Singh, and C. Guestrin. 2016. "Why should I trust you?": Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144.
- Rolnick, D. and M. Tegmark. 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations.
- Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634.
- Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul. 2019. Enhanced transfer learning with ImageNet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer.
- Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
- Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
- Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III 27, pp. 270–279. Springer.
- Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
- Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
- Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
- Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
- Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
- Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755.
- Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
- Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
- Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
- Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
- Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
- Räz, T. 2023. Methods for identifying emergent concepts in deep neural networks. Patterns 4(6) . Ren et al. (2021) Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ren, X., X. Li, K. Ren, J. Song, Z. Xu, K. Deng, and X. Wang. 2021. Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. 
In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. 
Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. 
Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. 
(2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. 
John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. 
Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. 
Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Deep learning-based weather prediction: a survey. Big Data Research 23: 100178 . Ribeiro et al. (2016) Ribeiro, M.T., S. Singh, and C. Guestrin 2016. ”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Ribeiro, M.T., S. Singh, and C. Guestrin 2016. 
”why should i trust you?” explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp. 1135–1144. Rolnick and Tegmark (2018) Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Rolnick, D. and M. Tegmark 2018. The power of deeper networks for expressing natural functions. In International Conference on Learning Representations. Schölkopf et al. (2021) Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. 
Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Schölkopf, B., F. Locatello, S. Bauer, N.R. Ke, N. Kalchbrenner, A. Goyal, and Y. Bengio. 2021. Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. 
Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. 
Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. 
ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Toward causal representation learning. Proceedings of the IEEE 109(5): 612–634 . Shermin et al. (2019) Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Shermin, T., S.W. Teng, M. Murshed, G. Lu, F. Sohel, and M. Paul 2019. Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. 
In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. 
Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. 
Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. 
(2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. 
Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Enhanced transfer learning with imagenet trained classification layer. In Image and Video Technology: 9th Pacific-Rim Symposium, PSIVT 2019, Sydney, NSW, Australia, November 18–22, 2019, Proceedings 9, pp. 142–155. Springer. Sterkenburg (2023) Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Sterkenburg, T.F. 2023. Statistical learning theory and occam’s razor: The argument from empirical risk minimization. Szegedy et al. (2013) Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. 
Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199 . Tan et al. (2018) Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. 
arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4-7, 2018, Proceedings, Part III 27, pp. 270–279. Springer. Vapnik (1999) Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vapnik, V. 1999. The nature of statistical learning theory. Springer science & business media. Vaswani et al. (2017) Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. 
Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 . Wang and Gong (2022) Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14 . Watson (2023) Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Watson, D.S. 2023. 
On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? 
Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Sterkenburg, T.F. 2023. Statistical learning theory and Occam's razor: The argument from empirical risk minimization.
- Szegedy, C., W. Zaremba, I. Sutskever, J. Bruna, D. Erhan, I. Goodfellow, and R. Fergus. 2013. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199.
- Tan, C., F. Sun, T. Kong, W. Zhang, C. Yang, and C. Liu. 2018. A survey on deep transfer learning. In Artificial Neural Networks and Machine Learning – ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, October 4–7, 2018, Proceedings, Part III, pp. 270–279. Springer.
- Vapnik, V. 1999. The nature of statistical learning theory. Springer Science & Business Media.
- Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30.
- Wang, S. and Y. Gong. 2022. Adversarial example detection based on saliency map features. Applied Intelligence: 1–14.
- Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28.
- Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons.
- Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755.
- Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265.
- Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115.
- Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52(1): 1–38.
- Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can ChatGPT understand too? A comparative study on ChatGPT and fine-tuned BERT. arXiv preprint arXiv:2302.10198.
- Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891.
Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? 
a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Watson, D.S. 2023. On the philosophy of unsupervised learning. Philosophy & Technology 36(2): 28 . Wittgenstein (2010) Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. 
Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Wittgenstein, L. 2010. Philosophical investigations. John Wiley & Sons. Yang et al. (2020) Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yang, J., S. Li, Z. Wang, H. Dong, J. Wang, and S. Tang. 2020. Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? 
a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Using deep learning to detect defects in manufacturing: a comprehensive survey and current challenges. Materials 13(24): 5755 . Yee (2019) Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. 
arXiv preprint arXiv:1806.02891 .
- Yee, E. 2019. Abstraction and concepts: when, how, where, what and why? Language, Cognition and Neuroscience 34(10): 1257–1265 . Zhang et al. (2021) Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, C., S. Bengio, M. Hardt, B. Recht, and O. Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Understanding deep learning (still) requires rethinking generalization. Communications of the ACM 64(3): 107–115 . Zhang et al. (2019) Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhang, S., L. Yao, A. Sun, and Y. Tay. 2019. Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Deep learning based recommender system: A survey and new perspectives. ACM computing surveys (CSUR) 52(1): 1–38 . Zhong et al. (2023) Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhong, Q., L. Ding, J. Liu, B. Du, and D. Tao. 2023. Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Can chatgpt understand too? a comparative study on chatgpt and fine-tuned bert. arXiv preprint arXiv:2302.10198 . Zhou et al. (2018) Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 . Zhou, B., Y. Sun, D. Bau, and A. Torralba. 2018. Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .
- Revisiting the importance of individual units in cnns via ablation. arXiv preprint arXiv:1806.02891 .