ITCMA: A Generative Agent Based on a Computational Consciousness Structure (2403.20097v2)
Abstract: LLMs still face challenges in tasks requiring understanding implicit instructions and applying common-sense knowledge. In such scenarios, LLMs may require multiple attempts to achieve human-level performance, potentially leading to inaccurate responses or inferences in practical environments, affecting their long-term consistency and behavior. This paper introduces the Internal Time-Consciousness Machine (ITCM), a computational consciousness structure to simulate the process of human consciousness. We further propose the ITCM-based Agent (ITCMA), which supports action generation and reasoning in open-world settings, and can independently complete tasks. ITCMA enhances LLMs' ability to understand implicit instructions and apply common-sense knowledge by considering agents' interaction and reasoning with the environment. Evaluations in the Alfworld environment show that trained ITCMA outperforms the state-of-the-art (SOTA) by 9% on the seen set. Even untrained ITCMA achieves a 96% task completion rate on the seen set, 5% higher than SOTA, indicating its superiority over traditional intelligent agents in utility and generalization. In real-world tasks with quadruped robots, the untrained ITCMA achieves an 85% task completion rate, which is close to its performance in the unseen set, demonstrating its comparable utility and universality in real-world settings.