Unveiling Disparities in Web Task Handling Between Human and Web Agent (2405.04497v2)

Published 7 May 2024 in cs.HC

Abstract: With the advancement of Large Language Models (LLMs) and Large Vision-Language Models (LVMs), agents have shown significant capabilities in various tasks, such as data analysis, gaming, or code generation. Recently, there has been a surge in research on web agents, capable of performing tasks within the web environment. However, the web poses unforeseeable scenarios, challenging the generalizability of these agents. This study investigates the disparities between human and web agents' performance in web tasks (e.g., information search) by concentrating on planning, action, and reflection aspects during task execution. We conducted a web task study with a think-aloud protocol, revealing distinct cognitive actions and operations on websites employed by humans. Comparative examination of existing agent structures and human behavior with thought processes highlighted differences in knowledge updating and ambiguity handling when performing the task. Humans demonstrated a propensity for exploring and modifying plans based on additional information and investigating reasons for failure. These findings offer insights into designing planning, reflection, and information discovery modules for web agents and designing the capturing method for implicit human knowledge in a web task.

Authors (7)
  1. Kihoon Son (7 papers)
  2. Jinhyeon Kwon (1 paper)
  3. Tae Soo Kim (20 papers)
  4. Young-Ho Kim (36 papers)
  5. Sangdoo Yun (71 papers)
  6. Juho Kim (56 papers)
  7. DaEun Choi (5 papers)