Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

WebUI: A Dataset for Enhancing Visual UI Understanding with Web Semantics (2301.13280v1)

Published 30 Jan 2023 in cs.HC

Abstract: Modeling user interfaces (UIs) from visual information allows systems to make inferences about the functionality and semantics needed to support use cases in accessibility, app automation, and testing. Current datasets for training machine learning models are limited in size due to the costly and time-consuming process of manually collecting and annotating UIs. We crawled the web to construct WebUI, a large dataset of 400,000 rendered web pages associated with automatically extracted metadata. We analyze the composition of WebUI and show that while automatically extracted data is noisy, most examples meet basic criteria for visual UI modeling. We applied several strategies for incorporating semantics found in web pages to increase the performance of visual UI understanding models in the mobile domain, where less labeled data is available: (i) element detection, (ii) screen classification and (iii) screen similarity.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Jason Wu (28 papers)
  2. Siyan Wang (2 papers)
  3. Siman Shen (1 paper)
  4. Yi-Hao Peng (12 papers)
  5. Jeffrey Nichols (25 papers)
  6. Jeffrey P. Bigham (48 papers)
Citations (48)