Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

LineFormer: Rethinking Line Chart Data Extraction as Instance Segmentation (2305.01837v1)

Published 3 May 2023 in cs.CV and cs.AI

Abstract: Data extraction from line-chart images is an essential component of the automated document understanding process, as line charts are a ubiquitous data visualization format. However, the amount of visual and structural variations in multi-line graphs makes them particularly challenging for automated parsing. Existing works, however, are not robust to all these variations, either taking an all-chart unified approach or relying on auxiliary information such as legends for line data extraction. In this work, we propose LineFormer, a robust approach to line data extraction using instance segmentation. We achieve state-of-the-art performance on several benchmark synthetic and real chart datasets. Our implementation is available at https://github.com/TheJaeLal/LineFormer .

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Jay Lal (2 papers)
  2. Aditya Mitkari (1 paper)
  3. Mahesh Bhosale (3 papers)
  4. David Doermann (54 papers)
Citations (2)
Github Logo Streamline Icon: https://streamlinehq.com