Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DeepShovel: An Online Collaborative Platform for Data Extraction in Geoscience Literature with AI Assistance (2202.10163v2)

Published 21 Feb 2022 in cs.HC and cs.AI

Abstract: Geoscientists, as well as researchers in many fields, need to read a huge amount of literature to locate, extract, and aggregate relevant results and data to enable future research or to build a scientific database, but there is no existing system to support this use case well. In this paper, based on the findings of a formative study about how geoscientists collaboratively annotate literature and extract and aggregate data, we proposed DeepShovel, a publicly-available AI-assisted data extraction system to support their needs. DeepShovel leverages the state-of-the-art neural network models to support researcher(s) easily and accurately annotate papers (in the PDF format) and extract data from tables, figures, maps, etc. in a human-AI collaboration manner. A follow-up user evaluation with 14 researchers suggested DeepShovel improved users' efficiency of data extraction for building scientific databases, and encouraged teams to form a larger scale but more tightly-coupled collaboration.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Shao Zhang (18 papers)
  2. Yuting Jia (14 papers)
  3. Hui Xu (121 papers)
  4. Ying Wen (75 papers)
  5. Dakuo Wang (87 papers)
  6. Xinbing Wang (98 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.