
Anfrage-getriebener Wissenstransfer zur Unterstuetzung von Datenanalysten [Query-Driven Knowledge Transfer to Support Data Analysts] (1610.06382v1)

Published 20 Oct 2016 in cs.DB and cs.IR

Abstract: In larger organizations, multiple teams of data scientists have to integrate data from heterogeneous data sources as preparation for data analysis tasks. Writing effective analytical queries requires data scientists to have in-depth knowledge of the existence, semantics, and usage context of data sources. Once gathered, such knowledge is informally shared within a specific team of data scientists, but usually is neither formalized nor shared with other teams. Potential synergies remain unused. We therefore introduce a novel approach which extends data management systems with additional knowledge-sharing capabilities to facilitate user collaboration without altering established data analysis workflows. Relevant collective knowledge from the query log is extracted to support data source discovery and incremental data integration. Extracted knowledge is formalized and provided at query time.
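To make the idea of extracting knowledge from a query log more concrete, the following is a minimal sketch, not the authors' method. It assumes the log is a plain list of SQL strings and treats tables that appear together in a query as related data sources. The sample log, the regular expression, and the names `tables_in`, `co_occurrence`, and `suggest_sources` are all hypothetical illustrations; the abstract does not describe the actual log format or extraction technique.

```python
import re
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical query log; the real log format is not specified in the abstract.
QUERY_LOG = [
    "SELECT * FROM orders o JOIN customers c ON o.cid = c.id",
    "SELECT c.name FROM customers c JOIN regions r ON c.rid = r.id",
    "SELECT * FROM orders JOIN customers ON orders.cid = customers.id",
]

# Simplifying assumption: a table name directly follows FROM or JOIN.
# A real system would use a proper SQL parser instead of a regex.
TABLE_PATTERN = re.compile(
    r"\b(?:FROM|JOIN)\s+([A-Za-z_][A-Za-z0-9_.]*)", re.IGNORECASE
)


def tables_in(query):
    """Return the set of lower-cased table names referenced in one query."""
    return {name.lower() for name in TABLE_PATTERN.findall(query)}


def co_occurrence(log):
    """Count how often each pair of tables is used in the same query."""
    related = defaultdict(Counter)
    for query in log:
        for a, b in combinations(sorted(tables_in(query)), 2):
            related[a][b] += 1
            related[b][a] += 1
    return related


def suggest_sources(related, table, limit=3):
    """Suggest tables most frequently queried together with `table`."""
    return [name for name, _ in related[table.lower()].most_common(limit)]


related = co_occurrence(QUERY_LOG)
suggestions = suggest_sources(related, "customers")
```

With this sample log, a user querying `customers` would be pointed at `orders` first, because other users joined those two tables most often. Surfacing such suggestions at query time is one simple way to share a team's implicit knowledge without changing how analysts write their queries.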

Authors (5)
  1. Andreas M. Wahl (2 papers)
  2. Gregor Endler (1 paper)
  3. Peter K. Schwab (1 paper)
  4. Sebastian Herbst (2 papers)
  5. Richard Lenz (6 papers)
