Cohesiveness Relationships to Empower Keyword Search on Tree Data on the Web (1508.04957v1)

Published 20 Aug 2015 in cs.DB

Abstract: Keyword search is the most popular querying technique on semistructured data. Keyword queries are simple and convenient. However, as a consequence of their imprecision, the quality of their answers is poor and the existing algorithms do not scale satisfactorily. In this paper, we introduce the novel concept of cohesive keyword queries for tree data. Intuitively, a cohesiveness relationship on keywords indicates that they should form a cohesive whole in a query result. Cohesive keyword queries allow term nesting and keyword repetition. Although more expressive, they are as simple as flat keyword queries. We provide formal semantics for cohesive keyword queries ranking query results on the proximity of the keyword instances. We design a stack based algorithm which efficiently evaluates cohesive keyword queries. Our experiments demonstrate that our approach outperforms in quality previous filtering semantics and our algorithm scales smoothly on queries of even 20 keywords on large datasets.

Authors (3)
  1. Aggeliki Dimitriou (1 paper)
  2. Ananya Dass (1 paper)
  3. Dimitri Theodoratos (3 papers)
Citations (1)