Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
184 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Efficient Algorithms for Top-k Stabbing Queries on Weighted Interval Data (Full Version) (2405.05601v2)

Published 9 May 2024 in cs.DB

Abstract: Intervals have been generated in many applications (e.g., temporal databases), and they are often associated with weights, such as prices. This paper addresses the problem of processing top-k weighted stabbing queries on interval data. Given a set of weighted intervals, a query value, and a result size $k$, this problem finds the $k$ intervals that are stabbed by the query value and have the largest weights. Although this problem finds practical applications (e.g., purchase, vehicle, and cryptocurrency analysis), it has not been well studied. A state-of-the-art algorithm for this problem incurs $O(n\log k)$ time, where $n$ is the number of intervals, so it is not scalable to large $n$. We solve this inefficiency issue and propose an algorithm that runs in $O(\sqrt{n }\log n + k)$ time. Furthermore, we propose an $O(\log n + k)$ algorithm to further accelerate the search efficiency. Experiments on two real large datasets demonstrate that our algorithms are faster than existing algorithms.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (19)
  1. An Optimal Dynamic Interval Stabbing-max Data Structure?. In SODA. 803–812.
  2. Daichi Amagata. 2024a. Independent Range Sampling on Interval Data. In ICDE. 449–461.
  3. Daichi Amagata. 2024b. Independent Range Sampling on Interval Data (Longer Version). arXiv:2405.08315 (2024).
  4. Daichi Amagata and Takahiro Hara. 2017. Mining Top-k Co-Occurrence Patterns across Multiple Streams. IEEE Transactions on Knowledge and Data Engineering 29, 10 (2017), 2249–2262.
  5. Sliding window top-k dominating query processing over distributed data streams. Distributed and Parallel Databases 34 (2016), 535–566.
  6. Efficient Algorithms for Top-k Stabbing Queries on Weighted Interval Data. In DEXA.
  7. Period Index: A Learned 2d Hash Index for Range and Duration Queries. In SSTD. 100–109.
  8. Hint: A Hierarchical Index for Intervals in Main Memory. In SIGMOD. 1257–1270.
  9. HINT: a Hierarchical interval index for Allen relationships. The VLDB Journal (2023), 1–28.
  10. Mark De Berg. 2000. Computational Geometry: Algorithms and Applications.
  11. Herbert Edelsbrunner. 1980. Dynamic Rectangle Intersection Searching.
  12. SAP HANA Database: Data Management for Modern Business Applications. SIGMOD Record 40, 4 (2012), 45–51.
  13. Dynamic rectangular intersection with priorities. In STOC. 639–648.
  14. Timeline Index: a Unified Data Structure for Processing Queries on Temporal Data in SAP HANA. In SIGMOD. 1173–1184.
  15. Lamps: Location-Aware Moving Top-k Pub/Sub. IEEE Transactions on Knowledge & Data Engineering 34, 01 (2022), 352–364.
  16. Range Thresholding on Streams. In SIGMOD. 571–582.
  17. Jianqiu Xu and Hua Lu. 2017. Efficiently answer top-k queries on typed intervals. Information Systems 71 (2017), 164–181.
  18. In-memory big data management and processing: A survey. IEEE Transactions on Knowledge and Data Engineering 27, 7 (2015), 1920–1948.
  19. Approximate Range Thresholding. In SIGMOD. 1108–1121.
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com