Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

FCBench: Cross-Domain Benchmarking of Lossless Compression for Floating-Point Data (2312.10301v2)

Published 16 Dec 2023 in cs.DB

Abstract: While both the database and high-performance computing (HPC) communities utilize lossless compression methods to minimize floating-point data size, a disconnect persists between them. Each community designs and assesses methods in a domain-specific manner, making it unclear if HPC compression techniques can benefit database applications or vice versa. With the HPC community increasingly leaning towards in-situ analysis and visualization, more floating-point data from scientific simulations are being stored in databases like Key-Value Stores and queried using in-memory retrieval paradigms. This trend underscores the urgent need for a collective study of these compression methods' strengths and limitations, not only based on their performance in compressing data from various domains but also on their runtime characteristics. Our study extensively evaluates the performance of eight CPU-based and five GPU-based compression methods developed by both communities, using 33 real-world datasets assembled in the Floating-point Compressor Benchmark (FCBench). Additionally, we utilize the roofline model to profile their runtime bottlenecks. Our goal is to offer insights into these compression methods that could assist researchers in selecting existing methods or developing new ones for integrated database and HPC applications.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Xinyu Chen (65 papers)
  2. Jiannan Tian (30 papers)
  3. Ian Beaver (6 papers)
  4. Cynthia Freeman (4 papers)
  5. Yan Yan (242 papers)
  6. Jianguo Wang (62 papers)
  7. Dingwen Tao (60 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.