Quantixar: High-performance Vector Data Management System (2403.12583v1)
Abstract: Traditional database management systems need help efficiently represent and querying the complex, high-dimensional data prevalent in modern applications. Vector databases offer a solution by storing data as numerical vectors within a multi-dimensional space. This enables similarity-based search and analysis, such as image retrieval, recommendation engine generation, and natural language processing. This paper introduces Quantixar, a vector database project designed for efficiency in high-dimensional settings. Quantixar tackles the challenge of managing high-dimensional data by strategically combining advanced indexing and quantization techniques. It employs HNSW indexing for accelerated ANN search. Additionally, Quantixar incorporates binary and product quantization to compress high-dimensional vectors, reducing storage requirements and computational costs during search. The paper delves into Quantixar's architecture, specific implementation, and experimental methodology.
- Zhong, Y. Efficient Similarity Indexing and Searching in High Dimensions. (2015)
- Russinoff, D. SSE Floating-Point Instructions. Formal Verification Of Floating-Point Hardware Design: A Mathematical Approach. pp. 241-246 (2021)
- Jokela, S. & Others Metadata enhanced content management in media companies. (Helsinki University of Technology,2001)
- Nalawala, H., Shah, J., Agrawal, S. & Oza, P. A comprehensive study of “etcd”—an open-source distributed key-value store with relevant distributed databases. Emerging Technologies For Computing, Communication And Smart Cities: Proceedings Of ETCCS 2021. pp. 481-489 (2022)