Deep Dict: Deep Learning-based Lossy Time Series Compressor for IoT Data (2401.10396v1)
Abstract: We propose Deep Dict, a deep learning-based lossy time series compressor designed to achieve a high compression ratio while maintaining decompression error within a predefined range. Deep Dict incorporates two essential components: the Bernoulli transformer autoencoder (BTAE) and a distortion constraint. BTAE extracts Bernoulli representations from time series data, reducing the size of the representations compared to conventional autoencoders. The distortion constraint limits the prediction error of BTAE to the desired range. Moreover, in order to address the limitations of common regression losses such as L1/L2, we introduce a novel loss function called quantized entropy loss (QEL). QEL takes into account the specific characteristics of the problem, enhancing robustness to outliers and alleviating optimization challenges. Our evaluation of Deep Dict across ten diverse time series datasets from various domains reveals that Deep Dict outperforms state-of-the-art lossy compressors in terms of compression ratio by a significant margin by up to 53.66%.
- M. Stoyanova, Y. Nikoloudakis, S. Panagiotakis, E. Pallis, and E. K. Markakis, “A Survey on the Internet of Things (IoT) Forensics: Challenges, Approaches, and Open Issues,” IEEE Communications Surveys & Tutorials, vol. 22, no. 2, pp. 1191–1221, 2020, conference Name: IEEE Communications Surveys & Tutorials.
- A. Nauman, Y. A. Qadri, M. Amjad, Y. B. Zikria, M. K. Afzal, and S. W. Kim, “Multimedia Internet of Things: A Comprehensive Survey,” IEEE Access, vol. 8, pp. 8202–8250, 2020, conference Name: IEEE Access.
- T. Wong and Z. Luo, “Recurrent Auto-Encoder Model for Large-Scale Industrial Sensor Signal Analysis,” in Engineering Applications of Neural Networks, ser. Communications in Computer and Information Science, E. Pimenidis and C. Jayne, Eds. Cham: Springer International Publishing, 2018, pp. 203–216.
- T. Buddhika, M. Malensek, S. Pallickara, and S. L. Pallickara, “Living on the edge: Data transmission, storage, and analytics in continuous sensing environments,” ACM Trans. Internet Things, vol. 2/3, jul 2021.
- S. K. Jensen, T. B. Pedersen, and C. Thomsen, “Time Series Management Systems: A Survey,” IEEE Transactions on Knowledge and Data Engineering, vol. 29, no. 11, pp. 2581–2600, Nov. 2017, conference Name: IEEE Transactions on Knowledge and Data Engineering.
- G. Chiarot and C. Silvestri, “Time series compression: a survey,” Jan. 2021, number: arXiv:2101.08784 arXiv:2101.08784 [cs]. [Online]. Available: http://arxiv.org/abs/2101.08784
- S. Jin, S. Di, X. Liang, J. Tian, D. Tao, and F. Cappello, “DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression,” in 28th International Symposium on High-Performance Parallel and Distributed Computing, ser. HPDC ’19. New York, NY, USA: ACM, 2019, pp. 159–170.
- D. Bank, N. Koenigstein, and R. Giryes, “Autoencoders,” Apr. 2021, arXiv:2003.05991 [cs, stat]. [Online]. Available: http://arxiv.org/abs/2003.05991
- S. Chandak, K. Tatwawadi, C. Wen, L. Wang, J. Aparicio Ojea, and T. Weissman, “LFZip: Lossy Compression of Multivariate Floating-Point Time Series Data via Improved Prediction,” in 2020 Data Compression Conference (DCC), Mar. 2020, pp. 342–351, iSSN: 2375-0359.
- A. Kumar, Z. Wang, and A. Srivastava, “A novel approach for classification in resource-constrained environments,” ACM Trans. Internet Things, vol. 3/4, sep 2022.
- X. Liang, S. Di, D. Tao, S. Li, S. Li, H. Guo, Z. Chen, and F. Cappello, “Error-Controlled Lossy Compression Optimized for High Compression Ratios of Scientific Datasets,” in 2018 IEEE International Conference on Big Data (Big Data), 2018, pp. 438–447.
- K. Zhao, S. Di, M. Dmitriev, T.-L. D. Tonellot, Z. Chen, and F. Cappello, “Optimizing Error-Bounded Lossy Compression for Scientific Data by Dynamic Spline Interpolation,” in 2021 IEEE 37th International Conference on Data Engineering (ICDE), Apr. 2021, pp. 1643–1654, iSSN: 2375-026X.
- C.-Z. A. Huang, A. Vaswani, J. Uszkoreit, N. Shazeer, I. Simon, C. Hawthorne, A. M. Dai, M. D. Hoffman, M. Dinculescu, and D. Eck, “Music Transformer,” Dec. 2018, arXiv:1809.04281 [cs, eess, stat]. [Online]. Available: http://arxiv.org/abs/1809.04281
- I. Grebnov, “IlyaGrebnov/libbsc,” May 2022, original-date: 2011-05-11T06:39:49Z. [Online]. Available: https://github.com/IlyaGrebnov/libbsc