Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

How to Optimize the Environmental Impact of Transformed NoSQL Schemas through a Multidimensional Cost Model? (2311.15406v2)

Published 26 Nov 2023 in cs.DB

Abstract: The complexity of database systems has increased significantly along with the continuous growth of data, resulting in NoSQL systems and forcing Information Systems (IS) architects to constantly adapt their data models (i.e., the data structure of information stored in the database) and carefully choose the best option(s) for storing and managing data. In this context, we propose %in this paper an automatic global approach for leading data models' transformation process. This approach starts with the generation of all possible solutions. It then relies on a cost model that helps to compare these generated data models in a logical level to finally choose the best one for the given use case. This cost model integrates both data model and queries cost. It also takes into consideration the environmental impact of a data model as well as its financial and its time costs. This work presents for the first time a multidimensional cost model encompassing time, environmental and financial constraints, which compares data models leading to the choice of the optimal one for a given use case. In addition, a simulation for data model's transformation and cost computation has been developed based on our approach.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (25)
  1. MDA-based Approach for NoSQL Databases Modelling. In DAWAK’17. Springer, 88–102.
  2. Foto N. Afrati and Jeffrey D. Ullman. 2010. Optimizing Joins in a Map-Reduce Environment. In Proceedings of the 13th International Conference on Extending Database Technology (Lausanne, Switzerland) (EDBT ’10). Association for Computing Machinery, New York, NY, USA, 99–110. https://doi.org/10.1145/1739041.1739056
  3. Amer F. Al-Badarneh and Salahaldeen Atef Rababa. 2022. An analysis of two-way equi-join algorithms under MapReduce. Journal of King Saud University - Computer and Information Sciences 34, 4 (2022), 1074–1085. https://doi.org/10.1016/j.jksuci.2020.05.004
  4. A Comparison of Join Algorithms for Log Processing in MaPreduce. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data (Indianapolis, Indiana, USA) (SIGMOD ’10). Association for Computing Machinery, New York, NY, USA, 975–986. https://doi.org/10.1145/1807167.1807273
  5. A Big Data Modeling Methodology for Apache Cassandra. In ICBD’15. IEEE, 238–245.
  6. UMLtoGraphDB: mapping conceptual schemas to graph databases. In Conceptual Modeling (ER’16). 430–444.
  7. Conceptual Mappings to Convert Relational into NoSQL Databases. In ICEIS’16. 174–181.
  8. Mortadelo: Automatic generation of NoSQL stores from platform-independent data models. Future Generation Computer Systems 105 (2020), 455–474.
  9. Jeffrey Dean and Sanjay Ghemawat. 2004. MapReduce: Simplified Data Processing on Large Clusters (OSDI’04). USENIX Association, USA, 10.
  10. Chasing Carbon: The Elusive Environmental Footprint of Computing. IEEE Micro 42, 4 (jul 2022), 37–47. https://doi.org/10.1109/MM.2022.3163226
  11. David Guyon. 2018. Supporting energy-awareness for cloud users. Ph. D. Dissertation. http://www.theses.fr/2018REN1S037/document 2018REN1S037.
  12. Transforming UML class diagrams into HBase based on meta-model. In ISEEE’14, Vol. 2. 720–724.
  13. Improving the energy efficiency of relational and NoSQL databases via query optimizations. Sustainable Computing: Informatics and Systems 22 (2019), 120–133.
  14. A Global Model-Driven Denormalization Approach for Schema Migration. In International Conference on Research Challenges in Information Science. Springer, 529–545.
  15. ModelDrivenGuide: An Approach for Implementing NoSQL Schemas. In DEXA’20. Springer, 141–151.
  16. M Tamer Özsu and Patrick Valduriez. 1999. Principles of distributed database systems. Vol. 2. Springer.
  17. A framework for migrating relational datasets to NoSQL. Procedia Computer Science 51 (2015), 2593–2602.
  18. Estimating power/energy consumption in database servers. Procedia Computer Science 6 (2011), 112–117.
  19. An economic energy approach for queries on data centers. (2017).
  20. Rami Sellami and Bruno Defude. 2017. Complex queries optimization and evaluation over relational and NoSQL data stores in cloud environments. IEEE transactions on big data 4, 2 (2017), 217–230.
  21. Alan J. Smith. 1985. Disk Cache—miss Ratio Analysis and Design Considerations. ACM Trans. Comput. Syst. 3, 3 (aug 1985), 161–203. https://doi.org/10.1145/3959.3961
  22. Swamit Tannu and Prashant J. Nair. 2022. The Dirty Secret of SSDs: Embodied Carbon. arXiv:2207.10793 [cs.AR]
  23. Comparative study of the new generation, agile, scalable, high performance NoSQL databases. International Journal of Computer Applications 48, 20 (2012), 1–4.
  24. Sanjeev Thakur and Ankur Chaurasia. 2016. Towards Green Cloud Computing: Impact of carbon footprint on environment. In 2016 6th international conference-cloud system and big data engineering (Confluence). IEEE, 209–213.
  25. Denormalizing data into schema-free databases. In CogInfoCom’13. IEEE, 747–752.
Citations (1)

Summary

We haven't generated a summary for this paper yet.