Managing Data Replication and Distribution in the Fog with FReD (2303.05256v2)
Abstract: The heterogeneous, geographically distributed infrastructure of fog computing poses challenges in data replication, data distribution, and data mobility for fog applications. Fog computing is still missing the necessary abstractions to manage application data, and fog application developers need to re-implement data management for every new piece of software. Proposed solutions are limited to certain application domains, such as the IoT, are not flexible in regard to network topology, or do not provide the means for applications to control the movement of their data. In this paper, we present FReD, a data replication middleware for the fog. FReD serves as a building block for configurable fog data distribution and enables low-latency, high-bandwidth, and privacy-sensitive applications. FReD is a common data access interface across heterogeneous infrastructure and network topologies, provides transparent and controllable data distribution, and can be integrated with applications from different domains. To evaluate our approach, we present a prototype implementation of FReD and show the benefits of developing with FReD using three case studies of fog computing applications.
- Daniel Abadi. 2012. Consistency Tradeoffs in Modern Distributed Database System Design: CAP is Only Part of the Story. Computer 45, 2 (Jan. 2012), 37–42. https://doi.org/10.1109/MC.2012.33
- Fog Computing for the Internet of Things: Security and Privacy Issues. IEEE Internet Computing 21, 2 (March 2017), 34–42. https://doi.org/10.1109/MIC.2017.37
- Predictive Replica Placement for Mobile Users in Distributed Fog Data Stores with Client-Side Markov Models. In Proceedings of the 14th IEEE/ACM International Conference on Utility and Cloud Computing Companion (Leicester, United Kingdom) (UCC ’21). Association for Computing Machinery, New York, NY, USA, 1–8. https://doi.org/10.1145/3492323.3495595
- Metastorage: A Federated Cloud Storage System to Manage Consistency-Latency Tradeoffs. In Proceedings of the IEEE International Conference on Cloud Computing (Washington, DC, USA) (CLOUD 2011). IEEE, New York, NY, USA, 452–459. https://doi.org/10.1109/CLOUD.2011.62
- A Middleware Guaranteeing Client-Centric Consistency on Top of Eventually Consistent Datastores. In Proceedings of the 1st IEEE International Conference on Cloud Engineering (San Francisco, CA, USA) (IC2E 2013). IEEE, New York, NY, USA, 114–123. https://doi.org/10.1109/IC2E.2013.32
- Towards Grassroots Peering at the Edge. In Proceedings of the 8th Workshop on Middleware and Applications for the Internet of Things (Virtual Event, Canada) (M4IoT ’21). Association for Computing Machinery, New York, NY, USA, 14–17. https://doi.org/10.1145/3493369.3493602
- A Research Perspective on Fog Computing. In Proceedings of the 2nd Workshop on IoT Systems Provisioning & Management for Context-Aware Smart Cities (Malaga, Spain) (ISYCC 2017). Springer, Cham, Switzerland, 198–210. https://doi.org/10.1007/978-3-319-91764-1_16
- Fog Computing and its Role in the Internet of Things. In Proceedings of the First Edition of the MCC Workshop on Mobile Cloud Computing (Helsinki, Finland) (MCC ’12). Association for Computing Machinery, New York, NY, USA, 13–16. https://doi.org/10.1145/2342509.2342513
- Performance Analysis of Object Store Systems in a Fog/Edge Computing Infrastructures. In Proceedings of the 2016 IEEE International Conference on Cloud Computing Technology and Science (Luxembourg, Luxembourg) (CloudCom). IEEE, New York, NY, USA, 294–301. https://doi.org/10.1109/CloudCom.2016.0055
- An Object Store Service for a Fog/Edge Computing Infrastructure Based on IPFS and a Scale-Out NAS. In Proceedings of the 2017 IEEE 1st International Conference on Fog and Edge Computing (Madrid, Spain) (ICFEC). IEEE, New York, NY, USA, 41–50. https://doi.org/10.1109/ICFEC.2017.13
- A Fog Storage Software Architecture for the Internet of Things. In Advances in Edge Computing: Massive Parallel Processing and Applications. IOS Press.
- Amir Vahid Dastjerdi and Rajkumar Buyya. 2016. Fog Computing: Helping the Internet of Things Realize Its Potential. Computer 49, 8 (Aug. 2016), 112–116. https://doi.org/10.1109/MC.2016.245
- Dynamo: Amazon’s Highly Available Key-value Store. Operating Systems Review 41, 6 (Oct. 2007), 205–220. https://doi.org/10.1145/1323293.1294281
- Above the Clouds: A Berkeley View of Cloud Computing. Technical Report. Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Rep. UCB/EECS, Berkeley, CA, USA.
- Challenges and Opportunities for Efficient Serverless Computing at the Edge. In Proceedings of the 2019 38th Symposium on Reliable Distributed Systems (Lyon, France) (SRDS). IEEE, New York, NY, USA, 261–2615. https://doi.org/10.1109/SRDS47363.2019.00036
- NanoLambda: Implementing Functions as a Service at All Resource Scales for the Internet of Things. In Proceedings of the 2020 IEEE/ACM Symposium on Edge Computing (San Jose, CA, USA) (SEC ’20). IEEE, New York, NY, USA, 220–231. https://doi.org/10.1109/SEC50012.2020.00035
- The Google File Systemn. In Proceedings of the nineteenth ACM symposium on Operating systems principles (Bolton Landing, NY, USA) (SOSP ’03). Association for Computing Machinery, New York, NY, USA, 29–43. https://doi.org/10.1145/945445.945450
- Public Video Surveillance: Using the Fog to Increase Privacy. In Proceedings of the 5th Workshop on Middleware and Applications for the Internet of Things (Rennes, France) (M4IoT ’18). Association for Computing Machinery, New York, NY, USA, 11–14. https://doi.org/10.1145/3286719.3286722
- Harshit Gupta and Umakishore Ramachandran. 2018. FogStore: A Geo-Distributed Key-Value Store Guaranteeing Low Latency for Strongly Consistent Access. In Proceedings of the 12th ACM International Conference on Distributed and Event-based Systems (Hamilton, New Zealand) (DEBS ’18). Association for Computing Machinery, New York, NY, USA, 148–159. https://doi.org/10.1145/3210284.3210297
- Datafog: Towards a Holistic Data Management Platform for the IoT Age at the Network Edge. In Proceedings of the USENIX Workshop on Hot Topics in Edge Computing (Boston, MA, USA) (HotEdge ’18). USENIX Association, Berkeley, CA, USA.
- Jonathan Hasenburg and David Bermbach. 2019. Towards Geo-Context Aware IoT Data Distribution. In Proceedings of the 4th Workshop on IoT Systems Provisioning & Management for Context-Aware Smart Cities (Toulouse, France) (ISYCC 2019). Springer, Cham, Switzerland, 111–121. https://doi.org/10.1007/978-3-030-45989-5_9
- Towards A Replication Service for Data-Intensive Fog Applications. In Proceedings of the 35th ACM Symposium on Applied Computing, Posters Track (Brno, Czech Republic) (SAC ’20). Association for Computing Machinery, New York, NY, USA, 267–270. https://doi.org/10.1145/3341105.3374060
- Managing Latency and Excess Data Dissemination in Fog-Based Publish/Subscribe Systems. In Proceedings of the Second IEEE International Conference on Fog Computing (Sydney, NSW, Australia) (ICFC 2020). IEEE, New York, NY, USA, 9–16. https://doi.org/10.1109/ICFC49376.2020.00010
- Rearchitecting Kubernetes for the Edge. In Proceedings of the 4th International Workshop on Edge Systems, Analytics and Networking (Online, United Kingdom) (EdgeSys ’21). Association for Computing Machinery, New York, NY, USA, 7–12. https://doi.org/10.1145/3434770.3459730
- OceanStore: An Architecture for Global-scale Persistent Storage. Operating Systems Review 28, 5 (Nov. 2000), 190–201. https://doi.org/10.1145/384264.379239
- Cloud Federation. In Proceedings of the 2nd International Conference on Cloud Computing, GRIDs, and Virtualization (Rome, Italy) (CLOUD COMPUTING 2011). IARIA, 32–38.
- Avinash Lakshman and Prashant Malik. 2010. Cassandra: A Decentralized Structured Storage System. Operating Systems Review 44, 2 (April 2010), 35–40. https://doi.org/10.1145/1773912.1773922
- Enhancing Edge Computing with Database Replication. In Proceedings of the 26th IEEE International Symposium on Reliable Distributed Systems (Beijing, China) (SRDS 2007). IEEE, New York, NY, USA, 45–54. https://doi.org/10.1109/SRDS.2007.10
- Kinesis: A New Approach to Replica Placement in Distributed Storage Systems. Transactions on Storage 4, 4, Article 11 (Feb. 2009), 28 pages. https://doi.org/10.1145/1480439.1480440
- FogStore: Toward a Distributed Data Store for Fog Computing. In Proceedings of the 2017 IEEE Fog World Congress (Santa Clara, CA, USA) (FWC). IEEE, New York, NY, USA, 1–6. https://doi.org/10.1109/FWC.2017.8368524
- Petar Maymounkov and David Mazières. 2002. Kademlia: A Peer-to-Peer Information System Based on the XOR Metric. In Proceedings of the International Workshop on Peer-to-Peer Systems (Cambridge, MA, USA) (IPTPS 2002). Springer, Berlin, Germany, 53–65. https://doi.org/10.1007/3-540-45748-8_5
- Pruning Edge Research with Latency Shears. In Proceedings of the 19th ACM Workshop on Hot Topics in Networks (Virtual Event, USA) (HotNets ’20). Association for Computing Machinery, New York, NY, USA, 182–189. https://doi.org/10.1145/3422604.3425943
- Toward a Global Data Infrastructure. IEEE Internet Computing 20, 3 (March 2016), 54–62. https://doi.org/10.1109/MIC.2016.51
- SessionStore: A Session-Aware Datastore for the Edge. In Proceedings of the 2020 IEEE 4th International Conference on Fog and Edge Computing (Melbourne, Australia) (ICFEC). IEEE, New York, NY, USA, 59–68. https://doi.org/10.1109/ICFEC50348.2020.00014
- Feather: Hierarchical Querying for the Edge. In Proceedings of the 2020 IEEE/ACM Symposium on Edge Computing (SEC) (Virtual Event). New York, NY, USA, 271–284.
- Cloudpath: A Multi-Tier Cloud Computing Framework. In Proceedings of the Second ACM/IEEE Symposium on Edge Computing (San Jose, CA, USA) (SEC ’17). Association for Computing Machinery, New York, NY, USA, 1–13. https://doi.org/10.1145/3132211.3134464
- Mohammed Islam Naas. 2019. iFogStorC: a Heuristic for Managing IoT Data Replication Storage and Consistency in a Fog Infrastructures. In Proceedings of the 5th Workshop on Performance and Scalability of Storage Systems (INRIA Bordeaux, Talence, France) (Per3S).
- iFogStor: An IoT Data Placement Strategy for Fog Infrastructure. In Proceedings of the 2017 IEEE 1st International Conference on Fog and Edge Computing (Madrid, Spain) (ICFEC). IEEE, New York, NY, USA, 97–104. https://doi.org/10.1109/ICFEC.2017.15
- Data Management in Fog Computing. In Fog and Edge Computing: Principles and Paradigms. Wiley, 171–190.
- Fog Computing as Privacy Enabler. IEEE Internet Computing 24, 4 (March 2020), 15–21. https://doi.org/10.1109/MIC.2020.2979161
- Tobias Pfandzelter and David Bermbach. 2019. IoT Data Processing in the Fog: Functions, Streams, or Batch Processing?. In Proceedings of the 1st Workshop on Efficient Data Movement in Fog Computing (Prague, Czech Republic) (DaMove 2019). IEEE, New York, NY, USA, 201–206. https://doi.org/10.1109/ICFC.2019.00033
- Tobias Pfandzelter and David Bermbach. 2020. tinyFaaS: A Lightweight FaaS Platform for Edge Environments. In Proceedings of the Second IEEE International Conference on Fog Computing (Sydney, NSW, Australia) (ICFC 2020). IEEE, New York, NY, USA, 17–24. https://doi.org/10.1109/ICFC49376.2020.00011
- Tobias Pfandzelter and David Bermbach. 2021a. Edge (of the Earth) Replication: Optimizing Content Delivery in Large LEO Satellite Communication Networks. In Proceedings of the 21st IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (Melbourne, Australia) (CCGrid ’21). IEEE, New York, NY, USA, 565–575. https://doi.org/10.1109/CCGrid51090.2021.00066
- Tobias Pfandzelter and David Bermbach. 2021b. Towards Predictive Replica Placement for Distributed Data Stores in Fog Environments. In Proceedings of the 9th IEEE International Conference on Cloud Engineering, Posters (San Francisco, CA, USA) (IC2E 2021). IEEE, New York, NY, USA, 280–281. https://doi.org/10.1109/IC2E52221.2021.00047
- From Zero to Fog: Efficient Engineering of Fog-Based Internet of Things Applications. Software: Practice and Experience 51, 8 (June 2021), 1798–1821. https://doi.org/10.1002/spe.3003
- Towards Distributed Coordination for Fog Platforms. In Proceedings of the 22nd IEEE/ACM international Symposium on Cluster, Cloud and Internet Computing, Posters (Taormina, Italy) (CCGrid 2021). IEEE, New York, NY, USA, 760–762. https://doi.org/10.1109/CCGrid54584.2022.00087
- Information Logistics and Fog Computing: The DITAS Approach. In Proceedings of the Forum and Doctoral Consortium Papers Presented at the 29th International Conference on Advanced Information Systems Engineering (Essen, Germany) (CAiSE 2017). CEUR Workshop Proceedings, Aachen, Germany, 129–136.
- Mobile Data Repositories at the Edge. In Proceedings of the USENIX Workshop on Hot Topics in Edge Computing (Boston, MA, USA) (HotEdge ’18). USENIX Association, Berkeley, CA, USA.
- Fang-Yu Rao and Elisa Bertino. 2019. Privacy Techniques for Edge Computing Systems. Procedings of the IEEE 107, 8 (June 2019), 1632–1654. https://doi.org/10.1109/JPROC.2019.2918749
- Synthesizing Plausible Infrastructure Configurations for Evaluating Edge Computing Systems. In Proceedings of the 3rd USENIX Workshop on Hot Topics in Edge Computing (Virtual Event) (HotEdge ’20). USENIX Association, Berkeley, CA, USA.
- Antony Rowstron and Peter Druschel. 2001a. Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems. In Proceedings of the IFIP/ACM International Conference on Distributed Systems Platforms and Open Distributed Processing (Berlin, Germany) (Middleware 2001). Springer, New York, NY, USA, 329–350. https://doi.org/10.1007/3-540-45518-3_18
- Antony Rowstron and Peter Druschel. 2001b. Storage Management and Caching in PAST, a Large-Scale, Persistent Peer-to-Peer Storage Utility. In Proceedings of the Eighteenth ACM Symposium on Operating Systems Principles (Banff, Alberta, Canada) (SOSP ’01). Association for Computing Machinery, New York, NY, USA, 188–201. https://doi.org/10.1145/502034.502053
- Nebula: Distributed Edge Cloud for Data Intensive Computing. In Proceedings of the 2014 IEEE International Conference on Cloud Engineering (Boston, MA, USA) (IC2E). IEEE, New York, NY, USA, 57–66. https://doi.org/10.1109/IC2E.2014.34
- Edge Computing: Vision and Challenges. IEEE Internet of Things Journal 3, 5 (June 2016), 637–646. https://doi.org/10.1109/JIOT.2016.2579198
- Weisong Shi and Schahram Dustdar. 2016. The Promise of Edge Computing. Computer 49, 5 (May 2016), 78–81. https://doi.org/10.1109/MC.2016.145
- Hourglass: An Infrastructure for Connecting Sensor Networks and Applications. Technical Report. Harvard University, Cambridge, MA, USA.
- The Hadoop Distributed File System. In Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (Incline Village, NV, USA) (MSST). IEEE, New York, NY, USA, 1–10. https://doi.org/10.1109/MSST.2010.5496972
- Chord: A Scalable Peer-To-Peer Lookup Service for Internet Applications. SIGCOMM Computer Communication Review 31, 4 (Aug. 2001), 149–160. https://doi.org/10.1145/964723.383071
- Serving Large-Scale Batch Computed Data with Project Voldemort. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST ’12) (San Jose, CA, USA). New York, NY, USA, 18–18.
- Andrew S Tanenbaum and Maarten van Steen. 2017. Distributed Systems: Principles and Paradigms, 3rd ed. https://distributed-systems.net.
- Sharing and Caring of Data at the Edge. In Proceedings of the 3rd USENIX Workshop on Hot Topics in Edge Computing (Online) (HotEdge ’20). USENIX Association, Berkeley, CA, USA.
- Werner Vogels. 2009. Eventually Consistent. Commun. ACM 52, 1 (Jan. 2009), 40–44. https://doi.org/10.1145/1435417.1435432
- Ceph: A Scalable, High-performance Distributed File System. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation (Seattle, WA, USA) (OSDI ’06). USENIX Association, Berkeley, CA, USA, 307–320.
- CRUSH: Controlled, Scalable, Decentralized Placement of Replicated Data. In Proceedings of the 2006 ACM/IEEE Conference on Supercomputing (Tampa, FL, USA) (SC ’06). Association for Computing Machinery, New York, NY, USA, 31–31. https://doi.org/10.1145/1188455.1188582
- The Cloud Is Not Enough: Saving IoT From the Cloud. In Proceedings of the 7th USENIX Workshop on Hot Topics in Cloud Computing (Santa Clara, CA, USA) (HotCloud ’15). USENIX Association, Berkeley, CA, USA.