From Cloud to Edge: Rethinking Generative AI for Low-Resource Design Challenges
Abstract: Generative AI has shown tremendous prospects in all aspects of technology, including design. However, due to its heavy demand on resources, it is usually trained on large computing infrastructure and often made available as a cloud-based service. In this position paper, we consider the potential, challenges, and promising approaches for generative AI for design on the edge, i.e., in resource-constrained settings where memory, compute, energy (battery) and network connectivity may be limited. Adapting generative AI for such settings involves overcoming significant hurdles, primarily in how to streamline complex models to function efficiently in low-resource environments. This necessitates innovative approaches in model compression, efficient algorithmic design, and perhaps even leveraging edge computing. The objective is to harness the power of generative AI in creating bespoke solutions for design problems, such as medical interventions, farm equipment maintenance, and educational material design, tailored to the unique constraints and needs of remote areas. These efforts could democratize access to advanced technology and foster sustainable development, ensuring universal accessibility and environmental consideration of AI-driven design benefits.
- Flamingo: a visual language model for few-shot learning. Advances in Neural Information Processing Systems, 35: 23716–23736.
- BriteLab, P. 2023. Greenhouse Grid Design Task. Accessed: 11-28-2023.
- Artificial Intelligence-Based Fault Diagnosis and Prediction for Smart Farm Information and Communication Technology Equipment. Agriculture, 13(11): 2124.
- LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale. arXiv:2208.07339.
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv:1810.04805.
- Tinyml meets iot: A comprehensive survey. Internet of Things, 16: 100461.
- A Survey of Quantization Methods for Efficient Neural Network Inference. arXiv:2103.13630.
- ImageBind: One Embedding Space To Bind Them All. In CVPR.
- Knowledge Distillation of Large Language Models. arXiv:2306.08543.
- Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes. arXiv:2305.02301.
- Jain, M. 2023. Center for Creative Learning - Toys and Activities. In https://www.ccl.iitgn.ac.in/toys.
- Developing world users as lead users: a case study in engineering reverse innovation. Journal of Mechanical Design, 137(7): 071406.
- Techology trend of edge AI. In 2018 International Symposium on VLSI Design, Automation and Test (VLSI-DAT), 1–2.
- Pushing Large Language Models to the 6G Edge: Vision, Challenges, and Opportunities. arXiv:2309.16739.
- LLM-QAT: Data-Free Quantization Aware Training for Large Language Models. arXiv:2305.17888.
- LLM-Pruner: On the Structural Pruning of Large Language Models. arXiv:2305.11627.
- Simulating the Adoption and Social Impact of Improved Cookstoves in Uganda Using Agent-Based Modeling and Neural Networks. Journal of Mechanical Design, 145(12).
- Edge AI: Systems Design and ML for IoT Data Analytics. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD ’20, 3565–3566. New York, NY, USA: Association for Computing Machinery. ISBN 9781450379984.
- Why the developing world needs mechanical design. Journal of Mechanical Design, 138(7): 070301.
- Nine principles for design for the developing world as derived from the engineering literature. Journal of Mechanical Design, 136(12): 121403.
- Machine Learning at the Network Edge: A Survey. ACM Computing Surveys, 54(8): 1–37.
- OpenAI. 2023. GPT-4 Technical Report. arXiv:2303.08774.
- Training language models to follow instructions with human feedback. arXiv:2203.02155.
- From Concept to Manufacturing: Evaluating Vision-Language Models for Engineering Design. arXiv:2311.12668.
- SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis. arXiv:2307.01952.
- Is TinyML Sustainable? Assessing the Environmental Impacts of Machine Learning on Microcontrollers. arXiv preprint arXiv:2301.11899.
- Zero-Shot Text-to-Image Generation. arXiv:2102.12092.
- Ray, P. P. 2022. A review on TinyML: State-of-the-art and prospects. Journal of King Saud University-Computer and Information Sciences, 34(4): 1595–1623.
- Deep generative models in engineering design: A review. Journal of Mechanical Design, 144(7): 071704.
- Beyond statistical similarity: Rethinking metrics for deep generative models in engineering design. arXiv preprint arXiv:2302.02913.
- High-Resolution Image Synthesis with Latent Diffusion Models. arXiv:2112.10752.
- DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv:1910.01108.
- DesIGN: Design Inspiration from Generative Networks. In Proceedings of the European Conference on Computer Vision (ECCV) Workshops.
- Edge AI: a survey. Internet of Things and Cyber-Physical Systems.
- Combining direct and indirect user data for calculating social impact indicators of products in developing countries. Journal of Mechanical Design, 142(12): 121401.
- A Simple and Effective Pruning Approach for Large Language Models. arXiv:2306.11695.
- Any-to-Any Generation via Composable Diffusion. arXiv preprint arXiv:2305.11846.
- LLaMA: Open and Efficient Foundation Language Models. Cite arxiv:2302.13971.
- Assessing the impact of generative AI on medicinal chemistry. Nature biotechnology, 38(2): 143–145.
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. arXiv:2201.11903.
- Design for the developing world: Common pitfalls and how to avoid them. Journal of Mechanical Design, 138(3): 031101.
- Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models. arXiv:2303.04671.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.