
Pivoting Retail Supply Chain with Deep Generative Techniques: Taxonomy, Survey and Insights

Published 29 Feb 2024 in cs.AI and cs.LG | (2403.00861v1)

Abstract: Generative AI applications, such as ChatGPT and DALL-E, have shown the world their impressive capabilities in generating human-like text and images. At the core of these applications are Deep Generative Models (DGMs), which are designed to learn the underlying distribution of the data and generate new data points that are statistically similar to the original dataset. This raises a critical question: how can we leverage DGMs in the modern retail supply chain? To address this question, this paper aims to provide a comprehensive review of DGMs and discuss their existing and potential use cases in the retail supply chain, by (1) providing a taxonomy and overview of state-of-the-art DGMs and their variants, (2) reviewing existing DGM applications in the retail supply chain from an end-to-end point of view, and (3) discussing insights and potential directions on how DGMs can be further utilized to solve retail supply chain problems.
