Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 134 tok/s
Gemini 2.5 Pro 41 tok/s Pro
GPT-5 Medium 28 tok/s Pro
GPT-5 High 42 tok/s Pro
GPT-4o 92 tok/s Pro
Kimi K2 187 tok/s Pro
GPT OSS 120B 431 tok/s Pro
Claude Sonnet 4.5 37 tok/s Pro
2000 character limit reached

Automated Demand Forecasting in small to medium-sized enterprises (2412.20420v1)

Published 29 Dec 2024 in econ.EM

Abstract: In response to the growing demand for accurate demand forecasts, this research proposes a generalized automated sales forecasting pipeline tailored for small- to medium-sized enterprises (SMEs). Unlike large corporations with dedicated data scientists for sales forecasting, SMEs often lack such resources. To address this, we developed a comprehensive forecasting pipeline that automates time series sales forecasting, encompassing data preparation, model training, and selection based on validation results. The development included two main components: model preselection and the forecasting pipeline. In the first phase, state-of-the-art methods were evaluated on a showcase dataset, leading to the selection of ARIMA, SARIMAX, Holt-Winters Exponential Smoothing, Regression Tree, Dilated Convolutional Neural Networks, and Generalized Additive Models. An ensemble prediction of these models was also included. Long-Short-Term Memory (LSTM) networks were excluded due to suboptimal prediction accuracy, and Facebook Prophet was omitted for compatibility reasons. In the second phase, the proposed forecasting pipeline was tested with SMEs in the food and electric industries, revealing variable model performance across different companies. While one project-based company derived no benefit, others achieved superior forecasts compared to naive estimators. Our findings suggest that no single model is universally superior. Instead, a diverse set of models, when integrated within an automated validation framework, can significantly enhance forecasting accuracy for SMEs. These results emphasize the importance of model diversity and automated validation in addressing the unique needs of each business. This research contributes to the field by providing SMEs access to state-of-the-art sales forecasting tools, enabling data-driven decision-making and improving operational efficiency.

Summary

  • The paper presents a generalized, automated demand forecasting pipeline designed to enable small to medium-sized enterprises (SMEs) without dedicated data scientists to implement advanced sales forecasting.
  • The proposed pipeline integrates automated data preparation, training across diverse models (including statistical, tree-based, deep learning, and GAM), and selection of the best performing model based on validation results.
  • Testing with SMEs showed that model performance varied significantly depending on the company's data, that no single model was universally best, and that automated selection improved accuracy for some, with HWES surprisingly often recommended in practice.

This paper presents a generalized, automated demand forecasting pipeline designed for small to medium-sized enterprises (SMEs) that lack the resources of dedicated data scientists. The core idea is to automate time series sales forecasting, encompassing data preparation, model training, and model selection based on validation results, thereby providing SMEs with an advanced planning tool.

Problem Statement:

SMEs often lack the resources to develop and implement customized sales forecasting methods, hindering their ability to optimize production, purchasing, and logistics. Existing ERP systems often lack sophisticated forecasting tools that can be easily used by individuals without machine learning expertise.

Proposed Solution:

The authors propose an automated demand forecasting pipeline comprising two main phases: model preselection and forecasting pipeline development.

  • Model Preselection: Several state-of-the-art methods were tested on a showcase dataset to identify suitable candidates for the pipeline. The showcase dataset contained sales data for 51 products over a period of up to 6 years. The best six performing models (ARIMA, SARIMAX, Holt-Winters Exponential Smoothing, Regression Tree, Dilated Convolutional Neural Network, and Generalized Additive Model) were selected for the forecasting pipeline. Long-Short-Term Memory (LSTM) was excluded due to insufficient prediction accuracy for the 18-month horizon, and Facebook Prophet was excluded due to compatibility issues with the production environment despite its robustness.
  • Forecasting Pipeline: The proposed forecasting pipeline was tested with SMEs in the food and electric industries. The pipeline automates the process of training and evaluating different forecasting models on the SME's sales data, then recommends the model with the best validation performance.

Methods:

The paper employs a variety of time series forecasting methods:

  • Classical Statistical Models: ARIMA, SARIMAX, Holt-Winters Exponential Smoothing (HWES).
  • Tree-Based Methods: Regression Tree (XGBoost, Random Forest).
  • Deep Learning: Dilated Convolutional Neural Networks (CNN).
  • Generalized Additive Model (GAM): A decompositional GAM is developed, incorporating linear and exponential trends, seasonality modeled using Fourier transformation, and user-specific external features. Lasso penalization is used for coefficient estimation.
  • Ensemble Methods: The pipeline includes an ensemble prediction of the above models, using a simple mean or median function to aggregate the predictions.

The pipeline includes data preprocessing steps such as validity checks and formatting for weekly and monthly sales data. Model validation is performed using a one-year holdout period, and models are evaluated using Root Mean Squared Error (RMSE). After validation, all models are retrained using the full dataset, and forecasts are generated for the next 18 months or 78 weeks.

Experimental Setup and Results:

  • Model Preselection Phase: 33 different models and specifications were compared. The best performing models were primarily deep learning approaches (Dilated CNN, CNN), followed by XGBoost.
  • User Testing Phase: The automated pipeline was tested with five companies from the food and electric industries. The baseline model for comparison was a naive model that predicted future values based on the mean of previous years. The model's performance was evaluated by comparing its predictions to the actual sales data from 2023, using normalized RMSE (nRMSE) and MAPE.
  • Results:
    • The model performance varied significantly across different companies. For a company with project-based sales, the pipeline provided no benefit, while other companies experienced more realistic and superior sales forecasts compared to naive estimators.
    • No single model was universally superior. The best model for each company depended on the specific characteristics of its sales data.
    • Automated model selection based on internal validation significantly enhanced forecasting accuracy for some SMEs, but not for all. The Wilcoxon signed-rank test showed that the recommended models performed significantly better than the naive estimator in four out of five companies.
    • In practice, \acs{HWES} was recommended dominantly and showed the best model performance, which was not reflected in prior studies.
    • The CNNs that performed well in the preselection phase, did not perform as well in the end-to-end pipeline evaluation.

Contributions:

  • A generalized automated demand forecasting pipeline suitable for SMEs.
  • Integration of automatic validation and model selection.
  • An evaluation framework that allows the pipeline to adapt to various business contexts.
  • Demonstration of the importance of model diversity.
  • Evidence of the advantages of automatic validation to address the unique needs of each business.

Limitations and Future Work:

  • Data quality variations in real-world ERP systems can significantly impact model performance.
  • Long prediction horizons increase bias and can affect the validation process.
  • Future work could include additional features (marketing expenses, market growth), improved data preprocessing and validation techniques, and exploration of robust model selection approaches.
  • Future works could also focus on data from rare out-of-stock events or delayed deliveries are not collected, but might influence the practical relevance. With that, the data may not match the actual demand.

Conclusion:

The research provides a thorough assessment of an automated demand forecasting pipeline for SMEs, demonstrating its potential to address key real-world challenges. It offers valuable insights into model performance and introduces a viable approach for automated model evaluation. It is a step towards more efficient and scalable forecasting solutions.

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets

This paper has been mentioned in 1 tweet and received 0 likes.

Upgrade to Pro to view all of the tweets about this paper:

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube