Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
175 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Optimizing accuracy and diversity: a multi-task approach to forecast combinations (2310.20545v2)

Published 31 Oct 2023 in cs.LG, math.OC, and stat.ML

Abstract: Forecast combination involves using multiple forecasts to create a single, more accurate prediction. Recently, feature-based forecasting has been employed to either select the most appropriate forecasting models or to optimize the weights of their combination. In this paper, we present a multi-task optimization paradigm that focuses on solving both problems simultaneously and enriches current operational research approaches to forecasting. In essence, it incorporates an additional learning and optimization task into the standard feature-based forecasting approach, focusing on the identification of an optimal set of forecasting methods. During the training phase, an optimization model with linear constraints and quadratic objective function is employed to identify accurate and diverse methods for each time series. Moreover, within the training phase, a neural network is used to learn the behavior of that optimization model. Once training is completed the candidate set of methods is identified using the network. The proposed approach elicits the essential role of diversity in feature-based forecasting and highlights the interplay between model combination and model selection when optimizing forecasting ensembles. Experimental results on a large set of series from the M4 competition dataset show that our proposal enhances point forecast accuracy compared to state-of-the-art methods.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (61)
  1. Combination of long term and short term forecasts, with application to tourism demand forecasting. International Journal of Forecasting, 27, 870–886.
  2. Armstrong, J. S. (2001). Combining forecasts. In Principles of Forecasting: A Handbook for Researchers and Practitioners (pp. 417–439). Boston, MA: Springer US.
  3. The theta model: a decomposition approach to forecasting. International Journal of Forecasting, 16, 521–530.
  4. Atiya, A. F. (2020). Why does forecast combination work so well? International Journal of Forecasting, 36, 197–200.
  5. The combination of forecasts. Journal of the Operational Research Society, 20, 451–468.
  6. Bunn, D. W. (1988). Combining forecasts. European Journal of Operational Research, 33, 223–229.
  7. A combination selection algorithm on forecasting. European Journal of Operational Research, 234, 127–139.
  8. Caruana, R. (1997). Multitask learning. Machine Learning, 28, 41–75.
  9. Addressing imbalance in multilabel classification: Measures and random resampling algorithms. Neurocomputing, 163, 3–16.
  10. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining KDD ’16 (p. 785–794). New York, NY, USA: Association for Computing Machinery.
  11. Clemen, R. T. (1989). Combining forecasts: A review and annotated bibliography. International Journal of Forecasting, 5, 559–583.
  12. STL: A seasonal-trend decomposition. Journal of Official Statistics, 6, 3–73.
  13. Forecasting time series with complex seasonal patterns using exponential smoothing. Journal of the American Statistical Association, 106, 1513–1527.
  14. Review of guidelines for the use of combined forecasts. European Journal of Operational Research, 120, 190–204.
  15. Di Gangi, L. (2022). Sparse convex combinations of forecasting models by meta learning. Expert Systems with Applications, 200, 116938.
  16. Forecasting and operational research: a review. Journal of the Operational Research Society, 59, 1150–1172.
  17. Optimization problems for machine learning: A survey. European Journal of Operational Research, 290, 807–828.
  18. Combining density forecasts. International Journal of Forecasting, 23, 1–13.
  19. Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 7132–7141).
  20. forecast: Forecasting functions for time series and linear models. URL: https://pkg.robjhyndman.com/forecast/ R package version 8.21.
  21. Hyndman, R. J. (2020). A brief history of forecasting competitions. International Journal of Forecasting, 36, 7–14.
  22. Automatic time series forecasting: the forecast package for R. Journal of Statistical Software, 27, 1–22.
  23. Another look at measures of forecast accuracy. International Journal of Forecasting, 22, 679–688.
  24. A state space framework for automatic forecasting using exponential smoothing methods. International Journal of Forecasting, 18, 439–454.
  25. Simple robust averages of forecasts: Some empirical results. International Journal of Forecasting, 24, 163–169.
  26. Forecast with forecasts: Diversity matters. European Journal of Operational Research, 301, 180–190.
  27. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, .
  28. The M3 competition: Statistical tests of the results. International Journal of Forecasting, 21, 397–409.
  29. Kourentzes, N. (2022). tsutils: Time Series Exploration, Modelling and Forecasting. URL: https://CRAN.R-project.org/package=tsutils r package version 0.9.3.
  30. Cross-temporal coherent forecasts for australian tourism. Annals of Tourism Research, 75, 393–409.
  31. Another look at forecast selection and combination: Evidence from forecast pooling. International Journal of Production Economics, 209, 226–235.
  32. Deep learning in business analytics and operations research: Models, applications and managerial implications. European Journal of Operational Research, 281, 628–641.
  33. Neural network ensembles, cross validation, and active learning. Advances in Neural Information Processing Systems, 7.
  34. Meta-learning for time series forecasting and forecast combination. Neurocomputing, 73, 2006–2016.
  35. Forecasting with time series imaging. Expert Systems with Applications, 160, 113680.
  36. A survey of convolutional neural networks: analysis, applications, and prospects. IEEE Transactions on Neural Networks and Learning Systems, .
  37. Why do some combinations perform better than others? International Journal of Forecasting, 36, 142–149.
  38. Nonpooling convolutional neural network forecasting for seasonal time series with trends. IEEE Transactions on Neural Networks and Learning Systems, 31, 2879–2888.
  39. Retail sales forecasting with meta-learning. European Journal of Operational Research, 288, 111–128.
  40. The accuracy of extrapolation (time series) methods: Results of a forecasting competition. Journal of Forecasting, 1, 111–153.
  41. The m4 competition: 100,000 time series and 61 forecasting methods. International Journal of Forecasting, 36, 54–74.
  42. A machine learning approach for forecasting hierarchical time series. Expert Systems with Applications, 182, 115102.
  43. Fforma: Feature-based forecast model averaging. International Journal of Forecasting, 36, 86–92.
  44. M4comp2018: Data from the M4-Competition. R package version 0.2.0.
  45. Nikolopoulos, K. (2021). We need to talk about intermittent demand forecasting. European Journal of Operational Research, 291, 549–559.
  46. Forecasting: theory and practice. International Journal of Forecasting, 38, 705–871.
  47. Exploring the sources of uncertainty: Why does bagging for time series forecasting work? European Journal of Operational Research, 268, 545–554.
  48. Meta-learning approaches to selecting time series models. Neurocomputing, 61, 121–137.
  49. Quadratic programming feature selection. Journal of Machine Learning Research, .
  50. Grad-cam: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE international conference on computer vision (pp. 618–626).
  51. Image-based time series forecasting: A deep convolutional neural network approach. Neural Networks, 157, 39–53.
  52. Generalizing the theta method for automatic forecasting. European Journal of Operational Research, 284, 550–558.
  53. Supply chain forecasting: Theory, practice, their gap and the future. European Journal of Operational Research, 252, 1–26.
  54. Forecasting for inventory planning: a 50-year review. Journal of the Operational Research Society, 60, S149–S160.
  55. Meta-learning how to forecast time series. Monash Econometrics and Business Statistics Working Papers, 6, 16.
  56. A review of methods for imbalanced multi-label classification. Pattern Recognition, 118, 107965.
  57. Taylor, J. W. (2017). Probabilistic forecasting of wind power ramp events using autoregressive logit models. European Journal of Operational Research, 259, 703–712.
  58. Timmermann, A. (2006). Forecast combinations. Handbook of Economic Forecasting, 1, 135–196.
  59. Financial risk forecasting with nonlinear dynamics and support vector regression. Journal of the operational research society, 60, 685–695.
  60. Sensitivity of weights in combining forecasts. Operations Research, 40, 609–614.
  61. A survey on multi-task learning. IEEE Transactions on Knowledge and Data Engineering, 34, 5586–5609.

Summary

We haven't generated a summary for this paper yet.