Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DoubleAdapt: A Meta-learning Approach to Incremental Learning for Stock Trend Forecasting (2306.09862v3)

Published 16 Jun 2023 in q-fin.ST, cs.AI, cs.CE, cs.LG, and q-fin.CP

Abstract: Stock trend forecasting is a fundamental task of quantitative investment where precise predictions of price trends are indispensable. As an online service, stock data continuously arrive over time. It is practical and efficient to incrementally update the forecast model with the latest data which may reveal some new patterns recurring in the future stock market. However, incremental learning for stock trend forecasting still remains under-explored due to the challenge of distribution shifts (a.k.a. concept drifts). With the stock market dynamically evolving, the distribution of future data can slightly or significantly differ from incremental data, hindering the effectiveness of incremental updates. To address this challenge, we propose DoubleAdapt, an end-to-end framework with two adapters, which can effectively adapt the data and the model to mitigate the effects of distribution shifts. Our key insight is to automatically learn how to adapt stock data into a locally stationary distribution in favor of profitable updates. Complemented by data adaptation, we can confidently adapt the model parameters under mitigated distribution shifts. We cast each incremental learning task as a meta-learning task and automatically optimize the adapters for desirable data adaptation and parameter initialization. Experiments on real-world stock datasets demonstrate that DoubleAdapt achieves state-of-the-art predictive performance and shows considerable efficiency.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (38)
  1. TaskNorm: Rethinking Batch Normalization for Meta-learning. In International Conference on Machine Learning. PMLR, 1153–1164.
  2. Online Fast Adaptation and Knowledge Accumulation (OSAKA): a New Approach to Continual Learning. In NeurIPS. https://proceedings.neurips.cc/paper/2020/hash/c0a271bc0ecb776a094786474322cb82-Abstract.html
  3. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. (Dec. 2014). arXiv:1412.3555 [cs.NE]
  4. AdaRNN: Adaptive Learning and Forecasting for Time Series. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. ACM. https://doi.org/10.1145/3459637.3482315
  5. MetaNorm: Learning to Normalize Few-Shot Batches Across Domains. In International Conference on Learning Representations. https://openreview.net/forum?id=9z_dNsC4B5t
  6. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. In Proceedings of the 34th International Conference on Machine Learning - Volume 70 (Sydney, NSW, Australia) (ICML’17). JMLR.org, 1126–1135.
  7. Online Meta-Learning. In Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 97), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). PMLR, 1920–1930. https://proceedings.mlr.press/v97/finn19a.html
  8. A survey on concept drift adaptation. Comput. Surveys 46, 4 (apr 2014), 1–37. https://doi.org/10.1145/2523813
  9. Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. https://doi.org/10.1109/cvpr.2016.90
  10. Task Agnostic Continual Learning via Meta Learning. (June 2019). arXiv:1906.05201 [stat.ML]
  11. Task Agnostic Continual Learning via Meta Learning. In 4th Lifelong Machine Learning Workshop at ICML 2020. https://openreview.net/forum?id=AeIzVxdJgeb
  12. Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-Term Memory. Neural Computation 9, 8 (nov 1997), 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
  13. Stock Trend Prediction with Multi-granularity Data: A Contrastive Learning Approach with Adaptive Fusion. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. ACM. https://doi.org/10.1145/3459637.3482483
  14. Listening to Chaotic Whispers: A Deep Learning Framework for News-oriented Stock Trend Prediction. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining. ACM. https://doi.org/10.1145/3159652.3159690
  15. Normalizing Flows: An Introduction and Review of Current Methods. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 11 (nov 2021), 3964–3979. https://doi.org/10.1109/tpami.2020.2992934
  16. Stock Price Prediction Using Attention-based Multi-Input LSTM. In Asian Conference on Machine Learning.
  17. DDG-DA: Data Distribution Generation for Predictable Concept Drift Adaptation. Proceedings of the AAAI Conference on Artificial Intelligence 36, 4 (jun 2022), 4092–4100. https://doi.org/10.1609/aaai.v36i4.20327
  18. Individualized Indicator for All: Stock-wise Technical Indicator Optimization with Stock Embedding. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM. https://doi.org/10.1145/3292500.3330833
  19. Learning Multiple Stock Trading Patterns with Temporal Routing Adaptor and Optimal Transport. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. ACM. https://doi.org/10.1145/3447548.3467358
  20. Alleviating catastrophic forgetting using context-dependent gating and synaptic stabilization. Proceedings of the National Academy of Sciences 115, 44 (2018), E10467–E10475. https://doi.org/10.1073/pnas.1803839115 arXiv:https://www.pnas.org/doi/pdf/10.1073/pnas.1803839115
  21. Deep Online Learning Via Meta-Learning: Continual Adaptation for Model-Based RL. In International Conference on Learning Representations. https://openreview.net/forum?id=HyxAfnA5tm
  22. Meta Pseudo Labels. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2020), 11552–11563.
  23. A Dual-Stage Attention-Based Recurrent Neural Network for Time Series Prediction. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence. International Joint Conferences on Artificial Intelligence Organization. https://doi.org/10.24963/ijcai.2017/366
  24. Instance Normalization: The Missing Ingredient for Fast Stylization. ArXiv abs/1607.08022 (2016).
  25. Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research (JMLR) 9 (2008), 2579–2605. www.jmlr.org/papers/v9/vandermaaten08a.html
  26. Attention is All you Need. ArXiv abs/1706.03762 (2017).
  27. Ronald J. Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8 (1992), 229–256.
  28. HIST: A Graph-based Framework for Stock Trend Forecasting via Mining Concept-Oriented Shared Information. ArXiv abs/2110.13716 (2021).
  29. REST: Relational Event-driven Stock Trend Forecasting. In Proceedings of the Web Conference 2021. ACM. https://doi.org/10.1145/3442381.3450032
  30. Qlib: An AI-oriented Quantitative Investment Platform. ArXiv abs/2009.11189 (2020).
  31. Accurate Multivariate Stock Movement Prediction via Data-Axis Transformer with Multi-Level Contexts. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. ACM. https://doi.org/10.1145/3447548.3467297
  32. ROLAND: Graph Learning Framework for Dynamic Graphs. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (Washington DC, USA) (KDD ’22). Association for Computing Machinery, New York, NY, USA, 2358–2366. https://doi.org/10.1145/3534678.3539300
  33. Learning to Learn the Future: Modeling Concept Drift in Time Series Prediction. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. ACM. https://doi.org/10.1145/3459637.3482271
  34. Meta-Adaptive Stock Movement Prediction with Two-Stage Representation Learning. In NeurIPS 2022 Workshop on Distribution Shifts: Connecting Methods and Applications. https://openreview.net/forum?id=uf44d5H1vx
  35. Stock Price Prediction via Discovering Multi-Frequency Trading Patterns. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM. https://doi.org/10.1145/3097983.3098117
  36. How to Retrain Recommender System?. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM. https://doi.org/10.1145/3397271.3401167
  37. Forecasting Wavelet Transformed Time Series with Attentive Neural Networks. In 2018 IEEE International Conference on Data Mining (ICDM). IEEE. https://doi.org/10.1109/icdm.2018.00201
  38. Martin A. Zinkevich. 2003. Online Convex Programming and Generalized Infinitesimal Gradient Ascent. In International Conference on Machine Learning.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Lifan Zhao (8 papers)
  2. Shuming Kong (1 paper)
  3. Yanyan Shen (54 papers)
Citations (17)