DoubleAdapt: A Meta-learning Approach to Incremental Learning for Stock Trend Forecasting (2306.09862v3)
Abstract: Stock trend forecasting is a fundamental task of quantitative investment where precise predictions of price trends are indispensable. As an online service, stock data continuously arrive over time. It is practical and efficient to incrementally update the forecast model with the latest data which may reveal some new patterns recurring in the future stock market. However, incremental learning for stock trend forecasting still remains under-explored due to the challenge of distribution shifts (a.k.a. concept drifts). With the stock market dynamically evolving, the distribution of future data can slightly or significantly differ from incremental data, hindering the effectiveness of incremental updates. To address this challenge, we propose DoubleAdapt, an end-to-end framework with two adapters, which can effectively adapt the data and the model to mitigate the effects of distribution shifts. Our key insight is to automatically learn how to adapt stock data into a locally stationary distribution in favor of profitable updates. Complemented by data adaptation, we can confidently adapt the model parameters under mitigated distribution shifts. We cast each incremental learning task as a meta-learning task and automatically optimize the adapters for desirable data adaptation and parameter initialization. Experiments on real-world stock datasets demonstrate that DoubleAdapt achieves state-of-the-art predictive performance and shows considerable efficiency.
- TaskNorm: Rethinking Batch Normalization for Meta-learning. In International Conference on Machine Learning. PMLR, 1153–1164.
- Online Fast Adaptation and Knowledge Accumulation (OSAKA): a New Approach to Continual Learning. In NeurIPS. https://proceedings.neurips.cc/paper/2020/hash/c0a271bc0ecb776a094786474322cb82-Abstract.html
- Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. (Dec. 2014). arXiv:1412.3555 [cs.NE]
- AdaRNN: Adaptive Learning and Forecasting for Time Series. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. ACM. https://doi.org/10.1145/3459637.3482315
- MetaNorm: Learning to Normalize Few-Shot Batches Across Domains. In International Conference on Learning Representations. https://openreview.net/forum?id=9z_dNsC4B5t
- Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. In Proceedings of the 34th International Conference on Machine Learning - Volume 70 (Sydney, NSW, Australia) (ICML’17). JMLR.org, 1126–1135.
- Online Meta-Learning. In Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 97), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). PMLR, 1920–1930. https://proceedings.mlr.press/v97/finn19a.html
- A survey on concept drift adaptation. Comput. Surveys 46, 4 (apr 2014), 1–37. https://doi.org/10.1145/2523813
- Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. https://doi.org/10.1109/cvpr.2016.90
- Task Agnostic Continual Learning via Meta Learning. (June 2019). arXiv:1906.05201 [stat.ML]
- Task Agnostic Continual Learning via Meta Learning. In 4th Lifelong Machine Learning Workshop at ICML 2020. https://openreview.net/forum?id=AeIzVxdJgeb
- Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-Term Memory. Neural Computation 9, 8 (nov 1997), 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
- Stock Trend Prediction with Multi-granularity Data: A Contrastive Learning Approach with Adaptive Fusion. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. ACM. https://doi.org/10.1145/3459637.3482483
- Listening to Chaotic Whispers: A Deep Learning Framework for News-oriented Stock Trend Prediction. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining. ACM. https://doi.org/10.1145/3159652.3159690
- Normalizing Flows: An Introduction and Review of Current Methods. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 11 (nov 2021), 3964–3979. https://doi.org/10.1109/tpami.2020.2992934
- Stock Price Prediction Using Attention-based Multi-Input LSTM. In Asian Conference on Machine Learning.
- DDG-DA: Data Distribution Generation for Predictable Concept Drift Adaptation. Proceedings of the AAAI Conference on Artificial Intelligence 36, 4 (jun 2022), 4092–4100. https://doi.org/10.1609/aaai.v36i4.20327
- Individualized Indicator for All: Stock-wise Technical Indicator Optimization with Stock Embedding. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM. https://doi.org/10.1145/3292500.3330833
- Learning Multiple Stock Trading Patterns with Temporal Routing Adaptor and Optimal Transport. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. ACM. https://doi.org/10.1145/3447548.3467358
- Alleviating catastrophic forgetting using context-dependent gating and synaptic stabilization. Proceedings of the National Academy of Sciences 115, 44 (2018), E10467–E10475. https://doi.org/10.1073/pnas.1803839115 arXiv:https://www.pnas.org/doi/pdf/10.1073/pnas.1803839115
- Deep Online Learning Via Meta-Learning: Continual Adaptation for Model-Based RL. In International Conference on Learning Representations. https://openreview.net/forum?id=HyxAfnA5tm
- Meta Pseudo Labels. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2020), 11552–11563.
- A Dual-Stage Attention-Based Recurrent Neural Network for Time Series Prediction. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence. International Joint Conferences on Artificial Intelligence Organization. https://doi.org/10.24963/ijcai.2017/366
- Instance Normalization: The Missing Ingredient for Fast Stylization. ArXiv abs/1607.08022 (2016).
- Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research (JMLR) 9 (2008), 2579–2605. www.jmlr.org/papers/v9/vandermaaten08a.html
- Attention is All you Need. ArXiv abs/1706.03762 (2017).
- Ronald J. Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8 (1992), 229–256.
- HIST: A Graph-based Framework for Stock Trend Forecasting via Mining Concept-Oriented Shared Information. ArXiv abs/2110.13716 (2021).
- REST: Relational Event-driven Stock Trend Forecasting. In Proceedings of the Web Conference 2021. ACM. https://doi.org/10.1145/3442381.3450032
- Qlib: An AI-oriented Quantitative Investment Platform. ArXiv abs/2009.11189 (2020).
- Accurate Multivariate Stock Movement Prediction via Data-Axis Transformer with Multi-Level Contexts. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. ACM. https://doi.org/10.1145/3447548.3467297
- ROLAND: Graph Learning Framework for Dynamic Graphs. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (Washington DC, USA) (KDD ’22). Association for Computing Machinery, New York, NY, USA, 2358–2366. https://doi.org/10.1145/3534678.3539300
- Learning to Learn the Future: Modeling Concept Drift in Time Series Prediction. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. ACM. https://doi.org/10.1145/3459637.3482271
- Meta-Adaptive Stock Movement Prediction with Two-Stage Representation Learning. In NeurIPS 2022 Workshop on Distribution Shifts: Connecting Methods and Applications. https://openreview.net/forum?id=uf44d5H1vx
- Stock Price Prediction via Discovering Multi-Frequency Trading Patterns. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM. https://doi.org/10.1145/3097983.3098117
- How to Retrain Recommender System?. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM. https://doi.org/10.1145/3397271.3401167
- Forecasting Wavelet Transformed Time Series with Attentive Neural Networks. In 2018 IEEE International Conference on Data Mining (ICDM). IEEE. https://doi.org/10.1109/icdm.2018.00201
- Martin A. Zinkevich. 2003. Online Convex Programming and Generalized Infinitesimal Gradient Ascent. In International Conference on Machine Learning.
- Lifan Zhao (8 papers)
- Shuming Kong (1 paper)
- Yanyan Shen (54 papers)