MODL: Multilearner Online Deep Learning (2405.18281v2)
Abstract: Online deep learning tackles the challenge of learning from data streams by balancing two competing goals: fast learning and deep learning. However, existing research primarily emphasizes deep learning solutions, which are more adept at handling the deep'' aspect than the
fast'' aspect of online learning. In this work, we introduce an alternative paradigm through a hybrid multilearner approach. We begin by developing a fast online logistic regression learner, which operates without relying on backpropagation. It leverages closed-form recursive updates of model parameters, efficiently addressing the fast learning component of the online learning challenge. This approach is further integrated with a cascaded multilearner design, where shallow and deep learners are co-trained in a cooperative, synergistic manner to solve the online learning problem. We demonstrate that this approach achieves state-of-the-art performance on standard online learning datasets. We make our code available: https://github.com/AntonValk/MODL
- Omar Abdel Wahab. Intrusion detection in the IoT under data and concept drifts: Online deep learning approach. IEEE J. Internet of Things, 9(20):19706–19716, 2022.
- Efficient methods for online multiclass logistic regression. In Proc. Int. Conf. Alg. Learning Theory (ALT), Online, Mar. 2021.
- Aux-drop: Handling haphazard inputs in online learning using auxiliary dropouts. Trans. Mach. Learn. Research (TMLR), 2023. ISSN 2835-8856.
- Searching for exotic particles in high-energy physics with deep learning. Nature Communications, 5, 2014.
- Adaptive online gradient descent. In Proc. Neural Info. Proces. Sys. (NIPS), page 65–72, Red Hook, NY, USA, Dec. 2007.
- Data preservation in high energy physics. Euro. Phys. J. C, 83:1–41, 2023.
- Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. J. Mach. Learn. Res., 7:2399–2434, 2006.
- Online learning from data streams with varying feature spaces. In Proc. Conf. Artificial Intell. (AAAI), pages 3232–3239, Feb. 2019. ISBN 978-1-57735-809-1.
- Dankmar Böhning. Multinomial logistic regression algorithm. Annals of the Institute of Statistical Mathematics, 44:197–200, 1992.
- Léon Bottou. Online Learning and Stochastic Approximations, chapter 2, pages 9–42. Cambridge University Press, 1998.
- Large scale online learning. In Proc. Adv. Neural Info. Proces. Sys. (NIPS), pages 217–224, Vancouver, Canada, Dec. 2003.
- Libsvm: A library for support vector machines. ACM Trans. Intell. Syst. Technol., 2(3), may 2011. ISSN 2157-6904.
- Stochastic online optimization using kalman recursion. J. Machine Learning Research, 22(223):1–55, 2021.
- Confidence-weighted linear classification. In Proc. Int. Conf. Machine Learning (ICML), page 264–271, Helsinki, Finland, Jul. 2008.
- Uci machine learning repository, 2017.
- A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119–139, 1997. ISSN 0022-0000.
- Just-in-time deep learning for real-time x-ray computed tomography. Scientific Reports, 13, 2023.
- Clustering data streams. In Proc. Foundations Comp. Science, (FOCS), pages 359–366, Redondo Beach, California, USA, Nov. 2000.
- Logarithmic regret algorithms for online convex optimization. Mach. Learn., 69(2-3):169–192, 2007.
- Online learning from capricious data streams: A generative approach. In Proc. Int. Joint Conf. on Artificial Intell. (IJCAI), pages 2491–2497, Jul. 2019.
- Online multiple kernel classification. Mach. Learn., 90(2):289–316, 2013.
- Physics-guided deep neural networks for power flow analysis. IEEE Trans. Power Systems, 36(3):2082–2092, 2021.
- Self-supervised deep visual odometry with online adaptation. In Proc. IEEE/CVF Conf. Comp. Vision Pattern Recognition (CVPR), pages 6339–6348, Online, Jun. 2020.
- Deep learning enhanced Rydberg multifrequency microwave recognition. Nature Commun., 13(1997):1–10, 2022.
- One-pass learning via bridging orthogonal gradient descent and recursive least-squares. In Proc. IEEE Conf. Decision & Control (CDC), pages 4720–4725, 2022.
- Protores: Proto-residual network for pose authoring via learned inverse kinematics. In Proc. Int. Conf. Learning Representations (ICLR), Online, Apr. 2022.
- Online deep learning: learning deep neural networks on the fly. In Proc. Int. Joint Conf. Artificial Intelligence (IJCAI), page 2660–2666, Stockholm, Sweden, Jul. 2018.
- Adaptive online learning for iot botnet detection. Information Sciences, 574:84–95, 2021. ISSN 0020-0255.
- Burn after reading: Online adaptation for cross-domain streaming data. In Proc. Euro. Conf. Comp. Vision (ECCV), page 404–422, Tel Aviv, Israel, Oct. 2022.
- Onrl: improving mobile video telephony via online reinforcement learning. In Proc. Int. Conf. Mobile Comp. Networking, pages 1–14, London, United Kingdom, Sep. 2020.
- Martin A. Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In Proc. Int. Conf. Machine Learning (ICML), Washington, DC, USA, Aug. 2003.