Probabilistic Modeling for Sequences of Sets in Continuous-Time (2312.15045v3)
Abstract: Neural marked temporal point processes have been a valuable addition to the existing toolbox of statistical parametric models for continuous-time event data. These models are useful for sequences where each event is associated with a single item (a single type of event or a "mark") -- but such models are not suited for the practical situation where each event is associated with a set of items. In this work, we develop a general framework for modeling set-valued data in continuous-time, compatible with any intensity-based recurrent neural point process model. In addition, we develop inference methods that can use such models to answer probabilistic queries such as "the probability of item $A$ being observed before item $B$," conditioned on sequence history. Computing exact answers for such queries is generally intractable for neural models due to both the continuous-time nature of the problem setting and the combinatorially-large space of potential outcomes for each event. To address this, we develop a class of importance sampling methods for querying with set-based sequences and demonstrate orders-of-magnitude improvements in efficiency over direct sampling via systematic experiments with four real-world datasets. We also illustrate how to use this framework to perform model selection using likelihoods that do not involve one-step-ahead prediction.
- Hawkes model for price and trades high-frequency dynamics. Quantitative Finance, 14(7):1147–1166.
- Inference of multivariate exponential Hawkes processes with inhibition and application to neuronal activity. Statistics and Computing, 33(4):91.
- Probabilistic querying of continuous-time event sequences. In International Conference on Artificial Intelligence and Statistics, pages 10235–10251. PMLR.
- Brillinger, D. R. (1975). The identification of point process systems. The Annals of Probability, pages 909–924.
- An Introduction to the Theory of Point Processes: Volume I: Elementary Theory and Methods, page 274. Springer.
- Long horizon forecasting with temporal point processes. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining, pages 571–579.
- Recurrent marked temporal point processes: Embedding event history to vector. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 1555–1564.
- Network self-exciting point processes to measure health impacts of COVID-19. Journal of the Royal Statistical Society Series A: Statistics in Society, 186(3):401–421.
- Learning temporal point processes with intermittent observations. In International Conference on Artificial Intelligence and Statistics, pages 3790–3798. PMLR.
- The MovieLens datasets: History and context. ACM Transactions on Interactive Intelligent Systems (TIIS), 5(4):1–19.
- Hawkes, A. G. (1971). Spectra of some self-exciting and mutually exciting point processes. Biometrika, 58(1):83–90.
- Hawkes, A. G. (2018). Hawkes processes and their applications to finance: a review. Quantitative Finance, 18(2):193–198.
- From viral evolution to spatial contagion: a biologically modulated Hawkes model. Bioinformatics, 38(7):1846–1856.
- Sets2sets: Learning from sequential sets with neural networks. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 1491–1499.
- Adam: A method for stochastic optimization. International Conference on Learning Representations (ICLR).
- Structured determinantal point processes. Advances in Neural Information Processing Systems, 23:1171–1179.
- Determinantal point processes for machine learning. Foundations and Trends® in Machine Learning, 5(2–3):123–286.
- Predicting dynamic embedding trajectory in temporal interaction networks. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 1269–1278.
- Comparison of prospective Hawkes and recursive point process models for Ebola in DRC. Journal of Forecasting, 41(1):201–210.
- Bridging recommendation and marketing via recurrent intensity modeling. In International Conference on Learning Representations.
- Macchi, O. (1975). The coincidence approach to stochastic point processes. Advances in Applied Probability, 7(1):83–122.
- The million song dataset challenge. In Proceedings of the 21st International Conference on World Wide Web, pages 909–916.
- The neural Hawkes process: A neurally self-modulating multivariate point process. Advances in Neural Information Processing Systems, 30:6757–6767.
- Imputing missing events in continuous-time event streams. In International Conference on Machine Learning, pages 4475–4485. PMLR.
- Ogata, Y. (1981). On Lewis’ simulation method for point processes. IEEE Transactions on Information Theory, 27(1):23–31.
- Mean-field limits for non-linear Hawkes processes with excitation and inhibition. Stochastic Processes and their Applications, 153:57–78.
- Toward accurate spatiotemporal COVID-19 risk scores using high-resolution real-world mobility data. ACM Transactions on Spatial Algorithms and Systems (TSAS), 8(2):1–30.
- Factorizing personalized Markov chains for next-basket recommendation. In Proceedings of the 19th International Conference on World Wide Web, pages 811–820.
- Intensity-free learning of temporal point processes. In International Conference on Learning Representations.
- Neural temporal point processes: A review. arXiv preprint arXiv:2104.03528.
- Hawkes process inference with missing data. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 32, pages 6425–6432.
- Concurrent multi-label prediction in event streams. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 37, pages 9820–9828.
- Fastpoint: Scalable deep point processes. In Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: ECML/PKDD, pages 465–480. Springer.
- Van der Vaart, A. W. (2000). Asymptotic Statistics, volume 3. Cambridge University Press.
- Continuous time dynamic topic models. In Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence, pages 579–586.
- Topics over time: a non-Markov continuous-time model of topical trends. In Proceedings of the 12th ACM SIGKDD international Conference on Knowledge Discovery and Data Mining, pages 424–433.
- Classification of flash crashes using the Hawkes (p, q) framework. Quantitative Finance, 22(2):213–240.
- Deep determinantal point process for large-scale multi-label classification. In Proceedings of the IEEE International Conference on Computer Vision, pages 473–482.
- HYPRO: A hybridly normalized probabilistic model for long-horizon prediction of event sequences. Advances in Neural Information Processing Systems, 35:34641–34650.
- A dynamic recurrent model for next basket recommendation. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, pages 729–732.