Bayesian Optimization through Gaussian Cox Process Models for Spatio-temporal Data (2401.14544v1)
Abstract: Bayesian optimization (BO) has established itself as a leading strategy for efficiently optimizing expensive-to-evaluate functions. Existing BO methods mostly rely on Gaussian process (GP) surrogate models and are not applicable to (doubly-stochastic) Gaussian Cox processes, where the observation process is modulated by a latent intensity function modeled as a GP. In this paper, we propose a novel maximum a posteriori inference of Gaussian Cox processes. It leverages the Laplace approximation and change of kernel technique to transform the problem into a new reproducing kernel Hilbert space, where it becomes more tractable computationally. It enables us to obtain both a functional posterior of the latent intensity function and the covariance of the posterior, thus extending existing works that often focus on specific link functions or estimating the posterior mean. Using the result, we propose a BO framework based on the Gaussian Cox process model and further develop a Nystr\"om approximation for efficient computation. Extensive evaluations on various synthetic and real-world datasets demonstrate significant improvement over state-of-the-art inference solutions for Gaussian Cox processes, as well as effective BO with a wide range of acquisition functions designed through the underlying Gaussian Cox process model.
- Practical Bayesian optimization for model fitting with Bayesian adaptive direct search. Advances in Neural Information Processing Systems, 30:1836–1846, 2017.
- Bayesian online changepoint detection. arXiv preprint arXiv:0710.3742, 2007.
- Tractable nonparametric Bayesian inference in Poisson processes with Gaussian process intensities. In Proceedings of the 26th International Conference on Machine Learning, pp. 9–16. PMLR, 2009.
- Structured variational inference in continuous Cox process models. Advances in Neural Information Processing Systems, 32, 2019.
- Christopher TH Baker and RL Taylor. The numerical treatment of integral equations. Journal of Applied Mechanics, 46(4):969, 1979.
- A Cox process with log-normal intensity. Insurance: Mathematics and Economics, 31(2):297–302, 2002.
- Scalable multi-agent covering option discovery based on kronecker graphs. Advances in Neural Information Processing Systems, 35:30406–30418, 2022.
- Minimizing return gaps with discrete communications in decentralized pomdp. arXiv preprint arXiv:2308.03358, 2023.
- Bringing fairness to actor-critic reinforcement learning for network utility optimization. In IEEE INFOCOM 2021-IEEE Conference on Computer Communications, pp. 1–10. IEEE, 2021.
- Fast Gaussian process methods for point process intensity estimation. In Proceedings of the 25th International Conference on Machine Learning, pp. 192–199, 2008.
- BOAT: Building auto-tuners with structured Bayesian optimization. In Proceedings of the 26th International Conference on World Wide Web, pp. 479–488, 2017.
- DC.gov. 2022 Crime incidents in Washington, DC, 2022. URL https://opendata.dc.gov/datasets/DCGIS::crime-incidents-in-2022/about. [Accessed: August 02, 2023].
- Spatial and spatio-temporal log-Gaussian Cox processes: Extending the geostatistical paradigm. Statistical Science, 28(4):542–563, 2013.
- Efficient Bayesian inference of sigmoidal Gaussian Cox processes. The Journal of Machine Learning Research, 19(1):2710–2743, 2018.
- Poisson intensity estimation with reproducing kernels. In International Conference on Artificial Intelligence and Statistics, pp. 270–279. PMLR, 2017.
- Accmer: Accelerating multi-agent experience replay with cache locality-aware prioritization. In 2023 IEEE 34th International Conference on Application-specific Systems, Architectures and Processors (ASAP), pp. 205–212. IEEE, 2023.
- Constrained Bayesian optimization for automatic chemical design using variational autoencoders. Chemical Science, 11(2):577–586, 2020.
- Efficient Bayesian nonparametric modelling of structured point processes. In Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, pp. 310–319, 2014.
- A toolbox for fitting complex spatial point process models using integrated nested Laplace approximation (INLA). Annals of Applied Statistics, 6(4):1499–1530, 2012.
- Richard G Jarrett. A note on the intervals between coal-mining disasters. Biometrika, 66(1):191–193, 1979.
- Hideaki Kim. Fast Bayesian inference for Gaussian Cox processes via path integral formulation. Advances in Neural Information Processing Systems, 34:26130–26142, 2021.
- Harold J. Kushner. A new method of locating the maximum point of an arbitrary multipeak curve in the presence of noise. Journal of Basic Engineering, 86:97–106, 1964.
- Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4–22, 1985.
- Simulation of nonhomogeneous Poisson processes by thinning. Naval Research Logistics Quarterly, 26(3):403–413, 1979.
- Decentralized on-ramp merging control of connected and automated vehicles in the mixed traffic using control barrier functions. In 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), pp. 1125–1131. IEEE, 2021.
- Safety-critical and flexible cooperative on-ramp merging control of connected and automated vehicles in mixed traffic. IEEE Transactions on Intelligent Transportation Systems, 24(3):2920–2934, 2023.
- Variational inference for Gaussian process modulated Poisson processes. In Proceedings of the 32nd International Conference on Machine Learning, pp. 1814–1822. PMLR, 2015.
- Bayesian hyperparameter optimization for deep neural network-based network intrusion detection. In 2021 IEEE International Conference on Big Data (Big Data), pp. 5413–5419. IEEE, 2021.
- A Bayesian optimization framework for finding local optima in expensive multimodal functions. arXiv preprint arXiv:2210.06635, 2022.
- Exploiting partial common information microstructure for multi-modal brain tumor segmentation. In Workshop on Machine Learning for Multimodal Healthcare Data, pp. 64–85. Springer, 2023a.
- MAC-PO: Multi-agent experience replay via collective priority optimization. In Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, pp. 466–475, 2023b.
- Projection-optimal monotonic value function factorization in multi-agent reinforcement learning. In Proceedings of the 2024 International Conference on Autonomous Agents and Multiagent Systems, 2024.
- Jonas Močkus. On Bayesian methods for seeking the extremum. In Proceedings of the 7th IFIP conference, pp. 400–404. Springer, 1975.
- Log Gaussian Cox processes. Scandinavian Journal of Statistics, 25(3):451–482, 1998.
- ECML/PKDD 15: Taxi Trajectory Prediction, 2015. URL https://kaggle.com/competitions/pkdd-15-predict-taxi-service-trajectory-i. [Accessed: August 10, 2023].
- Intermittent status updating with random update arrivals. In 2021 IEEE International Symposium on Information Theory (ISIT), pp. 3121–3126. IEEE, 2021.
- Adaptive on/off scheduling to minimize age of information in an energy harvesting receiver. IEEE Sensors Journal, pp. 1–1, 2023.
- Gaussian processes for machine learning, volume 1. Springer, 2006.
- Conjunctive representation of position, direction, and velocity in entorhinal cortex. Science, 312(5774):758–762, 2006.
- A generalized representer theorem. In International Conference on Computational Learning Theory, pp. 416–426. Springer, 2001.
- Gideon Schwarz. Estimating the dimension of a model. The Annals of Statistics, pp. 461–464, 1978.
- Bayesian intermittent demand forecasting for large inventories. Advances in Neural Information Processing Systems, 29, 2016.
- Accelerating Bayesian optimization for biological sequence design with denoising autoencoders. In Proceedings of the 39th International Conference on Machine Learning, pp. 20459–20478. PMLR, 2022.
- Fast Bayesian intensity estimation for the permanental process. In Proceedings of the 34th International Conference on Machine Learning, pp. 3579–3588. PMLR, 2017.
- Federated conditional stochastic optimization. arXiv preprint arXiv:2310.02524, 2023.
- Tkil: Tangent kernel optimization for class balanced incremental learning. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, pp. 3529–3539, October 2023.
- Knowledge distillation circumvents nonlinearity for optical convolutional neural networks. Applied Optics, 61(9):2173–2183, 2022.
- Vflh: A following-the-leader-history based algorithm for adaptive online convex optimization with stochastic constraints. In 2023 IEEE 35th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 172–177. IEEE, 2023a.
- Particle-based online bayesian sampling. arXiv preprint arXiv:2302.14796, 2023b.
- Optimizing the age of information with segmentation and predictive scheduling. In 2023 IEEE Wireless Communications and Networking Conference (WCNC), pp. 1–6, 2023.
- PAC: Assisted value factorization with counterfactual predictions in multi-agent reinforcement learning. Advances in Neural Information Processing Systems, 35:15757–15769, 2022.
- Every parameter matters: Ensuring the convergence of federated learning with dynamic heterogeneous models reduction. In Advances in Neural Information Processing Systems, volume 36, 2023.
- Yongsheng Mei (14 papers)
- Mahdi Imani (9 papers)
- Tian Lan (162 papers)