Spatial-Temporal Interplay in Human Mobility: A Hierarchical Reinforcement Learning Approach with Hypergraph Representation (2312.15717v1)
Abstract: In the realm of human mobility, the decision-making process for selecting the next-visit location is intricately influenced by a trade-off between spatial and temporal constraints, which are reflective of individual needs and preferences. This trade-off, however, varies across individuals, making the modeling of these spatial-temporal dynamics a formidable challenge. To address the problem, in this work, we introduce the "Spatial-temporal Induced Hierarchical Reinforcement Learning" (STI-HRL) framework, for capturing the interplay between spatial and temporal factors in human mobility decision-making. Specifically, STI-HRL employs a two-tiered decision-making process: the low-level focuses on disentangling spatial and temporal preferences using dedicated agents, while the high-level integrates these considerations to finalize the decision. To complement the hierarchical decision setting, we construct a hypergraph to organize historical data, encapsulating the multi-aspect semantics of human mobility. We propose a cross-channel hypergraph embedding module to learn the representations as the states to facilitate the decision-making cycle. Our extensive experiments on two real-world datasets validate the superiority of STI-HRL over state-of-the-art methods in predicting users' next visits across various performance metrics.
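The two-tiered decision process described in the abstract can be illustrated with a minimal sketch. It is not the STI-HRL implementation: the abstract does not specify the agents, states, or rewards, so the distance-based spatial scorer, the hour-gap temporal scorer, the fixed trade-off weight `alpha`, and all function and field names below are assumptions introduced purely for illustration. In the paper the low-level agents are learned with reinforcement learning over hypergraph-derived states; here they are replaced by simple hand-written heuristics to show only how separate spatial and temporal preferences might be combined at the high level.

```python
import math


def softmax(scores):
    """Turn raw scores into a probability distribution (numerically stable)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def spatial_agent(current_xy, candidates):
    """Hypothetical low-level spatial scorer: nearer candidates get higher probability."""
    return softmax([-math.dist(current_xy, c["xy"]) for c in candidates])


def temporal_agent(hour, candidates):
    """Hypothetical low-level temporal scorer: candidates whose assumed
    peak visiting hour is close to `hour` (on a 24h clock) get higher probability."""
    def gap(peak):
        d = abs(hour - peak) % 24
        return min(d, 24 - d)
    return softmax([-gap(c["peak_hour"]) for c in candidates])


def high_level_decision(spatial_p, temporal_p, alpha):
    """Hypothetical high-level step: blend the two low-level preferences.

    `alpha` in [0, 1] is an assumed per-user trade-off weight
    (1.0 = purely spatial, 0.0 = purely temporal). Returns the index
    of the chosen candidate.
    """
    combined = [alpha * s + (1 - alpha) * t for s, t in zip(spatial_p, temporal_p)]
    return max(range(len(combined)), key=combined.__getitem__)


# Toy candidates (made-up data for illustration only).
candidates = [
    {"name": "cafe", "xy": (0.0, 1.0), "peak_hour": 20},
    {"name": "gym", "xy": (5.0, 5.0), "peak_hour": 8},
]
sp = spatial_agent((0.0, 0.0), candidates)
tp = temporal_agent(8, candidates)

# A distance-sensitive user versus a schedule-sensitive user.
near_choice = candidates[high_level_decision(sp, tp, alpha=0.9)]["name"]
time_choice = candidates[high_level_decision(sp, tp, alpha=0.1)]["name"]
```

With the same candidates and context, a user weighted toward spatial constraints picks the nearby cafe while a user weighted toward temporal constraints picks the gym, which mirrors the abstract's point that the trade-off varies across individuals.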