Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
194 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Deep Learning for Cross-Domain Data Fusion in Urban Computing: Taxonomy, Advances, and Outlook (2402.19348v2)

Published 29 Feb 2024 in cs.LG and cs.AI

Abstract: As cities continue to burgeon, Urban Computing emerges as a pivotal discipline for sustainable development by harnessing the power of cross-domain data fusion from diverse sources (e.g., geographical, traffic, social media, and environmental data) and modalities (e.g., spatio-temporal, visual, and textual modalities). Recently, we are witnessing a rising trend that utilizes various deep-learning methods to facilitate cross-domain data fusion in smart cities. To this end, we propose the first survey that systematically reviews the latest advancements in deep learning-based data fusion methods tailored for urban computing. Specifically, we first delve into data perspective to comprehend the role of each modality and data source. Secondly, we classify the methodology into four primary categories: feature-based, alignment-based, contrast-based, and generation-based fusion methods. Thirdly, we further categorize multi-modal urban applications into seven types: urban planning, transportation, economy, public safety, society, environment, and energy. Compared with previous surveys, we focus more on the synergy of deep learning methods with urban computing applications. Furthermore, we shed light on the interplay between LLMs and urban computing, postulating future research directions that could revolutionize the field. We firmly believe that the taxonomy, progress, and prospects delineated in our survey stand poised to significantly enrich the research community. The summary of the comprehensive and up-to-date paper list can be found at https://github.com/yoshall/Awesome-Multimodal-Urban-Computing.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (391)
  1. Hierarchal assessment of noise pollution in urban areas–a case study. Transportation Research Part D: Transport and Environment 34, 95–103.
  2. Multi-feature, multi-modal, and multi-source social event detection: A comprehensive survey. Information Fusion 79, 279–308.
  3. Geospatial analyses of recent household surveys to assess changes in the distribution of zero-dose children and their associated factors before and during the covid-19 pandemic in nigeria. Vaccines 11. URL: https://doi.org/10.3390/vaccines11121830, doi:10.3390/vaccines11121830.
  4. Exploring the spatial-visual locality of geo-tagged urban street images, in: 2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR), IEEE. pp. 104–110.
  5. Data-driven crowd simulation with generative adversarial networks, in: Proceedings of the 32nd International Conference on Computer Animation and Social Agents, pp. 7–10.
  6. Agent-based modeling in translational systems biology. Complex Systems and Computational Biology Approaches to Acute Inflammation: A Framework for Model-based Precision Medicine , 31–52.
  7. Event location detection from online clustering algorithms using geo-tagged user data in social streams, in: Disruptive Technologies for Big Data and Cloud Applications: Proceedings of ICBDCC 2021. Springer, pp. 227–235.
  8. Climate-driven risks to the climate mitigation potential of forests. Science 368, eaaz7005.
  9. Grvs: a georeferenced video search engine, in: Proceedings of the 17th ACM International Conference on Multimedia, pp. 977–978.
  10. Geographic mapping with unsupervised multi-modal representation learning from VHR images and POIs. ISPRS Journal of Photogrammetry and Remote Sensing 201, 193–208. doi:10.1016/j.isprsjprs.2023.05.006.
  11. Stg2seq: Spatial-temporal graph to sequence model for multi-step passenger demand forecasting. arXiv preprint arXiv:1905.10069 .
  12. Spatio-temporal graph convolutional and recurrent networks for citywide passenger demand prediction, in: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 2293–2296.
  13. Geospatial entity resolution, in: Proceedings of the ACM Web Conference 2022, Association for Computing Machinery, New York, NY, USA. pp. 3061–3070. doi:10.1145/3485447.3512026.
  14. Multimodal machine learning: A survey and taxonomy. IEEE transactions on pattern analysis and machine intelligence 41, 423–443.
  15. Human mobility: Models and applications. Physics Reports 734, 1–74.
  16. Milp model for fleet and charging infrastructure decisions for fast-charging city electric bus services. Computers & Industrial Engineering , 109336.
  17. Models for traffic control. JOURNAL A 43, 13–22.
  18. Street view imagery in urban analytics and gis: A review. Landscape and Urban Planning 215, 104217.
  19. Pre-trained semantic embeddings for POI categories based on multiple contexts. IEEE Transactions on Knowledge and Data Engineering 35, 8893–8904. doi:10.1109/TKDE.2022.3218851.
  20. Geospatial data management research: Progress and future directions. ISPRS International Journal of Geo-Information 9, 95.
  21. Video generation models as world simulators URL: https://openai.com/research/video-generation-models-as-world-simulators.
  22. Language models are few-shot learners. Advances in neural information processing systems 33, 1877–1901.
  23. Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv preprint arXiv:2303.12712 .
  24. Automatic construction of POI address lists at city streets from geo-tagged photos and web data: a case study of San Jose City. Multimedia Tools and Applications , 1–22.
  25. Using satellite imagery to understand and promote sustainable development. Science 371, eabe8628.
  26. Comparison of efficiency between public and private transport modes using excess commuting: An experience in dar es salaam. Journal of Transport Geography 82, 102616.
  27. M 2 g4rtp: A multi-level and multi-task graph model for instant-logistics route and time joint prediction, in: 2023 IEEE 39th International Conference on Data Engineering (ICDE), IEEE. pp. 3296–3308.
  28. Tempo: Prompt-based generative pre-trained transformer for time series forecasting. arXiv preprint arXiv:2310.04948 .
  29. A survey on generative diffusion model. arXiv preprint arXiv:2209.02646 .
  30. Points-of-interest recommendation algorithm based on LBSN in edge computing environment. IEEE Access 8, 47973–47983. doi:10.1109/ACCESS.2020.2979922.
  31. Dirty density: Air quality and the density of american cities. Journal of Environmental Economics and Management 118, 102767.
  32. NodeSense2Vec: Spatiotemporal context-aware network embedding for heterogeneous urban mobility data, in: 2021 IEEE International Conference on Big Data (Big Data), pp. 2884–2893. doi:10.1109/BigData52589.2021.9672072.
  33. Llm4ts: Two-stage fine-tuning for time-series forecasting with pre-trained llms. arXiv preprint arXiv:2308.08469 .
  34. Mapping essential urban land use categories (euluc) using geospatial big data: Progress, challenges, and opportunities. Big Earth Data 5, 410–441.
  35. Daily weather forecasting based on deep learning model: A case study of shenzhen city, china. Atmosphere 13, 1208.
  36. A cross-city federated transfer learning framework: A case study on urban region profiling. doi:10.48550/arXiv.2206.00007, arXiv:2206.00007.
  37. Quantifying the green view indicator for assessing urban greening quality: An analysis based on internet-crawling street view data. Ecological Indicators 113, 106192.
  38. RADAR: Road obstacle identification for disaster response leveraging cross-domain urban data. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 1, 1–23. doi:10.1145/3161159.
  39. UVLens: Urban village boundary identification and population estimation leveraging open government data. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 5, 57:1–57:26. doi:10.1145/3463495.
  40. Impact of urbanization on ecosystem health in chinese urban agglomerations. Environmental Impact Assessment Review 98, 106964.
  41. On information coverage for location category based point-of-interest recommendation. Proceedings of the AAAI Conference on Artificial Intelligence 29. doi:10.1609/aaai.v29i1.9191.
  42. Location- and keyword-based querying of geo-textual data: a survey. The VLDB Journal 30, 603–640. URL: https://doi.org/10.1007/s00778-021-00661-w, doi:10.1007/s00778-021-00661-w.
  43. Towards understanding the mixture-of-experts layer in deep learning, in: Koyejo, S., Mohamed, S., Agarwal, A., Belgrave, D., Cho, K., Oh, A. (Eds.), Advances in Neural Information Processing Systems, Curran Associates, Inc.. pp. 23049–23062. URL: https://proceedings.neurips.cc/paper_files/paper/2022/file/91edff07232fb1b55a505a9e9f6c0ff3-Paper-Conference.pdf.
  44. Prior water availability modifies the effect of heavy rainfall on dengue transmission: a time series analysis of passive surveillance data from southern china. Frontiers in Public Health URL: https://doi.org/10.3389/fpubh.2023.1287678, doi:10.3389/fpubh.2023.1287678.
  45. Review of image data augmentation in computer vision. Journal of Frontiers of Computer Science & Technology 15, 583.
  46. Nus-wide: A real-world web image database from national university of singapore, in: ACM International Conference on Image and Video Retrieval, pp. 48:1–48:9.
  47. CondorFerries, 2023. Explore solo travel trends & stats by demographics, destination, industry & why solo travel continues to rise! https://www.condorferries.co.uk/solo-travel-statistics/,.
  48. Satmae: Pre-training transformers for temporal and multi-spectral satellite imagery. Advances in Neural Information Processing Systems 35, 197–211.
  49. So what do we call twitter now anyway? The New York Times Archived from the original on October 12, 2023. Retrieved August 29, 2023.
  50. Chatlaw: Open-source legal large language model with integrated external knowledge bases. arXiv preprint arXiv:2306.16092 .
  51. Personalized route recommendation using big trajectory data, in: 2015 IEEE 31st international conference on data engineering, IEEE. pp. 543–554.
  52. Instructblip: Towards general-purpose vision-language models with instruction tuning. arxiv 2023. arXiv preprint arXiv:2305.06500 .
  53. Flashattention-2: Faster attention with better parallelism and work partitioning. arXiv preprint arXiv:2307.08691 .
  54. Flashattention: Fast and memory-efficient exact attention with io-awareness. Advances in Neural Information Processing Systems 35, 16344–16359.
  55. Beyond just vision: A review on self-supervised representation learning on multimodal and temporal data. arXiv preprint arXiv:2206.02353 .
  56. Virtex: Learning visual representations from textual annotations, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 11162–11173.
  57. Llm. int8 (): 8-bit matrix multiplication for transformers at scale. arXiv preprint arXiv:2208.07339 .
  58. Qlora: Efficient finetuning of quantized llms. arXiv preprint arXiv:2305.14314 .
  59. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 .
  60. Mgeo: Multi-modal geographic language model pre-training, in: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 185–194.
  61. Cities and transportation. Traffic and Safety Sciences—Interdisciplinary Wisdom of IATSS, International Association of Traffic and Safety Sciences , 12–21.
  62. Beyond geo-first law: Learning spatial representations via integrated autocorrelations and complementarity, in: 2019 IEEE International Conference on Data Mining (ICDM), pp. 160–169. doi:10.1109/ICDM.2019.00026.
  63. Noisesense: Crowdsourced context aware sensing for real time noise pollution monitoring of the city, in: 2017 IEEE international conference on advanced networks and telecommunications systems (ANTS), IEEE. pp. 1–6.
  64. Differential privacy: A survey of results, in: International conference on theory and applications of models of computation, Springer. pp. 1–19.
  65. Deepfec: energy consumption prediction under real-world driving conditions for smart cities, in: Proceedings of the Web Conference 2021, pp. 1880–1890.
  66. Urban visual intelligence: Uncovering hidden city profiles with street view images. Proceedings of the National Academy of Sciences 120, e2220417120. URL: https://www.pnas.org/doi/abs/10.1073/pnas.2220417120, doi:10.1073/pnas.2220417120, arXiv:https://www.pnas.org/doi/pdf/10.1073/pnas.2220417120.
  67. A top-k POI recommendation approach based on LBSN and multi-graph fusion. Neurocomputing 518, 219–230. doi:10.1016/j.neucom.2022.10.048.
  68. The economy needs agent-based modelling. Nature 460, 685–686.
  69. Knowledge fusion for probabilistic generative classifiers with data mining applications. IEEE Transactions on Knowledge and Data Engineering 26, 652–666.
  70. Computational approaches in rigorous sociology: agent-based computational modeling and computational social science. Handbook of Sociological Science , 57–72.
  71. Enhancing pipeline-based conversational agents with large language models. arXiv preprint arXiv:2309.03748 .
  72. Efficient region embedding with multi-view spatial networks: A perspective of locality-constrained spatial autocorrelations, in: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 906–913.
  73. Causal inference in recommender systems: A survey and future directions. ACM Transactions on Information Systems .
  74. Generative adversarial networks for spatio-temporal data: A survey. ACM Transactions on Intelligent Systems and Technology (TIST) 13, 1–25.
  75. Dual-grained human mobility learning for location-aware trip recommendation with spatial–temporal graph knowledge fusion. Information Fusion 92, 46–63.
  76. Dual-grained human mobility learning for location-aware trip recommendation with spatial–temporal graph knowledge fusion. Information Fusion 92, 46–63. doi:10.1016/j.inffus.2022.11.018.
  77. Contextual spatio-temporal graph representation learning for reinforced human mobility mining. Information Sciences 606, 230–249.
  78. Magicdrive: Street view generation with diverse 3d geometry control. arXiv preprint arXiv:2310.02601 .
  79. The leverage cycle. NBER macroeconomics annual 24, 1–66.
  80. Spatiotemporal multi-graph convolution network for ride-hailing demand forecasting, in: Proceedings of the AAAI conference on artificial intelligence, pp. 3656–3663.
  81. Multimodal masked autoencoders learn transferable representations. arXiv preprint arXiv:2205.14204 .
  82. Multi-modal graph interaction for multi-graph convolution network in urban spatiotemporal forecasting. doi:10.48550/arXiv.1905.11395, arXiv:1905.11395.
  83. GeoVid Project, . GeoVid Project. URL: http://geovid.org/.
  84. Flow control: A comparative survey. IEEE Transactions on Communications 28, 553–574.
  85. Forecasting of ozone concentration in smart city using deep learning, in: 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), IEEE. pp. 1320–1326.
  86. Classifying street spaces with street view images for a spatial indicator of urban functions. Sustainability 11, 6424.
  87. Predicting human-wildlife interaction in urban environments through agent-based models. Landscape and Urban Planning 240, 104878.
  88. Knowledge distillation: A survey. International Journal of Computer Vision 129, 1789–1819.
  89. Context-aware, preference-based vehicle routing. The VLDB Journal 29, 1149–1170.
  90. A nonparametric model for event discovery in the geospatial-temporal space, in: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, Association for Computing Machinery, New York, NY, USA. pp. 499–508. doi:10.1145/2983323.2983790.
  91. A force-directed approach to seeking route recommendation in ride-on-demand service using multi-source urban data. IEEE Transactions on Mobile Computing 21, 1909–1926. doi:10.1109/TMC.2020.3033274.
  92. Rod-revenue: Seeking strategies analysis and revenue prediction in ride-on-demand service using multi-source urban data. IEEE Transactions on Mobile Computing 19, 2202–2220.
  93. A multi-dimensional crime spatial pattern analysis and prediction model based on classification. ETRI Journal 43, 272–287.
  94. A graph-based approach for trajectory similarity computation in spatial networks, in: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp. 556–564.
  95. Urban computing for sustainable smart cities: Recent advances, taxonomy, and open research challenges. Sustainability 15, 3916.
  96. A joint context-aware embedding for trip recommendations, in: 2019 IEEE 35th International Conference on Data Engineering (ICDE), IEEE. pp. 292–303.
  97. An optimal charging station location model with the consideration of electric vehicle’s driving range. Transportation Research Part C: Emerging Technologies 86, 641–654.
  98. The first high-resolution meteorological forcing dataset for land process studies over China. Scientific Data 7, 25.
  99. Masked autoencoders are scalable vision learners, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 16000–16009.
  100. Training and analysing deep recurrent neural networks. Advances in neural information processing systems 26.
  101. Graphmae: Self-supervised masked graph autoencoders, in: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 594–604.
  102. Rsgpt: A remote sensing vision language model and benchmark. arXiv preprint arXiv:2307.15266 .
  103. Unlocking the potential of user feedback: Leveraging large language model as user simulator to enhance dialogue system. arXiv preprint arXiv:2306.09821 .
  104. Exploiting spatial-temporal-social constraints for localness inference using online social media, in: 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 287–294. doi:10.1109/ASONAM.2016.7752247.
  105. Deepcrime: Attentive hierarchical recurrent networks for crime prediction, in: Proceedings of the 27th ACM international conference on information and knowledge management, pp. 1423–1432.
  106. Deep learning in finance and banking: A literature review and classification. Frontiers of Business Research in China 14, 1–24.
  107. ERNIE-GeoL: A geography-and-language pre-trained model and its applications in baidu maps, in: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 3029–3039. doi:10.1145/3534678.3539021, arXiv:2203.09127.
  108. Real-time driver behavior detection based on deep deformable inverted residual network with an attention mechanism for human-vehicle co-driving system. IEEE Transactions on Vehicular Technology 71, 12475–12488.
  109. Comprehensive urban space representation with varying numbers of street-level images. Computers, Environment and Urban Systems 106, 102043. URL: https://www.sciencedirect.com/science/article/pii/S0198971523001060, doi:https://doi.org/10.1016/j.compenvurbsys.2023.102043.
  110. Comprehensive urban space representation with varying numbers of street-level images. Computers, Environment and Urban Systems 106, 102043. doi:10.1016/j.compenvurbsys.2023.102043.
  111. Air quality prediction in smart cities using machine learning technologies based on sensor data: a review. Applied Sciences 10, 2401.
  112. Structural-rnn: Deep learning on spatio-temporal graphs, in: Proceedings of the ieee conference on computer vision and pattern recognition, pp. 5308–5317.
  113. Unsupervised representation learning of spatial data via multimodal embedding, in: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, Association for Computing Machinery, New York, NY, USA. pp. 1993–2002. doi:10.1145/3357384.3358001.
  114. Smart city: A system for measuring noise pollution. Smart Cities and Regional Development (SCRD) Journal 2, 79–85.
  115. A crowdsensing platform for real-time monitoring and analysis of noise pollution in smart cities. Sustainable Computing: Informatics and Systems 31, 100588.
  116. Spatio-temporal self-supervised learning for traffic flow prediction, in: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 4356–4364.
  117. Self-supervised spatiotemporal graph neural networks with self-distillation for traffic prediction. IEEE Transactions on Intelligent Transportation Systems 24, 1580–1593. doi:10.1109/TITS.2022.3219626.
  118. Urban sensing based on human mobility, in: Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, pp. 1040–1051.
  119. Sman: Stacked multimodal attention network for cross-modal image–text retrieval. IEEE transactions on cybernetics 52, 1086–1097.
  120. DeepUrbanEvent: A system for predicting citywide crowd dynamics at big events, in: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Association for Computing Machinery, New York, NY, USA. pp. 2114–2122. doi:10.1145/3292500.3330654.
  121. ITV: Inferring traffic violation-prone locations with vehicle trajectories and road environment data. IEEE Systems Journal 15, 3913–3924. doi:10.1109/JSYST.2020.3012743.
  122. Spatio-temporal graph neural networks for predictive learning in urban computing: A survey. IEEE Transactions on Knowledge and Data Engineering , 1–20doi:10.1109/TKDE.2023.3333824.
  123. Time-llm: Time series forecasting by reprogramming large language models. arXiv preprint arXiv:2310.01728 .
  124. Urban building energy modeling: State of the art and future prospects. Renewable and Sustainable Energy Reviews 128, 109902.
  125. Air quality prediction: Big data and machine learning approaches. Int. J. Environ. Sci. Dev 9, 8–16.
  126. A review of urban physical environment sensing using street view imagery in public health studies. Annals of GIS 26, 261–275.
  127. Scaling laws for neural language models. arXiv preprint arXiv:2001.08361 .
  128. Joint predictions of multi-modal ride-hailing demands: A deep multi-task multi-graph learning-based approach. Transportation Research Part C: Emerging Technologies 127, 103063.
  129. Spatio-temporal contrastive self-supervised learning for poi-level crowd flow inference. arXiv preprint arXiv:2309.03239 .
  130. Collective embedding with feature importance: A unified approach for spatiotemporal network embedding, in: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, Association for Computing Machinery, New York, NY, USA. pp. 615–624. doi:10.1145/3340531.3412030.
  131. Energy use and urbanization as determinants of china’s environmental quality: prospects of the paris climate agreement. Journal of Environmental Planning and Management 65, 2363–2386.
  132. Diffusionsat: A generative foundation model for satellite imagery. arXiv preprint arXiv:2312.03606 .
  133. Mediaq: mobile multimedia management system, in: Proceedings of the 5th ACM Multimedia Systems Conference, pp. 224–235.
  134. A deep learning model for air quality prediction in smart cities, in: 2017 IEEE international conference on big data (big data), IEEE. pp. 1983–1990.
  135. Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems 25.
  136. Dependencies between demographic urbanization and the agglomeration road traffic volumes: Evidence from poland. Land 10, 47.
  137. Geochat: Grounded large vision-language model for remote sensing. arXiv preprint arXiv:2311.15826 .
  138. Masked vision and language modeling for multi-modal representation learning. arXiv preprint arXiv:2208.02131 .
  139. Spatio-temporal graph mixformer for traffic forecasting. Expert Systems with Applications 228, 120281.
  140. A preference-aware meta-optimization framework for personalized vehicle energy consumption estimation, in: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery, New York, NY, USA. pp. 4346–4356. doi:10.1145/3580305.3599767.
  141. Climabench: A benchmark dataset for climate change text understanding in english. arXiv preprint arXiv:2301.04253 .
  142. Contrastive representation learning: A framework and review. Ieee Access 8, 193907–193934.
  143. Dynamic graph convolutional recurrent network for traffic prediction: Benchmark and solution. ACM Trans. Knowl. Discov. Data 17. URL: https://doi.org/10.1145/3532611, doi:10.1145/3532611.
  144. Ood-gnn: Out-of-distribution generalized graph neural network. IEEE Transactions on Knowledge and Data Engineering .
  145. Urbanization and rural–urban consumption disparity: Evidence from china. The Singapore Economic Review 64, 983–996.
  146. Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models. arXiv preprint arXiv:2301.12597 .
  147. Align before fuse: Vision and language representation learning with momentum distillation. Advances in neural information processing systems 34, 9694–9705.
  148. A survey on federated learning systems: Vision, hype and reality for data privacy and protection. IEEE Transactions on Knowledge and Data Engineering .
  149. Predicting multi-level socioeconomic indicators from structural urban imagery, in: Proceedings of the 31st ACM International Conference on Information & Knowledge Management, pp. 3282–3291.
  150. HMGCL: Heterogeneous multigraph contrastive learning for LBSN friend recommendation. World Wide Web 26, 1625–1648. doi:10.1007/s11280-022-01092-5.
  151. Urban region representation learning with OpenStreetMap building footprints, in: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery, New York, NY, USA. pp. 1363–1373. doi:10.1145/3580305.3599538.
  152. Pare: A system for personalized route guidance, in: Proceedings of the 26th International Conference on World Wide Web, International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE. p. 637–646. URL: https://doi.org/10.1145/3038912.3052717, doi:10.1145/3038912.3052717.
  153. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. arXiv preprint arXiv:1707.01926 .
  154. Losparse: Structured compression of large language models based on low-rank and sparse approximation. arXiv preprint arXiv:2306.11222 .
  155. The effect of urbanization on environmental pollution in rapidly developing urban agglomerations. Journal of cleaner production 237, 117649.
  156. Geoman: Multi-level attention networks for geo-sensory time series prediction., in: IJCAI, pp. 3428–3434.
  157. Urbanfm: Inferring fine-grained urban flows, in: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, pp. 3132–3142.
  158. Fine-grained urban flow prediction, in: Proceedings of the Web Conference 2021, Association for Computing Machinery, New York, NY, USA. pp. 1833–1845. doi:10.1145/3442381.3449792.
  159. Airformer: Predicting nationwide air quality in china with transformers, in: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 14329–14337.
  160. A causal inference look at unsupervised video anomaly detection, in: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 1620–1629.
  161. Llm-eval: Unified multi-dimensional automatic evaluation for open-domain conversations with large language models. arXiv preprint arXiv:2305.13711 .
  162. Deepstn+: Context-aware spatial-temporal neural network for crowd flow prediction in metropolis, in: Proceedings of the AAAI conference on artificial intelligence, pp. 1020–1027.
  163. Spatio-temporal adaptive embedding makes vanilla transformer sota for traffic forecasting, in: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, pp. 4125–4129.
  164. Characterizing and forecasting urban vibrancy evolution: A multi-view graph mining perspective. ACM Transactions on Knowledge Discovery from Data 17, 68:1–68:24. doi:10.1145/3568683.
  165. Unified route representation learning for multi-modal transportation recommendation with spatiotemporal pre-training. The VLDB Journal — The International Journal on Very Large Data Bases 32, 325–342. doi:10.1007/s00778-022-00748-y.
  166. Improved baselines with visual instruction tuning. arXiv preprint arXiv:2310.03744 .
  167. Joint representation learning for multi-modal transportation recommendation. Proceedings of the AAAI Conference on Artificial Intelligence 33, 1036–1043. doi:10.1609/aaai.v33i01.33011036.
  168. Joint representation learning for multi-modal transportation recommendation, in: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 1036–1043.
  169. Urban big data fusion based on deep learning: An overview. Information Fusion 53, 123–133.
  170. Symbolic aggregate approximation based data fusion model for dangerous driving behavior detection. Information Sciences 609, 626–643.
  171. A review on remote sensing data fusion with generative adversarial networks (gan). Authorea Preprints .
  172. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. ACM Computing Surveys 55, 1–35.
  173. Unitime: A language-empowered unified model for cross-domain time series forecasting. arXiv preprint arXiv:2310.09751 .
  174. Largest: A benchmark dataset for large-scale traffic forecasting. arXiv preprint arXiv:2306.08259 .
  175. Self-supervised learning: Generative or contrastive. IEEE transactions on knowledge and data engineering 35, 857–876.
  176. Spatiotemporal activity modeling via hierarchical cross-modal embedding. IEEE Transactions on Knowledge and Data Engineering 34, 462–474. doi:10.1109/TKDE.2020.2983892.
  177. Internet of things for noise mapping in smart cities: state of the art and future directions. IEEE Network 34, 112–118.
  178. Knowledge-infused contrastive learning for urban imagery-based socioeconomic prediction, in: Proceedings of the ACM Web Conference 2023, pp. 4150–4160.
  179. Geougv: User-generated mobile video dataset with fine granularity spatial metadata, in: Proceedings of the 7th international conference on multimedia systems, pp. 1–6.
  180. Let trajectories speak out the traffic bottlenecks. doi:10.48550/arXiv.2107.12948, arXiv:2107.12948.
  181. Spatiotemporal variations of “triple-demic” outbreaks of respiratory infections in the united states in the post-covid-19 era. BMC Public Health 23. URL: https://doi.org/10.1186/s12889-023-17406-9, doi:10.1186/s12889-023-17406-9.
  182. Dual-level collaborative transformer for image captioning, in: Proceedings of the AAAI conference on artificial intelligence, pp. 2286–2293.
  183. Lc-rnn: A deep learning model for traffic speed prediction., in: IJCAI, p. 27th.
  184. HiSTGNN: Hierarchical spatio-temporal graph neural network for weather forecasting. Information Sciences 648, 119580.
  185. Sainf: Stay area inference of vehicles using surveillance camera records .
  186. Deep learning in mining biological data. Cognitive computation 13, 1–33.
  187. Geollm: Extracting geospatial knowledge from large language models. arXiv preprint arXiv:2310.06213 .
  188. Jointly contrastive representation learning on road network and trajectory, in: Proceedings of the 31st ACM International Conference on Information & Knowledge Management, pp. 1501–1510.
  189. MediaQ Project, . MediaQ Project. URL: http://mediaq1.cloudapp.net/home/.
  190. Tobler’s first law and spatial analysis. Annals of the association of American geographers 94, 284–289.
  191. Public transportation and sustainability: A review. KSCE Journal of Civil Engineering 20, 1076–1083.
  192. Integrating gps trajectory and topics from twitter stream for human mobility estimation. Frontiers of Computer Science 13, 460–470.
  193. Deep learning-based vehicle behavior prediction for autonomous driving applications: A review. IEEE Transactions on Intelligent Transportation Systems 23, 33–47.
  194. Understanding multimodal contrastive learning and incorporating unpaired data, in: International Conference on Artificial Intelligence and Statistics, PMLR. pp. 4348–4380.
  195. Realroi: Discovering real regions of interest from geotagged photos. IEEE Access 10, 83489–83497.
  196. A study and analysis of recommendation systems for location-based social network (LBSN) with big data. IIMB Management Review 28, 25–30. doi:10.1016/j.iimb.2016.01.001.
  197. Driver behavior modeling towards autonomous vehicles: Comprehensive review. IEEE Access .
  198. Causality: models, reasoning, and inference, by judea pearl, cambridge university press, 2000. Econometric Theory 19, 675–685.
  199. Deep learning-based semantic segmentation of urban features in satellite images: A review and meta-analysis. Remote Sensing 13, 808.
  200. Mapping urban environmental performance with emerging data sources: A case of urban greenery and traffic noise in sydney, australia. Sustainability 13, 605.
  201. Vehicle Energy Dataset (VED), a large-scale dataset for vehicle energy consumption research. IEEE Transactions on Intelligent Transportation Systems 23, 3302–3312. URL: https://api.semanticscholar.org/CorpusID:146120975.
  202. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748 .
  203. OpenAI, 2023. Gpt-4 technical report. arXiv:2303.08774.
  204. Fine-grained urban flow inference. IEEE transactions on knowledge and data engineering 34, 2755–2770.
  205. Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems 35, 27730–27744.
  206. Deep learning for financial applications: A survey. Applied Soft Computing 93, 106384.
  207. Spatial-temporal graph contrastive learning for urban traffic flow forecasting. Authorea Preprints .
  208. Urban traffic prediction from spatio-temporal data using deep meta learning, in: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, pp. 1720–1730.
  209. Spatio-temporal meta learning for urban traffic prediction. IEEE Transactions on Knowledge and Data Engineering 34, 1462–1476.
  210. How use of location-based social network (LBSN) services contributes to accumulation of social capital. Social Indicators Research 136, 379–396. doi:10.1007/s11205-016-1525-9.
  211. Causal inference in statistics: A primer. 2016. Internet resource .
  212. Generating efficient training data via llm-based attribute manipulation. arXiv preprint arXiv:2307.07099 .
  213. Quantifying the impacts of climate change and extreme climate events on energy systems. Nature Energy 5, 150–159.
  214. Points of interest (poi): a commentary on the state of the art, challenges, and prospects for the future. Computational Urban Science 2, 20.
  215. Modeling intra-and inter-community information for route and time prediction in last-mile delivery, in: 2023 IEEE 39th International Conference on Data Engineering (ICDE), IEEE. pp. 3106–3112.
  216. Forecasting fine-grained urban flows via spatio-temporal contrastive self-supervision. IEEE Transactions on Knowledge and Data Engineering .
  217. Learning transferable visual models from natural language supervision, in: International conference on machine learning, PMLR. pp. 8748–8763.
  218. Modeling multi-regional temporal correlation with gated recurrent unit and multiple linear regression for urban traffic flow prediction. Knowledge-Based Systems 262. doi:10.1016/j.knosys.2022.110237.
  219. Alignment in multimodal interaction: An integrative framework. Cognitive science 44, e12911.
  220. Scale-mae: A scale-aware masked autoencoder for multiscale geospatial representation learning, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 4088–4099.
  221. Sensitivity of hydrology and water quality to variation in land use and land cover data. Agricultural Water Management 241, 106366.
  222. Gpt4geo: How a language model sees the world’s geography. arXiv preprint arXiv:2306.00020 .
  223. Exploring the relationship between temporal fluctuations in satellite nightlight imagery and human mobility across africa. Remote Sensing 15. URL: https://doi.org/10.3390/rs15174252, doi:10.3390/rs15174252.
  224. High-resolution image synthesis with latent diffusion models, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10684–10695.
  225. Service time prediction for delivery tasks via spatial meta-learning, in: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 3829–3837.
  226. Climatebert-netzero: Detecting and assessing net zero and reduction targets. arXiv preprint arXiv:2310.08096 .
  227. Urban mobility report 2019 .
  228. Applications of artificial neural networks in health care organizational decision-making: A scoping review. PloS one 14, e0212356.
  229. Forecasting citywide traffic congestion based on social media. Wireless Personal Communications 103, 1037–1057.
  230. Scaling vision-language models with sparse mixture of experts. arXiv preprint arXiv:2303.07226 .
  231. Hugginggpt: Solving ai tasks with chatgpt and its friends in huggingface. arXiv preprint arXiv:2303.17580 .
  232. Convolutional lstm network: A machine learning approach for precipitation nowcasting. Advances in neural information processing systems 28.
  233. Large language models encode clinical knowledge. Nature 620, 172–180.
  234. Sustainable personal transport modes in a life cycle perspective—public or private? Sustainability 11, 7092.
  235. Monitoring finer-scale population density in urban functional zones: A remote sensing data fusion approach. Landscape and urban planning 190, 103580.
  236. Deep learning prediction of incoming rainfalls: An operational service for the city of beijing china, in: 2019 International Conference on Data Mining Workshops (ICDMW), IEEE. pp. 180–185.
  237. Deeptransport: Prediction and simulation of human mobility and transportation mode at a citywide level, in: Proceedings of the twenty-fifth international joint conference on artificial intelligence, pp. 2618–2624.
  238. DeepMob: Learning deep knowledge of human emergency behavior and mobility from big and heterogeneous data. ACM Transactions on Information Systems 35, 41:1–41:19. doi:10.1145/3057280.
  239. Measuring urban sprawl using land use data. Land Use Policy 97, 104799.
  240. Interpretability of machine learning-based prediction models in healthcare. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 10, e1379.
  241. Population research: convenience sampling strategies. Prehospital and disaster Medicine 36, 373–374.
  242. Predicting citywide crowd flows in irregular regions using multi-view graph convolutional networks. IEEE Transactions on Knowledge and Data Engineering 34, 2348–2359.
  243. Battery swapping dispatch for self-sustained highway energy system based on spatiotemporal deep-learning traffic flow prediction. IEEE Transactions on Industry Applications .
  244. Approximating online human evaluation of social chatbots with prompting, in: Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, pp. 268–281.
  245. A study of tourist sequential activity pattern through location based social network (LBSN), in: 2018 International Conference on Orange Technologies (ICOT), pp. 1–8. doi:10.1109/ICOT.2018.8705895.
  246. Recent advances of deep learning in bioinformatics and computational biology. Frontiers in genetics 10, 214.
  247. Spatio-temporal meta contrastive learning, in: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, pp. 2412–2421.
  248. Air quality and health co-benefits of china’s carbon dioxide emissions peaking before 2030. Nature communications 13, 1008.
  249. Similar trajectory search with spatio-temporal deep representation learning. ACM Transactions on Intelligent Systems and Technology (TIST) 12, 1–26.
  250. Large language models in medicine. Nature medicine 29, 1930–1940.
  251. The new data and new challenges in multimedia research. arXiv 1. arXiv:1503.01817.
  252. Debiasing nlu models via causal intervention and counterfactual reasoning, in: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 11376–11384.
  253. Generative information fusion, in: ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE. pp. 3990–3994.
  254. The projected timing of abrupt ecological disruption from climate change. Nature 580, 496–501.
  255. Optimizing the locations of electric taxi charging stations: A spatial–temporal demand coverage approach. Transportation Research Part C: Emerging Technologies 65, 172–189.
  256. Attention is all you need. Advances in neural information processing systems 30.
  257. Adapting text embeddings for causal inference, in: Conference on Uncertainty in Artificial Intelligence, PMLR. pp. 919–928.
  258. GeoSocialBound: An efficient framework for estimating social POI boundaries using spatio–textual information, in: Proceedings of the Third International ACM SIGMOD Workshop on Managing and Mining Enriched Geo-Spatial Data, ACM, San Francisco California. pp. 1–6. doi:10.1145/2948649.2948652.
  259. The prevalence of ms in the united states: a population-based estimate using health claims data. Neurology 92, e1029–e1040.
  260. GSNet: Learning spatial-temporal correlations from geographical and semantic aspects for traffic accident risk forecasting. Proceedings of the AAAI Conference on Artificial Intelligence 35, 4402–4409. doi:10.1609/aaai.v35i5.16566.
  261. Deep human-guided conditional variational generative modeling for automated urban planning, in: 2021 IEEE international conference on data mining (ICDM), IEEE. pp. 679–688.
  262. Human-instructed deep hierarchical generative learning for automated urban planning, in: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 4660–4667.
  263. Customizing 360-degree panoramas through text-to-image diffusion models, in: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 4933–4943.
  264. Spatio-temporal urban knowledge graph enabled mobility prediction. doi:10.48550/arXiv.2111.03465, arXiv:2111.03465.
  265. Spatio-temporal urban knowledge graph enabled mobility prediction. Proceedings of the ACM on interactive, mobile, wearable and ubiquitous technologies 5, 1–24.
  266. Developing an early-warning system for air quality prediction and assessment of cities in china. Expert systems with applications 84, 102–116.
  267. A survey on large language model based autonomous agents. arXiv preprint arXiv:2308.11432 .
  268. Does urbanization lead to less residential energy consumption? a comparative study of 136 countries. Energy 202, 117765.
  269. Deep learning for spatio-temporal data mining: A survey. IEEE Transactions on Knowledge and Data Engineering 34, 3681–3700.
  270. Estimating urban traffic congestions with multi-sourced data, in: 2016 17th IEEE International conference on mobile data management (MDM), IEEE. pp. 82–91.
  271. Citywide traffic congestion estimation with social media, in: Proceedings of the 23rd SIGSPATIAL international conference on advances in geographic information systems, pp. 1–10.
  272. Enhancing traffic congestion estimation with social media by coupled hidden markov model, in: Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2016, Riva del Garda, Italy, September 19-23, 2016, Proceedings, Part II 16, Springer. pp. 247–264.
  273. Pm2. 5-gnn: A domain knowledge enhanced graph neural network for pm2. 5 forecasting, in: Proceedings of the 28th international conference on advances in geographic information systems, pp. 163–166.
  274. Traffic accident risk prediction via multi-view multi-task spatio-temporal networks. IEEE Transactions on Knowledge and Data Engineering .
  275. Traffic accident risk prediction via multi-view multi-task spatio-temporal networks. IEEE Transactions on Knowledge and Data Engineering 35, 12323–12336. doi:10.1109/TKDE.2021.3135621.
  276. Computing urban traffic congestions by incorporating sparse gps probe data and social media data. ACM Transactions on Information Systems (TOIS) 35, 1–30.
  277. Visual commonsense r-cnn, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10760–10770.
  278. Where would i go next? large language models as human mobility predictors. arXiv preprint arXiv:2308.15197 .
  279. Personalized long-distance fuel-efficient route recommendation through historical trajectories mining, in: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining, ACM, Virtual Event AZ USA. pp. 1072–1080. doi:10.1145/3488560.3498512.
  280. Mapping an urban boundary based on multi-temporal sentinel-2 and POI data: A case study of zhengzhou city. Remote Sensing 12, 4103. doi:10.3390/rs12244103.
  281. Simvlm: Simple visual language model pretraining with weak supervision. arXiv preprint arXiv:2108.10904 .
  282. Multi-modality cross attention network for image and sentence matching, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10941–10950.
  283. Modeling spatial–temporal constraints and spatial-transfer patterns for couriers’ package pick-up route prediction. IEEE Transactions on Intelligent Transportation Systems .
  284. Enough waiting for the couriers: Learning to estimate package pick-up arrival time from couriers’ spatial-temporal behaviors. ACM Transactions on Intelligent Systems and Technology 14, 1–22.
  285. Diffstg: Probabilistic spatio-temporal graph forecasting with denoising diffusion models, in: Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems, pp. 1–12.
  286. Semantic annotation of mobility data using social media, in: Proceedings of the 24th international conference on world wide web, pp. 1253–1263.
  287. Mining spatio-temporal reachable regions over massive trajectory data, in: 2017 IEEE 33rd International Conference on Data Engineering (ICDE), pp. 1283–1294. doi:10.1109/ICDE.2017.171.
  288. How does energy consumption affect china’s urbanization? new evidence from dynamic threshold panel models. Energy policy 127, 24–38.
  289. G2ptl: A pre-trained model for delivery address and its applications in logistics system. arXiv preprint arXiv:2304.01559 .
  290. Lade: The first comprehensive last-mile delivery dataset from industry. arXiv preprint arXiv:2306.10675 .
  291. Bloomberggpt: A large language model for finance. arXiv preprint arXiv:2303.17564 .
  292. Updating road networks by local renewal from gps trajectories. ISPRS International Journal of Geo-Information 5, 163.
  293. Graph wavenet for deep spatial-temporal graph modeling. arXiv preprint arXiv:1906.00121 .
  294. Beyond the first law of geography: Learning representations of satellite imagery by leveraging point-of-interests, in: Proceedings of the ACM Web Conference 2022, ACM, Virtual Event, Lyon France. pp. 3308–3316. doi:10.1145/3485447.3512149.
  295. The rise and potential of large language model based agents: A survey. arXiv preprint arXiv:2309.07864 .
  296. Spatial-temporal sequential hypergraph network for crime prediction with dynamic multiplex relation learning, in: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, International Joint Conferences on Artificial Intelligence Organization.
  297. Deciphering spatio-temporal graph forecasting: A causal lens and treatment. Advances in Neural Information Processing Systems 36.
  298. A contextual master-slave framework on urban region graph for urban village detection, in: 2023 IEEE 39th International Conference on Data Engineering (ICDE), IEEE Computer Society. pp. 736–748. doi:10.1109/ICDE55515.2023.00062.
  299. Quert: Continual pre-training of language model for query understanding in travel domain search. arXiv preprint arXiv:2306.06707 .
  300. Urban flow prediction from spatiotemporal data using machine learning: A survey. Information Fusion 59, 1–12.
  301. Towards multi-dimensional knowledge-aware approach for effective community detection in LBSN. World Wide Web 26, 1435–1458. doi:10.1007/s11280-022-01101-7.
  302. Urban generative intelligence (ugi): A foundational platform for agents in embodied city environment. arXiv preprint arXiv:2312.11813 .
  303. Dynamic graph neural network with adaptive edge attributes for air quality prediction: A case study in china. Heliyon 9.
  304. A comprehensive survey of image augmentation techniques for deep learning. Pattern Recognition , 109347.
  305. Diffusion probabilistic modeling for fine-grained urban traffic flow inference with relaxed structural constraint, in: ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE. pp. 1–5.
  306. Exploring the combined impact of ecosystem services and urbanization on sdgs realization. Applied Geography 153, 102907.
  307. Leveraging language foundation models for human mobility forecasting, in: Proceedings of the 30th International Conference on Advances in Geographic Information Systems, pp. 1–9.
  308. When urban region profiling meets large language models. arXiv preprint arXiv:2310.18340 .
  309. DuARE: Automatic road extraction with aerial images and trajectory data at baidu maps, in: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery, New York, NY, USA. pp. 4321–4331. doi:10.1145/3534678.3539029.
  310. Diffusion models: A comprehensive survey of methods and applications. ACM Computing Surveys 56, 1–39.
  311. Local differential privacy and its applications: A comprehensive survey. Computer Standards & Interfaces , 103827.
  312. Multimodal encoder with gated cross-attention for text-vqa tasks, in: 29th Annual Conference of the Language Processing Society, pp. 1580–1585.
  313. Trajgat: A graph-based long-term dependency modeling approach for trajectory similarity computation, in: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 2275–2285.
  314. Deep multi-view spatial-temporal network for taxi demand prediction, in: Proceedings of the AAAI conference on artificial intelligence.
  315. Urbanization forces driving rural urban income disparity: Evidence from metropolitan areas in china. Journal of Cleaner Production 312, 127748.
  316. Zeroquant-v2: Exploring post-training quantization in llms from comprehensive study to low rank compensation. arXiv preprint arXiv:2303.08302 .
  317. mplug-owl: Modularization empowers large language models with multimodality. arXiv preprint arXiv:2304.14178 .
  318. Daily accessed street greenery and housing price: Measuring economic performance of human-scale streetscapes via new urban data. Sustainability 11, 1741.
  319. Predicting fine-grained air quality based on deep neural networks. IEEE Transactions on Big Data 8, 1326–1339.
  320. Deep distributed fusion network for air quality prediction, in: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pp. 965–973.
  321. Heterogeneous knowledge fusion: A novel approach for personalized recommendation via llm, in: Proceedings of the 17th ACM Conference on Recommender Systems, pp. 599–601.
  322. Multimodal deep learning for robust road attribute detection. ACM Transactions on Spatial Algorithms and Systems .
  323. Gps2vec: Towards generating worldwide gps embeddings, in: Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 416–419.
  324. A multi-task learning framework for road attribute updating via joint analysis of map data and GPS traces, in: Proceedings of The Web Conference 2020, Association for Computing Machinery, New York, NY, USA. pp. 2662–2668. doi:10.1145/3366423.3380021.
  325. Learning multi-context aware location representations from large-scale geotagged images, in: Proceedings of the 29th ACM International Conference on Multimedia, pp. 899–907.
  326. Short-term local weather forecast using dense weather station by deep neural network, in: 2018 IEEE international conference on big data (big data), IEEE. pp. 1683–1690.
  327. Panda: predicting road risks after natural disasters leveraging heterogeneous urban data. CCF Transactions on Pervasive Computing and Interaction 4, 393–407.
  328. Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting. arXiv preprint arXiv:1709.04875 .
  329. Coca: Contrastive captioners are image-text foundation models. arXiv preprint arXiv:2205.01917 .
  330. Temporal data meets llm–explainable financial time series forecasting. arXiv preprint arXiv:2306.11025 .
  331. A spatial–temporal graph attention network approach for air temperature forecasting. Applied Soft Computing 113, 107888.
  332. Personalized travel package with multi-point-of-interest recommendation based on crowdsourced user footprints. IEEE Transactions on Human-Machine Systems 46, 151–158.
  333. A survey of traffic prediction: from spatio-temporal data to intelligent transportation. Data Science and Engineering 6, 63–85.
  334. An effective joint prediction model for travel demands and traffic flows, in: 2021 IEEE 37th International Conference on Data Engineering (ICDE), IEEE. pp. 348–359.
  335. Driving with knowledge from the physical world, in: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 316–324.
  336. T-drive: driving directions based on taxi trajectories, in: Proceedings of the 18th SIGSPATIAL International conference on advances in geographic information systems, pp. 99–108.
  337. Pred: Periodic region detection for mobility modeling of social media users, in: Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, pp. 263–272.
  338. A review of deep learning methods for semantic segmentation of remote sensing imagery. Expert Systems with Applications 169, 114417.
  339. Activity trajectory generation via modeling spatiotemporal dynamics, in: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 4752–4762.
  340. Learning to simulate daily activities via modeling dynamic human needs, in: Proceedings of the ACM Web Conference 2023, pp. 906–916.
  341. Difftraj: Generating gps trajectory with diffusion probabilistic model, in: Proceedings of the 37th Annual Conference on Neural Information Processing Systems.
  342. A survey on federated learning. Knowledge-Based Systems 216, 106775.
  343. Chattraffic: Text-to-traffic generation via diffusion model. arXiv preprint arXiv:2311.16203 .
  344. Causal intervention for weakly-supervised semantic segmentation. Advances in Neural Information Processing Systems 33, 655–666.
  345. Mobility prediction: A survey on state-of-the-art schemes and future applications. IEEE access 7, 802–822.
  346. Cross on cross attention: Deep fusion transformer for image captioning. IEEE Transactions on Circuits and Systems for Video Technology .
  347. Deep spatio-temporal residual networks for citywide crowd flows prediction, in: Proceedings of the AAAI conference on artificial intelligence.
  348. Dnn-based prediction model for spatio-temporal data, in: Proceedings of the 24th ACM SIGSPATIAL international conference on advances in geographic information systems, pp. 1–4.
  349. Predicting citywide crowd flows using deep spatio-temporal residual networks. Artificial Intelligence 259, 147–166.
  350. Multi-modal graph interaction for multi-graph convolution network in urban spatiotemporal forecasting. Sustainability 14, 12397.
  351. Region embedding with intra and inter-view contrastive learning. IEEE Transactions on Knowledge and Data Engineering 35, 9031–9036. doi:10.1109/TKDE.2022.3220874.
  352. Adding conditional control to text-to-image diffusion models, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3836–3847.
  353. Multi-view joint graph representation learning for urban region embedding, in: Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, Yokohama, Yokohama, Japan. pp. 4431–4437.
  354. Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and composition. arXiv:2309.15112.
  355. Deep-air: A hybrid cnn-lstm framework for fine-grained air pollution estimation and forecast in metropolitan cities. IEEE Access 10, 55818–55841.
  356. Causal matching with text embeddings: A case study in estimating the causal effects of peer review policies, in: Findings of the Association for Computational Linguistics: ACL 2023, pp. 1284–1297.
  357. Extended vehicle energy dataset (eved): an enhanced large-scale dataset for deep learning on vehicle trip energy consumption. arXiv preprint arXiv:2203.08630 .
  358. Spatio-temporal fusion and contrastive learning for urban flow prediction. Knowledge-Based Systems 282, 111104.
  359. Linking urbanization and air quality together: A review and a perspective on the future sustainable urban development. Journal of Cleaner Production 346, 130988.
  360. Traffic flow forecasting with spatial-temporal graph diffusion network, in: Proceedings of the AAAI conference on artificial intelligence, pp. 15008–15015.
  361. Interpretable machine learning models for crime prediction. Computers, Environment and Urban Systems 94, 101789.
  362. Functional urban land use recognition integrating multi-source geospatial data and cross-correlations. Computers, Environment and Urban Systems 78, 101374.
  363. An enhanced gan model for automatic satellite-to-map image conversion. IEEE Access 8, 176704–176716.
  364. Deep fake geography? when geospatial data encounter artificial intelligence. Cartography and Geographic Information Science 48, 338–352.
  365. Pgeotopic: A distributed solution for mining geographical topic models. IEEE Transactions on Knowledge and Data Engineering 34, 881–893.
  366. Annotating points of interest with geo-tagged tweets, in: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, Association for Computing Machinery, New York, NY, USA. pp. 417–426. doi:10.1145/2983323.2983850.
  367. Bounding boxes are all we need: street view image classification via context encoding of detected buildings. IEEE Transactions on Geoscience and Remote Sensing 60, 1–17.
  368. Towards personalized maps: Mining user preferences from geo-textual data. Proceedings of the VLDB Endowment 9, 1545–1548.
  369. Spatio-temporal event forecasting using incremental multi-source feature learning. ACM Transactions on Knowledge Discovery from Data 16, 1–28. doi:10.1145/3464976.
  370. Multi-task learning for spatio-temporal event forecasting, in: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery, New York, NY, USA. p. 1503–1512. URL: https://doi.org/10.1145/2783258.2783377, doi:10.1145/2783258.2783377.
  371. Photo2trip: Exploiting visual contents in geo-tagged photos for personalized tour recommendation, in: Proceedings of the 25th ACM international conference on Multimedia, pp. 916–924.
  372. Effect of short-term regional traffic restriction on urban submicron particulate pollution. Journal of Environmental Sciences 55, 86–99.
  373. A survey of large language models. arXiv preprint arXiv:2303.18223 .
  374. The ai economist: Taxation policy design via two-level deep multiagent reinforcement learning. Science advances 8, eabk2607.
  375. Methodologies for cross-domain data fusion: An overview. IEEE transactions on big data 1, 16–34.
  376. Urban computing: concepts, methodologies, and applications. ACM Transactions on Intelligent Systems and Technology (TIST) 5, 1–55.
  377. A cloud-based knowledge discovery system for monitoring fine-grained air quality. MSR-TR-2014–40, Tech. Rep. .
  378. Disentangling user interest and conformity for recommendation with causal embedding, in: Proceedings of the Web Conference 2021, pp. 2980–2991.
  379. Understanding mobility based on gps data, in: Proceedings of the 10th international conference on Ubiquitous computing, pp. 312–321.
  380. Spatial planning of urban communities via deep reinforcement learning. Nature Computational Science 3, 748–762.
  381. U-air: When urban air quality inference meets big data, in: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 1436–1444.
  382. Road planning for slums via deep reinforcement learning, in: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, p. 5695–5706.
  383. Geolife: A collaborative social networking service among user, location and trajectory. .
  384. Forecasting fine-grained air quality based on big data, in: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery, New York, NY, USA. pp. 2267–2276. doi:10.1145/2783258.2788573.
  385. Mining interesting locations and travel sequences from gps trajectories, in: Proceedings of the 18th international conference on World wide web, pp. 791–800.
  386. Diffuflow: Robust fine-grained urban flow inference with denoising diffusion model, in: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, pp. 3505–3513.
  387. One fits all: Power general time series analysis by pretrained lm. arXiv preprint arXiv:2302.11939 .
  388. Maintaining the status quo: Capturing invariant relations for ood spatiotemporal learning .
  389. Inferring region significance by using multi-source spatial data. Neural Computing and Applications 32, 6523–6531.
  390. Neural attention for image captioning: review of outstanding methods. Artificial Intelligence Review 55, 3833–3862.
  391. Learning data augmentation strategies for object detection, in: Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XXVII 16, Springer. pp. 566–583.
Citations (19)

Summary

We haven't generated a summary for this paper yet.