Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
129 tokens/sec
GPT-4o
28 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Large-scale Benchmark Dataset for Commuting Origin-destination Matrix Generation (2407.15823v3)

Published 22 Jul 2024 in cs.SI

Abstract: The commuting origin-destination~(OD) matrix is a critical input for urban planning and transportation, providing crucial information about the population residing in one region and working in another within an interested area. Despite its importance, obtaining and updating the matrix is challenging due to high costs and privacy concerns. This has spurred research into generating commuting OD matrices for areas lacking historical data, utilizing readily available information via computational models. In this regard, existing research is primarily restricted to only a single or few large cities, preventing these models from being applied effectively in other areas with distinct characteristics, particularly in towns and rural areas where such data is urgently needed. To address this, we propose a large-scale dataset comprising commuting OD matrices for 3,233 diverse areas around the U.S. For each area, we provide the commuting OD matrix, combined with regional attributes including demographics and point-of-interests of each region in that area. We believe this comprehensive dataset will facilitate the development of more generalizable commuting OD matrix generation models, which can capture various patterns of distinct areas. Additionally, we use this dataset to benchmark a set of commuting OD generation models, including physical models, element-wise predictive models, and matrix-wise generative models. Surprisingly, we find a new paradigm, which considers the whole area combined with its commuting OD matrix as an attributed directed weighted graph and generates the weighted edges based on the node attributes, can achieve the optimal. This may inspire a new research direction from graph learning in this field.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (42)
  1. Human mobility: Models and applications. Physics Reports, 734:1–74, 2018.
  2. Michael Batty. Cities and complexity: understanding cities with cellular automata, agent-based models, and fractals. The MIT press, 2007.
  3. Netgan: Generating graphs via random walks. In International conference on machine learning, pages 610–619. PMLR, 2018.
  4. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
  5. A generalization of transformer networks to graphs. arXiv preprint arXiv:2012.09699, 2020.
  6. Understanding individual human mobility patterns. nature, 453(7196):779–782, 2008.
  7. Denoising diffusion probabilistic models. Advances in neural information processing systems, 33:6840–6851, 2020.
  8. Origin-destination trips generated from operational data of a mobile network for urban transportation planning. Journal of Urban Planning and Development, 147(1):04020049, 2021.
  9. Development of origin–destination matrices using mobile phone call data. Transportation Research Part C: Emerging Technologies, 40:63–74, 2014.
  10. The timegeo modeling framework for urban mobility without travel surveys. Proceedings of the National Academy of Sciences, 113(37):E5370–E5378, 2016.
  11. Origin-destination matrix estimation using socio-economic information and traffic counts on uncongested networks. International Journal of Transportation Engineering, 8(2):165–183, 2020.
  12. Systematic comparison of trip distribution laws and models. Journal of Transport Geography, 51:158–169, 2016.
  13. Influence of sociodemographic characteristics on human mobility. Scientific reports, 5(1):10075, 2015.
  14. Learning geo-contextual embeddings for commuting flow prediction. In Proceedings of the AAAI conference on artificial intelligence, volume 34, pages 808–816, 2020.
  15. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101, 2017.
  16. Improved denoising diffusion probabilistic models. In International Conference on Machine Learning, pages 8162–8171. PMLR, 2021.
  17. OpenStreetMap contributors. Planet dump retrieved from https://planet.osm.org . https://www.openstreetmap.org, 2017.
  18. Film: Visual reasoning with a general conditioning layer. In Proceedings of the AAAI conference on artificial intelligence, volume 32, 2018.
  19. Trip distribution modeling with twitter data. Computers, Environment and Urban Systems, 77:101354, 2019.
  20. Enhancing trip distribution prediction with twitter data: comparison of neural network and gravity models. In Proceedings of the 2nd acm sigspatial international workshop on ai for geographic knowledge discovery, pages 5–8, 2018.
  21. Improving language understanding by generative pre-training. 2018.
  22. A machine learning approach to modeling human migration. In Proceedings of the 1st ACM SIGCAS Conference on Computing and Sustainable Societies, pages 1–8, 2018.
  23. Origin–destination matrix estimation and prediction from socioeconomic variables using automatic feature selection procedure-based machine learning model. Journal of Urban Planning and Development, 147(4):04021056, 2021.
  24. An interdisciplinary survey on origin-destination flows modeling: Theory and techniques. arXiv preprint arXiv:2306.10048, 2023.
  25. Complexity-aware large scale origin-destination network generation via diffusion model. arXiv preprint arXiv:2306.04873, 2023.
  26. Goddag: generating origin-destination flow for new cities via domain adversarial training. IEEE Transactions on Knowledge and Data Engineering, 2023.
  27. Origin-destination network generation via gravity-guided gan. arXiv preprint arXiv:2306.03390, 2023.
  28. A complex network perspective for characterizing urban travel demand patterns: graph theoretical analysis of large-scale origin–destination demand networks. Transportation, 44:1383–1402, 2017.
  29. A complex network methodology for travel demand model evaluation and validation. Networks and Spatial Economics, 18:1051–1073, 2018.
  30. Using google’s passive data and machine learning for origin-destination demand estimation. Transportation Research Record, 2672(46):73–82, 2018.
  31. A deep gravity model for mobility flows generation. Nature communications, 12(1):6576, 2021.
  32. A universal model for mobility and migration patterns. Nature, 484(7392):96–100, 2012.
  33. Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502, 2020.
  34. U.S. Census Bureau. 2009-2011 american community survey 3-year public use microdata samples [sas data file]. https://factfinder.census.gov/faces/nav/jsf/pages/searchresults.xhtml?refresh=t, 2012.
  35. Graph attention networks. arXiv preprint arXiv:1710.10903, 2017.
  36. Digress: Discrete denoising diffusion for graph generation. arXiv preprint arXiv:2209.14734, 2022.
  37. Origin-destination matrix prediction via graph convolution: a new perspective of passenger demand modeling. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, pages 1227–1235, 2019.
  38. Urban dynamics through the lens of human mobility. Nat Comput Sci, 3:611–620, 2023.
  39. Spatial origin-destination flow imputation using graph convolutional networks. IEEE Transactions on Intelligent Transportation Systems, 22(12):7474–7484, 2020.
  40. Causal learning empowered od prediction for urban planning. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management, pages 2455–2464, 2022.
  41. Detecting the dynamics of urban structure through spatial network analysis. International Journal of Geographical Information Science, 28(11):2178–2199, 2014.
  42. George Kingsley Zipf. The p 1 p 2/d hypothesis: on the intercity movement of persons. American sociological review, 11(6):677–686, 1946.
Citations (1)

Summary

We haven't generated a summary for this paper yet.