Generating Synthetic Population (2209.09961v2)
Abstract: In this paper, we provide a method to generate synthetic population at various administrative levels for a country like India. This synthetic population is created using machine learning and statistical methods applied to survey data such as Census of India 2011, IHDS-II, NSS-68th round, GPW etc. The synthetic population defines individuals in the population with characteristics such as age, gender, height, weight, home and work location, household structure, preexisting health conditions, socio-economical status, and employment. We used the proposed method to generate the synthetic population for various districts of India. We also compare this synthetic population with source data using various metrics. The experiment results show that the synthetic data can realistically simulate the population for various districts of India.
- Centre for Research in Micro Census Data, 2011. URL https://www.isical.ac.in/~library/census.php.
- Creating synthetic baseline populations. Transportation Research Part A: Policy and Practice, 30(0965-8564):415–429, 1996.
- Bonabeau, E. Agent-based modeling: Methods and techniques for simulating human systems. Proceedings of the National Academy of Sciences, 99(3):7280–7287, 2002.
- CIESIN - Columbia University. Gridded population of the world, (GPWv4): Population density, 2016.
- On a Least Squares Adjustment of a Sampled Frequency Table When the Expected Marginal Totals are Known. The Annals of Mathematical Statistics, 11(4):427–444, 1940.
- India Human Development Survey-II (IHDS-II), 2011-12. Inter-university Consortium for Political and Social Research, 2018.
- Generative adversarial nets. In Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N. D., and Weinberger, K. Q. (eds.), Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, December 8-13 2014, pp. 2672–2680, Montreal, Quebec, Canada, 2014.
- J. Hijmans, R. Database of Global Administrative Areas, 2018. URL https://gadm.org/data.html.
- Ministry of Education, GoI. Steps taken by Government to Provide Education to Poor Students, 07 2019. URL https://pib.gov.in/PressReleasePage.aspx?PRID=1578389.
- NSSO. India - Household Consumer Expenditure, NSS 68th Round . Technical report, 2018. URL http://microdata.gov.in/nada43/index.php/catalog/126.
- Office of the Registrar General and Census Commissioner of India, India. CENSUS TABLES, 2011. URL https://censusindia.gov.in/census.website/data/census-tables.
- Modeling tabular data using conditional gan. In Advances in Neural Information Processing Systems, 2019.
- Methodology to match distributions of both household and person attributes in generation of synthetic populations. 01 2009.