Predicting Rental Price of Lane Houses in Shanghai with Machine Learning Methods and Large Language Models (2405.17505v1)
Abstract: Housing has emerged as a crucial concern among young people living in major cities, including Shanghai. Given the unprecedented surge in property prices in this metropolis, young people have increasingly turned to the rental market to meet their housing needs. This study uses five traditional machine learning methods, namely multiple linear regression (MLR), ridge regression (RR), lasso regression (LR), decision tree (DT), and random forest (RF), along with an LLM approach using ChatGPT, to predict the rental prices of lane houses in Shanghai. It applies these methods to a public data sample of about 2,609 lane house rental transactions in Shanghai in 2021 and compares their results. Three performance metrics are used to evaluate the models: mean squared error (MSE), mean absolute error (MAE), and R-Squared. In terms of predictive power, RF achieves the best performance among the traditional methods. However, the LLM approach, particularly in the 10-shot scenario, shows promising results that surpass the traditional methods in terms of R-Squared. We conclude that while traditional machine learning models offer robust techniques for rental price prediction, the integration of LLMs such as ChatGPT holds significant potential for enhancing predictive accuracy.
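The three evaluation metrics named in the abstract have standard definitions, and a minimal sketch of them may help make the model comparison concrete. The code below is an illustration in plain Python, not the authors' implementation; the function names and the sample rent values are hypothetical and do not come from the paper's data.

```python
from typing import Sequence


def mean_squared_error(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Average of the squared differences between actual and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def mean_absolute_error(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Average of the absolute differences between actual and predicted values."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical monthly rents (illustrative values only, not from the dataset).
actual_rents = [5000.0, 6500.0, 8000.0, 7200.0]
predicted_rents = [5200.0, 6300.0, 7900.0, 7600.0]

mse = mean_squared_error(actual_rents, predicted_rents)
mae = mean_absolute_error(actual_rents, predicted_rents)
r2 = r_squared(actual_rents, predicted_rents)
print(f"MSE={mse:.1f}  MAE={mae:.1f}  R-Squared={r2:.4f}")
```

Lower MSE and MAE indicate smaller prediction errors, while an R-Squared closer to 1 indicates that the model explains more of the variance in rent. This is why a model can rank differently depending on which metric is used.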