Data-Driven Approaches for Forecasting Cost Overruns in Infrastructure Projects
DOI:
https://doi.org/10.47672/ejt.2891Keywords:
Cost Overruns, Infrastructure Projects, Construction Estimation, Hyperparameter Optimization, SHAP Analysis, Predictive Modeling, Sustainable DevelopmentAbstract
Purpose: The aim of the study was to assess sustainable development depends on the infrastructure projects that affect the long-term economic, social, and environmental impacts. Cost overruns, however, tend to occur in construction projects.
Methodology: This research study uses the Build Bridges, Not Walls dataset available in the U.S. National Bridge Inventory, which contains over 600,000 bridge records, and builds a predictive model to estimate the number of infrastructure cost overruns. To begin with, it involved extensive preprocessing, including imputing missing values, scaling the features, and analyzing their correlations. The model was assessed using R2, RMSE, MAE, MAPE, and SHAP interpretability indicators.
Findings: The XGBoost model performed very well on the prediction accuracy metric, achieving an R2 of 95.67%, exceeding the ANN benchmark (R2 = 77.3%). The analysis of the Shapley values showed that landform and location were the most influential features, with construction type being a minor factor. Additionally, the strength of XGBoost was further confirmed through evaluations of RMSE, loss curves, and feature-wise performance plots.
Implications to Theory, Practice and Policy: The overall conclusion from the study is that applying ensemble learning methods especially XGBoost was the right approach for plausible costing of large-scale infrastructure projects, leading to better planning and decision-making.
Downloads
References
[1] E. Plebankiewicz, “Model of Predicting Cost Overrun in Construction Projects,” Sustainability, vol. 10, no. 12, p. 4387, Nov. 2018, doi: 10.3390/su10124387.
[2] S. Garg, “AI/ML Driven Proactive Performance Monitoring, Resource Allocation and Effective Cost Management in SaaS Operations,” Int. J. Core Eng. Manag., vol. 6, no. 6, pp. 263–273, 2019.
[3] C. Deb, F. Zhang, J. Yang, S. E. Lee, and K. W. Shah, “A review on time series forecasting techniques for building energy consumption,” Renew. Sustain. Energy Rev., vol. 74, pp. 902–924, Jul. 2017, doi: 10.1016/j.rser.2017.02.085.
[4] X. Jiang, L. Zhang, C. Xiong, and R. Wang, “Transportation and Regional Economic Development: Analysis of Spatial Spillovers in China Provincial Regions,” Networks Spat. Econ., vol. 16, no. 3, pp. 769–790, Sep. 2016, doi: 10.1007/s11067-015-9298-2.
[5] J. H. Min, W. Jang, S. H. Han, D. Kim, and Y. H. Kwak, “How Conflict Occurs and What Causes Conflict: Conflict Analysis Framework for Public Infrastructure Projects,” J. Manag. Eng., vol. 34, no. 4, Jul. 2018, doi: 10.1061/(ASCE)ME.1943-5479.0000625.
[6] S. Garg, “Predictive Analytics and Auto Remediation using Artificial Intelligence and Machine learning in Cloud Computing Operations,” Int. J. Innov. Res. Eng. Multidiscip. Phys. Sci., vol. 7, no. 2, 2019, doi: 10.5281/zenodo.15362327.
[7] A. V Bataev, “Efficiency Estimation Model of 3D Technology in the Construction Industry,” in 2019 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus), 2019, pp. 1376–1381. doi: 10.1109/EIConRus . 2019.8657181.
[8] R. Bao, R. Komatsu, R. Miyagusuku, M. Chino, A. Yamashita, and H. Asama, “Cost-effective and Robust Visual-Based Localization with Consumer-level Cameras at Construction Sites,” in 2019 IEEE 8th Global Conference on Consumer Electronics (GCCE), 2019, pp. 983–985. doi: 10.1109/GCCE46687.2019.9015417.
[9] M. S. Bajjou and A. Chafi, “Towards implementing lean construction in the Moroccan construction industry: Survey study,” in 2018 4th International Conference on Optimization and Applications (ICOA), 2018, pp. 1–5. doi: 10.1109/ICOA.2018.8370556.
[10] S. S. Arage and N. V. Dharwadkar, “Cost estimation of civil construction projects using machine learning paradigm,” in 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), 2017, pp. 594–599. doi: 10.1109/I-SMAC.2017.8058249.
[11] R. M. N. Hemal, K. G. A. S. Waidyasekara, and E. M. A. C. Ekanayake, “Integrating design freeze into large-scale construction projects in Sri Lanka,” in 2017 Moratuwa Engineering Research Conference (MERCon), 2017, pp. 259–264. doi: 10.1109/MERCon . 2017.7980492.
[12] M. Morad and S. M. El-Sayegh, “Use of Earned Value Management in the UAE Construction Industry,” in 2016 International Conference on Industrial Engineering, Management Science and Application (ICIMSA), 2016, pp. 1–5. doi: 10.1109/ICIMSA.2016.7504044.
[13] L. Pang, J. Wang, L. Zhao, C. Wang, and H. Zhan, “A Novel Protein Subcellular Localization Method With CNN-XGBoost Model for Alzheimer’s Disease,” vol. 9, no. January, pp. 1–7, 2019, doi: 10.3389/fgene . 2018.00751.
[14] H. Zheng and Y. Wu, “A XGBoost Model with Weather Similarity Analysis and Feature Engineering for Short-Term Wind Power Forecasting,” Appl. Sci., vol. 9, no. 15, p. 3019, Jul. 2019, doi: 10.3390/app9153019.
[15] F. M. S. Al-zwainy and I. A. Aidan, “Forecasting the Cost of Structure of Infrastructure Projects Utilizing Artificial Neural Network Model ( Highway Projects as Case Study ),” vol. 10, no. May, 2017, doi: 10.17485/ijst/2017/v10i20/108567.
Attipalli, A., BITKURI, V., KURMA, J., Enokkaren, S., Kendyala, R., & Mamidala, J. V. (2021). A Survey of Artificial Intelligence Methods in Liquidity Risk Management: Challenges and Future Directions. Available at SSRN 5741342.
Attipalli, A., BITKURI, V., Mamidala, J. V., Kendyala, R., & KURMA, J. (2022). Empowering Cloud Security with Artificial Intelligence: Detecting Threats Using Advanced Machine learning Technologies. Available at SSRN 5741263.
Attipalli, A., Enokkaren, S., BITKURI, V., Kendyala, R., KURMA, J., & Mamidala, J. V. (2021). Enhancing Cloud Infrastructure Security Through AI-Powered Big Data Anomaly Detection. Available at SSRN 5741305.
Bitkuri, V., Kendyala, R., Kurma, J., Mamidala, V., Enokkaren, S. J., & Attipalli, A. (2021). Systematic Review of Artificial Intelligence Techniques for Enhancing Financial Reporting and Regulatory Compliance. International Journal of Emerging Trends in Computer Science and Information Technology, 2(4), 73-80.
Kothamaram, R. R., Rajendran, D., Namburi, V. D., Singh, A. A. S., Tamilmani, V., & Maniar, V. (2021). A Survey of Adoption Challenges and Barriers in Implementing Digital Payroll Management Systems in Across Organizations. International Journal of Emerging Research in Engineering and Technology, 2(2), 64-72.
Padur, S. K. R. (2018). Autonomous cloud economics: AI driven right sizing and cost optimization in hybrid infrastructures. International Journal of Scientific Research in Science and Technology, 4(5), 2090-2097.
Padur, S. K. R. (2019). Machine learning for predictive capacity planning: Evolution from analytical modeling to autonomous infrastructure. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 5(5), 285-293.
Padur, S. K. R. (2020). AI augmented disaster recovery simulations: From chaos engineering to autonomous resilience orchestration. International Journal of Scientific Research in Science, Engineering and Technology, 7(6), 367-378.
Padur, S. K. R. (2020). From centralized control to democratized insights: Migrating enterprise reporting from IBM Cognos to Microsoft Power BI. Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol, 6(1), 218-225.
Padur, S. K. R. (2021). Bridging Human, System, and Cloud Integration through RESTful Automation and Governance. the International Journal of Science, Engineering and Technology, 9(6).
Padur, S. K. R. (2021). Deep learning and process mining for ERP anomaly detection: Toward predictive and self-monitoring enterprise platforms. Available at SSRN 5605531.
Padur, S. K. R. (2021). From Control to Code: Governance Models for Multi-Cloud ERP Modernization. International Journal of Scientific Research & Engineering Trends, 7(3).
Padur, S. K. R. (2022). AI augmented platform engineering, transforming developer experience through intelligent automation and self optimizing internal platforms. International Journal of Science, Engineering and Technology, 10(5), 10-5281.
Padur, S. K. R. (2022). Intelligent resource management: AI methods for predictive workload forecasting in cloud data centers. J. Artif. Intell. Mach. Learn. & Data Sci, 1(1), 2936-2941.
Polam, R. M., Kamarthapu, B., Kakani, A. B., Nandiraju, S. K. K., Chundru, S. K., & Vangala, S. R. (2022). Data Security in Cloud Computing: Encryption, Zero Trust, and Homomorphic Encryption. International Journal of Emerging Trends in Computer Science and Information Technology, 3(4), 31-41.
Polu, A. R., Buddula, D. V. K. R., Narra, B., Gupta, A., Vattikonda, N., & Patchipulusu, H. (2021). Evolution of AI in Software Development and Cybersecurity: Unifying Automation, Innovation, and Protection in the Digital Age. Available at SSRN 5266517.
Rajendran, D., Namburi, V. D., Singh, A. A. S., Tamilmani, V., Maniar, V., & Kothamaram, R. R. (2021). Anomaly Identification in IoT-Networks Using Artificial Intelligence-Based Data-Driven Techniques in Cloud Environmen. International Journal of Emerging Trends in Computer Science and Information Technology, 2(2), 83-91.
Reddy Padur, S. K. (2021). From Scripts to Platforms-as-Code: The Role of Terraform and Ansible in Declarative Infrastructure Rollouts. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 621-628.
Routhu, K. K. (2021). Harnessing AI Dashboards in Oracle Cloud HCM: Advancing Predictive Workforce Intelligence and Managerial Agility. International Journal of Scientific Research & Engineering Trends, 7(6).
Routhu, K. K. (2022). From Case Management to Conversational HR: Redefining Help Desks with Oracle’s AI and NLP Framework. International Journal of Science, Engineering and Technology, 10(6).
Routhu, K. K. (2022). From RFID to Geofencing: IoT-Enabled Smart Time Tracking in Oracle HCM Cloud. International Journal of Science, Engineering and Technology, 10(4).
Singh, A. A. S., Tamilmani, V., Maniar, V., Kothamaram, R. R., Rajendran, D., & Namburi, V. D. (2021). Predictive Modeling for Classification of SMS Spam Using NLP and ML Techniques. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 2(4), 60-69.
Vattikonda, N., Gupta, A. K., Polu, A. R., Narra, B., Buddula, D. V. K. R., & Patchipulusu, H. H. S. (2022). Blockchain Technology in Supply Chain and Logistics: A Comprehensive Review of Applications, Challenges, and Innovations. International Journal of Emerging Trends in Computer Science and Information Technology, 3(3), 72-80.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Sridhar Reddy Bandaru, Dhuli Shyam, Prabu Manoharan, Muzaffer Hussain Syed, , Uday Kumar Ragireddy, , Prasanth Varma Addepalli

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution (CC-BY) 4.0 License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.