Proceedings of the
9th International Symposium for Geotechnical Safety and Risk (ISGSR)
25 – 28 August 2025, Oslo, Norway
Editors: Zhongqiang Liu, Jian Dai and Kate Robinson

The Study of Predicting Corrosion Failure Risks in Urban Pipeline Networks Based on Machine Learning

Zongyuan Zhang1, Qunfang Hu2,a, Fei Wang2,b, Zhan Su2,c and Jiahua Zhou3

1Urban Mobility Institute, Tongji University, Shanghai, China.

2110795@tongji.edu.cn

2Shanghai Institute of Disaster Prevention and Relief, Tongji University, Shanghai, China.

ahuqunf@tongji.edu.cn

bwangf@tongji.edu.cn

c2305219@tongji.edu.cn

3College of Civil Engineering, Tongji University, Shanghai, China.

zjh1571114545@163.com

ABSTRACT

Corrosion is one of the primary causes of pipeline network failures, significantly impacting urban health and safety. To prevent pipeline failures, predicting pipeline corrosion has become a critical requirement for operators in daily operations and asset management. This study uses 20 years of maintenance data from water supply networks with DN300 and above in a certain area of Shanghai as a sample, considering 10 variables as input features, including pipeline attributes (e.g., material, diameter, age) and environmental factors (e.g., burial depth, road type, distance to subway). To address the issue of data imbalance, three data processing methods are employed: random oversampling, synthetic minority over-sampling technique for nominal and continuous (SMOTENC), and Adaptive Synthetic Sampling (ADASYN). Three machine learning models, including Support Vector Classifier (SVC), Logistic Regression (LR), and LightGBM, are developed for comparative analysis to predict pipeline corrosion failure risks. The results show that the LightGBM model with SMOTENC performs excellently in identifying urban pipeline corrosion, with an accuracy of 0.936 and an AUC of 0.925. Finally, the corrosion failure risk of the pipeline network in the region was assessed, and the risk was classified into five levels from high to low using the Jenks Natural Breaks method. The distribution of the pipeline network across these risk levels is as follows: 0.42%, 2.39%, 28.68%, 35.68%, and 32.83%.This study provides valuable insights into the safe operation of urban pipeline networks and helps operators develop more effective prevention and maintenance strategies, enhancing the safety and reliability of urban water supply networks.

Keywords: Pipeline networks, Machine learning, Pipeline corrosion, Data processing, Predictive modeling, Failure prediction.



Download PDF