Citation: | Liang Z H,Miao P Y,Wang J M,Wang Z F. 2024. Earthquake loss prediction based on random forest algorithm. Acta Seismologica Sinica,46(4):649−662. DOI: 10.11939/jass.20220182 |
Rapid assessment of building damage and its severity after an earthquake is crucial for emergency response and recovery. Accurate earthquake damage assessment is crucial for pre-earthquake disaster prevention and mitigation, post-earthquake disaster relief, and rapid reconstruction. Most existing studies based on actual earthquake damage assessment are limited to a specific region and a particular structure type, and the number of data samples used is also limited, resulting in subpar generalization performance for the model. Many factors affect the loss of buildings due to earthquakes. Traditional methods cannot fully consider the complex mapping relationship between the influencing factors. Therefore, finding a method to quickly and accurately assess building damage is essential. Machine learning provides a data-driven artificial intelligence method that can handle complex nonlinear relationships between input and output parameters by learning the underlying laws of big data. This paper proposes an earthquake damage prediction model based on combination of Bayesian optimization algorithm, synthetic minority over-sampling technique (SMOTE), and random forest algorithm. The core of the Bayesian optimization algorithm takes prior knowledge into account. It can continuously update and iterate until the optimal parameter combination is fitted, solving the problem of slow efficiency of traditional parameter adjustment. The core of the SMOTE method is to generate data samples of a few categories, solving the problem of uneven distribution of data samples. Based on the random forest model, this paper uses 378 037 actual building damage data from the March 11, 2011, MW9.0 Tohoku-Oki, Japan earthquake, comprehensively considers multidimensional building information such as ground shaking information, site information, and structural characteristics, and uses the earthquake damage classification issued by the American Applied Technical Council (ATC-13). This model can predict the damage caused by earthquake damage to buildings and analyze the feature importance of factors affecting building damage. The results show that after using SMOTE method to solve data imbalance and the Bayesian approach to optimize hyper-parameters, the accuracy on the test set of the random forest-based prediction model is 68.8%, and the recall rates for minor damage, moderate damage, severe damage and collapse are 65.0%, 53.6%, 74.8%, and 81.8%, respectively; the accuracy of the model is further increased to 87.5% by considering the life safety performance to convert the model to dichotomous classification, which significantly improves the existing research problems in building loss prediction, such as limited data, lack of regional generalization, lack of diversity in building attributes, imprecise classification of damage levels and low accuracy of the most severe damage state. The study of the importance of random forest features showed that the epicenter distance, PGA and vS30 have the most significant influences on the model output.The earthquake damage assessment model established by this study can achieve rapid and relatively accurate prediction of building damage caused by earthquakes, which is beneficial for pre-earthquake planning and timely rescue after the earthquake.
鲍跃全,李惠. 2019. 人工智能时代的土木工程[J]. 土木工程学报,52(5):1–11.
|
Bao Y Q,Li H. 2019. Artificial intelligence for civil engineering[J]. China Civil Engineering Journal,52(5):1–11 (in Chinese).
|
孙柏涛,胡少卿. 2005. 基于已有震害矩阵模拟的群体震害预测方法研究[J]. 地震工程与工程振动,25(6):102–108.
|
Sun B T,Hu S Q. 2005. A method for earthquake damage prediction of building group based on existing earthquake damage matrix[J]. Earthquake Engineering and Engineering Vibration,25(6):102–108 (in Chinese).
|
王健峰. 2012. 基于改进网格搜索法SVM参数优化的说话人识别研究[D]. 哈尔滨:哈尔滨工程大学:25−26.
|
Wang J F. 2012. Study on Speaker Recognition Based on Improved Grid Search Parameters Optimization Algorithm of SVM[D]. Harbin:Harbin Engineering University:25−26 (in Chinese).
|
王自法,Park S,Lee S,崔凯. 2014. 提高地震灾害损失估计精度的几点研究[J]. 地震工程与工程振动,34(4):110–114.
|
Wang Z F,Park S,Lee S,Cui K. 2014. Quantification improvement of earthquake loss estimation[J]. Earthquake Engineering and Engineering Dynamics,34(4):110–114 (in Chinese).
|
隗永刚,蒋长胜. 2021. 人工智能技术在地震减灾应用中的研究进展[J]. 地球物理学进展,36(2):516–524. doi: 10.6038/pg2021EE0164
|
Wei Y G,Jiang C S. 2021. Research progress of artificial intelligence technology in the application of earthquake disaster reduction[J]. Progress in Geophysics,36(2):516–524 (in Chinese).
|
杨旭,李永华,盖增喜. 2021. 机器学习在地震学中的应用进展[J]. 地球与行星物理论评,52(1):76–88.
|
Yang X,Li Y H,Gai Z X. 2021. Machine learning and its application in seismology[J]. Reviews of Geophysics and Planetary Physics,52(1):76–88 (in Chinese).
|
杨毅,卢诚波,徐根海. 2017. 面向不平衡数据集的一种精化Borderline-SMOTE方法[J]. 复旦学报(自然科学版),56(5):537–544.
|
Yang Y,Lu C B,Xu G H. 2017. A refined borderline-SMOTE method for imbalanced data set[J]. Journal of Fudan University (Natural Science),56(5):537–544 (in Chinese).
|
于红梅,许建东,张素灵,潘波. 2006. 基于集集地震的建筑物易损性统计分析[J]. 防灾科技学院学报,8(4):17–20. doi: 10.3969/j.issn.1673-8047.2006.04.004
|
Yu H M,Xu J D,Zhang S L,Pan B. 2006. The statistical analysis of building vulnerability research on Jiji earthquake[J]. Journal of Institute of Disaster Prevention,8(4):17–20 (in Chinese).
|
张风华,谢礼立,范立础. 2004. 城市建构筑物地震损失预测研究[J]. 地震工程与工程振动,24(3):12–20. doi: 10.3969/j.issn.1000-1301.2004.03.002
|
Zhang F H,Xie L L,Fan L C. 2004. A study on disaster loss prediction caused by damaged structures under earthquake[J]. Earthquake Engineering and Engineering Vibration,24(3):12–20 (in Chinese).
|
张桂欣,孙柏涛. 2018. 基于模糊层次分析的建筑物单体震害预测方法研究[J]. 工程力学,35(12):185–193.
|
Zhang G X,Sun B T. 2018. Seismic damage prediction for a single building based on a fuzzy analytical hierarchy approach[J]. Engineering Mechanics,35(12):185–193 (in Chinese).
|
张浩. 2018. 自动化特征工程与参数调整算法研究[D]. 成都:电子科技大学:26.
|
Zhang H. 2018. Research of Automatic Feature Engineering and Parameter Adjustment Algorithm[D]. Chengdu:University of Electronic Science and Technology of China:26 (in Chinese).
|
张天翼,丁立新. 2021. 一种基于SMOTE的不平衡数据集重采样方法[J]. 计算机应用与软件,38(9):273–279.
|
Zhang T Y,Ding L X. 2021. A new resampling method based on SMOTE for imbalanced data set[J]. Computer Applications and Software,38(9):273–279 (in Chinese).
|
赵登科,王自法,刘渊,仝文博. 2021. 基于新西兰实际震害资料的地震损失不确定性分析[J]. 地震工程与工程振动,41(2):84–95.
|
Zhao D K,Wang Z F,Liu Y,Tong W B. 2021. Earthquake loss uncertainty based on detailed loss data in New Zealand[J]. Earthquake Engineering and Engineering Dynamics,41(2):84–95 (in Chinese).
|
Applied Technology Council. 1985. Earthquake Damage Evaluation Data for California[M]. Redwood City:Applied Technology Council:167−219.
|
Bergstra J,Bengio Y. 2012. Random search for hyper-parameter optimization[J]. J Machine Learn Res,13:281–305.
|
Breiman L. 2001. Random forests[J]. Mach Learn,45(1):5–32. doi: 10.1023/A:1010933404324
|
Calvi G M,Pinho R,Magenes G,Bommer J J,Restrepo-Vélez L F,Crowley H. 2006. Development of seismic vulnerability assessment methodologies over the past 30 years[J]. ISET J Earthq Technol,43(3):75–104.
|
Chawla N V,Bowyer K W,Hall L O,Kegelmeyer W P. 2002. SMOTE:Synthetic minority over-sampling technique[J]. J Artif Intell Res,16:321–357. doi: 10.1613/jair.953
|
Ghimire S,Guéguen P,Giffard-Roisin S,Schorlemmer D. 2022. Testing machine learning models for seismic damage prediction at a regional scale using building-damage dataset compiled after the 2015 Gorkha Nepal earthquake[J]. Earthq Spectra,38(4):2970–2993. doi: 10.1177/87552930221106495
|
Harirchian E,Lahmer T,Rasulzade S. 2020a. Earthquake hazard safety assessment of existing buildings using optimized multi-layer perceptron neural network[J]. Energies,13(8):2060. doi: 10.3390/en13082060
|
Harirchian E,Lahmer T,Kumari V,Jadhav K. 2020b. Application of support vector machine modeling for the rapid seismic hazard safety evaluation of existing buildings[J]. Energies,13(13):3340. doi: 10.3390/en13133340
|
Harirchian E,Kumari V,Jadhav K,Raj Das R,Rasulzade S,Lahmer T. 2020c. A machine learning framework for assessing seismic hazard safety of reinforced concrete buildings[J]. Appl Sci,10(20):7153. doi: 10.3390/app10207153
|
Harirchian E,Hosseini S E A,Jadhav K,Kumari V,Rasulzade S,Işık E,Wasif M,Lahmer T. 2021a. A review on application of soft computing techniques for the rapid visual safety evaluation and damage classification of existing buildings[J]. J Build Eng,43:102536. doi: 10.1016/j.jobe.2021.102536
|
Harirchian E,Kumari V,Jadhav K,Rasulzade S,Lahmer T,Raj Das R. 2021b. A synthesized study based on machine learning approaches for rapid classifying earthquake damage grades to RC buildings[J]. Appl Sci,11(16):7540. doi: 10.3390/app11167540
|
Hwang S H,Mangalathu S,Shin J,Jeon J S. 2021. Machine learning-based approaches for seismic demand and collapse of ductile reinforced concrete building frames[J]. J Build Eng,34:101905. doi: 10.1016/j.jobe.2020.101905
|
Lerman P M. 1980. Fitting segmented regression models by grid search[J]. J R Stat Soc Series C Appl Stat,29(1):77–84.
|
Mangalathu S,Burton H V. 2019. Deep learning-based classification of earthquake-impacted buildings using textual damage descriptions[J]. Int J Disast Risk Reduct,36:101111. doi: 10.1016/j.ijdrr.2019.101111
|
Mangalathu S,Sun H,Nweke C C,Yi Z X,Burton H V. 2020. Classifying earthquake damage to buildings using machine learning[J]. Earthq Spectra,36(1):183–208. doi: 10.1177/8755293019878137
|
Mansourdehghan S,Dolatshahi K M,Asjodi A H. 2022. Data-driven damage assessment of reinforced concrete shear walls using visual features of damage[J]. J Build Eng,53:104509. doi: 10.1016/j.jobe.2022.104509
|
McCormack T C,Rad F N. 1997. An earthquake loss estimation methodology for buildings based on ATC-13 and ATC-21[J]. Earthq Spectra,13(4):605–621. doi: 10.1193/1.1585971
|
Miyakoshi J,Hayashi Y,Tamura K,Fukuwa N. 1997. Damage ratio functions of buildings using damage data of the 1995 Hyogo-Ken Nanbu earthquake[C]//Proceedings of the 7th International Conference on Structural Safety and Reliability. Kyoto:International Association for Structural Safety and Reliability:349−354.
|
Pedregosa F,Varoquaux G,Gramfort A,Michel V,Thirion B,Grisel O,Blondel M,Prettenhofer P,Weiss R,Dubourg V,Vanderplas J,Passos A,Cournapeau D,Brucher M,Perrot M,Duchesnay E. 2011. Scikit-learn:Machine learning in Python[J]. J Mach Learn Res,12:2825–2830.
|
Robusto C C. 1957. The Cosine-Haversine formula[J]. Am Math Mon,64(1):38–40.
|
Roeslin S,Ma Q,Juárez-Garcia H,Gómez-Bernal A,Wicker J,Wotherspoon L. 2020. A machine learning damage prediction model for the 2017 Puebla-Morelos,Mexico,earthquake[J]. Earthq Spectra,36(S2):314–339.
|
Shahriari B,Swersky K,Wang Z Y,Adams R P,De Freitas N. 2016. Taking the human out of the loop:A review of Bayesian optimization[J]. Proc IEEE,104(1):148–175. doi: 10.1109/JPROC.2015.2494218
|
Singhal A,Kiremidjian A S. 1996. Method for probabilistic evaluation of seismic structural damage[J]. J Structural Eng,122(12):1459–1467. doi: 10.1061/(ASCE)0733-9445(1996)122:12(1459)
|
Snoek J,Larochelle H,Adams R P. 2012. Practical Bayesian optimization of machine learning algorithms[C]//Proceedings of the 25th International Conference on Neural Information Processing Systems. Lake Tahoe:Curran Associates Inc.:2951−2959.
|
Stojadinović Z,Kovačević M,Marinković D,Stojadinović B. 2022. Rapid earthquake loss assessment based on machine learning and representative sampling[J]. Earthq Spectra,38(1):152–177. doi: 10.1177/87552930211042393
|
Suryanita R,Maizir H,Yuniarto E,Zulfakar M,Jingga H. 2017. Damage level prediction of reinforced concrete building based on earthquake time history using artificial neural network[C]//The 6th International Conference of Euro Asia Civil Engineering Forum. Seoul:Euro Asia Civil Engineering Forum, 138 :02024.
|
Tesfamariam S,Liu Z. 2010. Earthquake induced damage classification for reinforced concrete buildings[J]. Struct Saf,32(2):154–164. doi: 10.1016/j.strusafe.2009.10.002
|
USGS. 2007. vS30 models and data[DB/OL]. [2022-08-01]. https://earthquake.usgs.gov/data/vs30/.
|
Wald D J,Allen T I. 2007. Topographic slope as a proxy for seismic site conditions and amplification[J]. Bull Seismol Soc Am,97(5):1379–1395. doi: 10.1785/0120060267
|
Whitman R V,Reed J W,Hong S T. 1973. Earthquake damage probability matrices[C]//Proceedings of the Fifth World Conference on Earthquake Engineering. Rome:Palazzo dei Congressi (EUR):2531−2540.
|
Yuan X Z,Chen G D,Jiao P,Li L J,Han J,Zhang H B. 2022. A neural network-based multivariate seismic classifier for simultaneous post-earthquake fragility estimation and damage classification[J]. Eng Struct,255:113918. doi: 10.1016/j.engstruct.2022.113918
|
Zhao J X,Liang X,Jiang F,Xing H,Zhu M,Hou R B,Zhang Y B,Lan X W,Rhoades D A,Irikura K,Fukushima Y,Somerville P G. 2016a. Ground-motion prediction equations for subduction interface earthquakes in Japan using site class and simple geometric attenuation functions[J]. Bull Seismol Soc Am,106(4):1518–1534. doi: 10.1785/0120150034
|
Zhao J X,Zhou S L,Zhou J,Zhao C,Zhang H,Zhang Y B,Gao P J,Lan X W,Rhoades D,Fukushima Y,Somerville P G,Irikura K. 2016b. Ground‐motion prediction equations for shallow crustal and upper-mantle earthquakes in Japan using site class and simple geometric attenuation functions[J]. Bull Seismol Soc Am,106(4):1552–1569. doi: 10.1785/0120150063
|
Li Xueyan, Bian Yinju, Hou Xiaolin, Wang Tingting, Zhang Yixiao. 2025: Recognition of small magnitude seismic events type based on time-frequency features and machine learning. Acta Seismologica Sinica: 1-16. DOI: 10.11939/jass.20240012 | |
Yu Zining, Li Haifeng, Jing Xilong, Chi Chengquan, Zheng Haiyong. 2024: Borehole strain data based seismicity prediction analysis using a neural network. Acta Seismologica Sinica, 46(2): 327-339. DOI: 10.11939/jass.20230122 | |
Gong Liwen, Zhang Huai, Chen Shi, David A. Yuen, Chen Lijuan, Brennan Brunsvik, Yin Guangyao. 2023: Geometry features modeling of three-dimensional fault plane of Changning earthquake based on machine learning. Acta Seismologica Sinica, 45(6): 1040-1054. DOI: 10.11939/jass.20220079 | |
Li Jinxiang, Zhao Shuo, Jin Hua, Li Yafang, Guo Yin. 2019: A method of combined texture features and morphology for building seismic damage information extractionbased on GF remote sensing images. Acta Seismologica Sinica, 41(5): 658-670. DOI: 10.11939/jass.20190014 | |
Yan Wei, Liu Guiping, Li Mingxiao, Li Zhichao, Zhang Xiaotao, Zhou Longquan, Yuan Zhengyi. 2019: The evaluation method for the accuracy of short-term earthquake prediction. Acta Seismologica Sinica, 41(3): 399-409. DOI: 10.11939/jass.20190001 | |
Zhu Yongli, Li Dahu, Zhu Jiangang. 2017: Rapid evaluation method of life loss in earthquakes based on strong ground motion records. Acta Seismologica Sinica, 39(1): 143-154. DOI: 10.11939/jass.2017.01.012 | |
Huang Shusong, Dou Aixia, Wang Xiaoqing, Yuan Xiaoxiang. 2016: Building damage feature analyses based on post-earthquake airborne LiDAR data. Acta Seismologica Sinica, 38(3): 467-476. DOI: 10.11939/jass.2016.03.014. | |
Xia Caiyun, Zhang Yongxian, Zhang Xiaotao, WU Yongjia. 2015: Predictability test for pattern information method by two MS7.3 Yutian, Xinjiang, earthquakes. Acta Seismologica Sinica, 37(2): 312-322. DOI: 10.11939/jass.2015.02.011 | |
Jin Ping, Zhang Chengliu, Shen Xufeng, Wang Hongchun, Pan Changzhou, Yan Feng, Wang Dianyuan. 2014: A novel technique for automatic seismic data processing using both integral and local features of seismograms. Acta Seismologica Sinica, 36(3): 464-479. DOI: 10.3969/j.issn.0253-3782.2014.03.012 | |
2004: 华东地区地电阻率各向异性度的地震前兆异常特征初步研究. Acta Seismologica Sinica, 26(2): 223-227. |