A station-scale modeling framework for heavy rainfall classification in tropical weather using representative machine learning approaches

ADI MULSANDI, MIFTAHUDDIN MIFTAHUDDIN

Abstract


Extreme daily rainfall in rapidly urbanizing tropical cities frequently overwhelms drainage and disrupts critical services, yet station-scale forecasting remains limited by convective variability and sparse observations. This motivates lightweight, interpretable machine-learning tools that operate on routine station data. We propose and evaluate a station-scale framework to classify heavy-rainfall days (≥50 mm) in a humid tropical setting. Using 1,796 daily observations from the Soekarno-Hatta Meteorological Station (2018–2022), we engineered lag-informed predictors (e.g., previous-day rainfall, 3-day sums/means) and compared three representative classifiers, Logistic Regression (LR), Support Vector Machine (SVM), and Random Forest (RF). Class imbalance was addressed with class-weighted training, and models were assessed on a held-out test set using precision, recall, F1, and Receiver Operating Characteristic – Area Under the Curve (ROC-AUC). LR achieved the highest recall (0.429), indicating moderate sensitivity to rare heavy-rainfall events, whereas RF yielded the best probabilistic discrimination (AUC = 0.619) but failed to flag positives at the default threshold; SVM displayed near-random behavior. Feature analyses highlighted humidity, temperature, and recent rainfall accumulation as the most influential predictors, consistent with tropical convective processes. Despite severe class imbalance, simple, station-based classifiers can extract actionable signals for rare-event screening in data-limited tropical regions. Operational value is likely to improve through probability calibration and threshold tuning, ensemble integration, and spatial generalization to multi-station settings.


Keywords


Atmospheric modeling; daily rainfall; heavy rainfall prediction; machine learning; tropical climate

References


K. Yamamoto, R.; Nakamura,; A. Sato. 2022. Urban Flooding Risk under Climate Change in Jakarta. J. Disaster Res. 17 516–523. DOI: 10.20965/jdr.2022.p0516.

Li, F.; Pan, X.; Xu, N.; Meng, X.; Li, Z.; Guluzade, R.; et al. Does Urbanization Exacerbate Asymmetrical Changes in Precipitation at Divergent Time Scales in China?. Earth’s Future. 12. DOI: 10.1029/2023ef004355.

Huang, S., Gan, Y., Zhang, X., Chen, N., Wang, C., Gu, X., et al. 2023. Urbanization Amplified Asymmetrical Changes of Rainfall and Exacerbated Drought: Analysis Over Five Urban Agglomerations in the Yangtze River Basin, China. Earth’s Future. 11. DOI: 10.1029/2022ef003117.

Sutanudjaja, E. 2023. Urban Rainfall Extremes and Climate Change Impacts in Jakarta. Sustainability. 16 350. DOI: 10.3390/su16010350.

Wübbelmann, T.; Förster, K.; Bouwer, L.M.; Dworczyk, C.; Bender, S.; Burkhard, B. 2023. Urban Flood Regulating Ecosystem Services Under Climate Change: How Can Nature-Based Solutions Contribute. Front. Water. 5. DOI: 10.3389/frwa.2023.1081850.

Zhang, C. 2017. Madden–Julian Oscillation: Bridging Weather and Climate. J. Atmos. Sci. 74 3217–3240. DOI: 10.1175/JAS-D-16-0319.1.

Davison, C.; Haynes, R. 2024. Challenges in Predicting Tropical Convective Rainfall: The Role of Model Complexity. Weather Clim. Dyn., 5 1153–1170. DOI: 10.5194/wcd-5-1153-2024.

Fan, X.; Miao, C.; Duan, Q.; Shen, C.; Wu, Y. 2020. The Performance of CMIP6 Versus CMIP5 in Simulating Temperature Extremes Over the Global Land Surface. J. Geophys. Res. Atmos. 125. DOI: 10.1029/2020jd033031.

Gluck, S. 2023. Robustness of Climate Models. Philos. Sci. 90 1407 - 1416 DOI: 10.1017/psa.2023.62.

Wong‐Parodi, G.; Relihan, D. P.; Garfin, D. R. 2024. Dynamic Model of Climate Action. Environ. Res. Lett. 19. DOI: 10.1088/1748-9326/ad6917.

Fujita, M.; Mizuta, R.; Ishii, M.; Endo, H.; Sato, T.; Okada, Y.; et al. 2019. Precipitation Changes in a Climate With 2‐K Surface Warming From Large Ensemble Simulations Using 60‐km Global and 20‐km Regional Atmospheric Models. Geophys. Res. Lett. 90 435-442. DOI: 10.1029/2018gl079885.

Menary, M. B.; Robson, J.; Allan, R. P.; Booth, B. B. B.; Cassou, C.; Gastineau, G.; et al. 2020. Aerosol‐Forced AMOC Changes in CMIP6 Historical Simulations. Geophys. Res. Lett. 47. DOI: 10.1029/2020gl088166.

Bevacqua, E.; Suarez-Gutierrez, L.; Jézéquel, A.; et al. 2023. Advancing Research on Compound Weather and Climate Events via Large Ensemble Model Simulations. Nat. Commun. 14 2145. DOI: 10.1038/s41467-023-37847-5.

Hermans, T. H. J.; Busecke, J. J. M.; Wahl, T.; Malagón-Santos, V.; Tadesse, M. G.; Jane, R. A.; van de Wal, R. S. W. 2024. Projecting Changes in the Drivers of Compound Flooding in Europe Using CMIP6 Models. Earth’s Future 12. DOI: 10.5194/egusphere-egu24-5030.

Yu, H.; Chen, J.; Xu, C.; Xiong, L.; Chen, H. 2018. Bias Nonstationarity of Global Climate Model Outputs: The Role of Internal Climate Variability and Climate Model Sensitivity. Int. J. Climatol. 39 2278–2294. DOI: 10.1002/joc.5950.

Bala, R; Pandey, H. 2020. Mathematics of Climate Modeling: Challenges and Perspectives. Turk. J. Comput. Math. Educ. 11 2900-2904. DOI: 10.61841/turcomat.v11i3.14657.

Mengistu, A. G.; Woldesenbet, T. A.; Dile, Y. T. 2021. Evaluation of the Performance of Bias-Corrected CORDEX Regional Climate Models in Reproducing Baro–Akobo Basin Climate. Theor. Appl. Climatol. 144 751–767. DOI: 10.1007/s00704-021-03552-w.

Soldatenko, S.; Bogomolov, A.; Ronzhin, A. 2021. Mathematical Modelling of Climate Change and Variability in the Context of Outdoor Ergonomics. Mathematics 9. DOI: 10.3390/math9222920.

Gaudemet, J.-B.; Deschamps, J.; Vinciguerra, O. 2022. A Stochastic Climate Model -- An Approach to Calibrate the Climate-Extended Risk Model (CERM). DOI: 10.48550/arxiv.2205.02581.

Sidorova, M. A.; Yasinsky, S. 2024. Assessment of River Flow Future Changes in the Upper Volga and Kama Basins Based on Climate Modeling Data. E3S Web Conf. 555 6. DOI: 10.1051/e3sconf/202455501012.

Mulsandi, A.; Kristianto, A; Zakir, A. 2018. Perbaikan Prediksi Cuaca Numerik Kejadian Curah Hujan Lebat Terkait dengan Kejadian Longsor Di Banjarnegara Menggunakan Asimilasi Data Satelit. J. Sains Teknol. Modif. Cuaca 19 51–60. DOI: 10.29122/JSTMC.V19I2.3127.

Chattopadhyay, A.; Nabizadeh, E.; Hassanzadeh, P. 2020. Analog Forecasting of Extreme‐Causing Weather Patterns Using Deep Learning. J. Adv. Model. Earth Syst. 12. DOI: 10.1029/2019ms001958.

David, R.; Priya, L.; Donti; Lynn, H; Kaack; Kochanski, K.; et al. 2019. Tackling Climate Change With Machine Learning. ACM Comput. Surv. 55 1–96. DOI: 10.1145/3485128.

Amry, Z.; Mulyono, M.; Amalia, S. N. 2024. Application and Evaluation of Multivariate Singular Spectrum Analysis in Meteorological Data. J. Phys. Conf. Ser. 2908. DOI: 10.1088/1742-6596/2908/1/012024.

Chen, R.; Zhang, W.; Wang, X. 2020. Machine Learning in Tropical Cyclone Forecast Modeling: A Review. Atmosphere. 11 676. DOI: 10.3390/atmos11070676.

Liyew, C. M.; Melese, H. A. 2021. Machine Learning Techniques to Predict Daily Rainfall Amount. J. Big Data. 8 153 DOI: 10.1186/s40537-021-00545-4.

Abuhammad, H.; Al-Fraihat, D.; Faisal Alzyoud, Y. S. 2024. Rainfall Prediction Using Deep Learning Algorithms. J. Electr. Syst.. 20 1696-1704. DOI: 10.52783/jes.3765.

Zhang, X.; Wu, X. 2023. Combined Forecasting Model of Precipitation Based on the CEEMD-ELM-FFOA Coupling Model. Water. 15 1485 DOI: 10.3390/w15081485.

Astutik, S.; Rahmi, N.; Irsandy, D.; Saniyawati, F.; Mashfia, F.; Lusiana, E.; Risda, I.; Susanto, M. 2024. Bayesian Neural Network Rainfall Modelling: A Case Study in East Java. Barekeng J. Ilmu Mat. Terap. 18 1105-1116. DOI: 10.30598/barekengvol18iss2pp1105-1116.

Uddin, M. J.; Li, Y.; Sattar, M. A.; Liu, M.; Yang, N. 2022. An Improved Cluster‐Wise Typhoon Rainfall Forecasting Model Based on Machine Learning and Deep Learning Models Over the Northwestern Pacific Ocean. J. Geophys. Res. Atmos. 127. DOI: 10.1029/2022jd036603.

Li, Z.; Ma, Y.; Liu, J.; Liu, Y.; Ren, W.; Zhao, Q. 2023. Short-Term Rainfall Forecasting by Combining BP-NN Algorithm and GNSS Technique for Landslide-Prone Areas. Atmosphere. 14 1309. DOI: 10.3390/atmos14081309.

Li, H.; Choy, S.; Zaminpardaz, S.; Carter, B.; Sun, C.; Purwar, S.; Liang, H.; Li, L.; Wang, X. . 2023. Investigating the Inter-Relationships Among Multiple Atmospheric Variables and Their Responses to Precipitation. Atmosphere. 14 571. DOI: 10.3390/atmos14030571.

Praveena, R.; Babu, T. R.; Birunda, M.; Sudha, G.; Sukumar, P.; Gnanasoundharam, J. 2023. Prediction of Rainfall Analysis Using RF and Support Vector Machine. J. Phys. Conf. Ser. 2466. DOI: 10.1088/1742-6596/2466/1/012032.

Waqas, M.; Humphries, U.W.; Wangwongchai, A.; Dechpichai, P.; Ahmad, S. 2023. Potential of Artificial Intelligence-Based Techniques for Rainfall Forecasting in Thailand: A Comprehensive Review. Water. 15 2979. DOI: 10.3390/w15162979.

Gao, Z.; Lu, X.; Chen, R.; Guo, M.; Wang, X. 2024. Constructing a Machine Learning Model for Rapid Urban Flooding Forecast in Sloping Cities Along the Yangtze River: A Case Study in Jiujiang,” Water. 16 1694. DOI: 10.3390/w16121694.

Huntingford, C.; Jeffers, E. S.; Bonsall, M. B.; Christensen, H. M.; Lees, T.; Yang, H. 2019. Machine Learning and Artificial Intelligence to Aid Climate Change Research and Preparedness. Environ. Res. Lett. 14 124007. DOI: 10.1088/1748-9326/ab4e55.

Barrera-Animas, A. Y.; Oyedele, L. O.; Bilal, M.; Akinosho, T. ; Delgado, J. M. D.; Akanbi, L. 2022. Rainfall Prediction: A Comparative Analysis of Modern Machine Learning Algorithms for Time-Series Forecasting. Mach. Learn. Appl. 7. DOI: 10.1016/j.mlwa.2021.100204.

Fu, H.; Zhang, W.; Li, C.; Hu, Z. 2022. A Rainfall Forecast Model Based on GNSS Tropospheric Parameters and BP-NN Algorithm. Atmosphere. 13 1045. DOI: 10.3390/atmos13071045.

Kumar, V.; Kedam, N.; Sharma, K.V.; Khedher, K.M.; Alluqmani, A.E. 2023. A Comparison of Machine Learning Models for Predicting Rainfall in Urban Metropolitan Cities. Sustainability. 15 13724. DOI: 10.3390/su151813724.

Schoppa, L.; Disse, M.; Bachmair, S. 2020. Evaluating the Performance of RF for Large-Scale Flood Discharge Simulation. J. Hydrol. 590. DOI: 10.1016/j.jhydrol.2020.125531.

Desai, S.; Ouarda, T. B. M. J. 2021. Regional Hydrological Frequency Analysis at Ungauged Sites With RF Regression. J. Hydrol. 594. DOI: 10.1016/j.jhydrol.2020.125861.

Bhusal, A.; Parajuli, U.; Regmi, S.; Kalra, A. 2022. Application of Machine Learning and Process-Based Models for Rainfall-Runoff Simulation in DuPage River Basin, Illinois. Hydrology. 9 117. DOI: 10.3390/hydrology9070117.

Bochenek, B.; Ustrnul, Z. 2022. Machine Learning in Weather Prediction and Climate Analyses-Applications and Perspectives,” Atmosphere. 13 180. DOI: 10.3390/atmos13020180.

Liu, C.; Deng, H.; Qiu, X.; Lu, Y.; Li, J. 2024. A New Post-Processing Method for Improving Track and Rainfall Ensemble Forecasts for Typhoons Over Eastern China. Atmosphere. 15 874. DOI: 10.3390/atmos15080874.

Zhang C. J.; Zeng J.; Wang H. Y.; Ma L. M.; Chu, H. 2020. Correction Model for Rainfall Forecasts Using the LSTM With Multiple Meteorological Factors. Meteorol. Appl. 27. DOI: 10.1002/met.1852.

Frame, J.M.; Kratzert, F.; Raney II, A.; Rahman, M.; Salas, F. R.; Nearing, G.S. 2021. Post‐Processing the National Water Model With Long Short‐Term Memory Networks for Streamflow Predictions and Model Diagnostics,” J. Am. Water Resour. Assoc. 57 885–905. DOI: 10.1111/1752-1688.12964.

Gaol, G. A. M. L.; Syafrullah, M.; Supardi, S. 2024. Comparison of Monthly Rainfall Prediction Using Long Short Term Memory and Multi Layer Perceptron Methods in South Tangerang City. J. Sisfokom (Sist. Inf. Komput.) 13 293–297. DOI: 10.32736/sisfokom.v13i2.2149.

Zoremsanga, C.; Hussain, J. 2024. An Evaluation of Bidirectional Long Short-Term Memory Model for Estimating Monthly Rainfall in India. Indian J. Sci. Technol. 17 1828-1837. DOI: 10.17485/ijst/v17i18.2505.

Pham, Q. B.; Yang, T. C.; Kuo, C. M.; Tseng, H. W.; Yu, P. S. 2019. Combing RF and Least Square Support Vector Regression for Improving Extreme Rainfall Downscaling. Water. 11 451. DOI: 10.3390/w11030451.

Abdullah, M.; Said, S. 2025. Performance Evaluation of Machine Learning Regression Models for Rainfall Prediction. Iran. J. Sci. Technol. Trans. Civ. Eng. 49 4231–4250. DOI: 10.21203/rs.3.rs-3258529/v1.

Sasaki, K.; Anh, V. T. 2023. Development of Precipitation Guidance for 36 Regions in Vietnam Up to 5 Days Ahead. Vietnam J. Hydrometeorol. 15 40-58. DOI: 10.36335/vnjhm.2023(15).40-58.

Zhou, Z.; Ren, J.; He, X.; Liu, S. 2021. A Comparative Study of Extensive Machine Learning Models for Predicting Long‐term Monthly Rainfall With an Ensemble of Climatic and Meteorological Predictors. Hydrol. Process. 35. DOI: 10.1002/hyp.14424.

Cheng, Q.; Jing, Q.; Collender, P. A.; Head, J.R.; Li, Q.; Yu, H.; Li, Z.; Ju, Y.; Chen, T.; Wang, P.; Cleary, P.; La, S. 2023. Prior Water Availability Modifies the Effect of Heavy Rainfall on Dengue Transmission: A Time Series Analysis of Passive Surveillance Data From Southern China. Front. Public Health 11. DOI: 10.3389/fpubh.2023.1287678.

F. Jafari, S. J. Mousavi, and K. Ponnambalam, “Predictive MPC-Based Operation of Urban Drainage Systems Using Input Data-Clustered Artificial Neural Networks Rainfall Forecasting Models,” Hydrology. 2023, DOI: 10.3390/hydrology10070139.

Jafari, F.; Mousavi, S.J.; Ponnambalam, K. 2023. Basin‐scale Monthly Rainfall Forecasts With a Data‐driven Model Using Lagged Global Climate Indices and Future Predicted Rainfall of an Adjacent Basin. Int. J. Climatol. 10 139. DOI: 10.1002/joc.8021.

Rajani M. A. B.; Sravani, R. S.; Swapna N. M. 2025. Comparison and Forecasting for Indian Rainfall Using Proposed and Time Series Models. Int. J. Bio-Resour. Stress Manag. 15 01-06. DOI: 10.23910/1.2024.5212.

Sittichok, K.; Rattanapan, N.; Sakulkaew, R. 2024. Long-Term Seasonal Rainfall Forecasting Using Regression Analysis and Artificial Neural Network Together With Larg-Scale Circulation Indices. ASEAN J. Sci. Technol. Rep. 27 .DOI: 10.55164/ajstr.v27i3.253507.

Yan, J.; Xu, T.; Yu, Y.; Xu, H. 2021. Rainfall Forecast Model Based on the TabNet Model. Water. 13 1272. DOI: 10.3390/w13091272.

Lim, S. Y.; Marzin, C.; Xavier, P.; Chang, C.; Timbal, B. 2017. Impacts of Boreal Winter Monsoon Cold Surges and the Interaction with MJO on Southeast Asia Rainfall. J. Clim., 30 4267–4281. DOI: 10.1175/JCLI-D-16-0546.1.

Nuryanto, D. E.; Pawitan, H.; Hidayat, R.; Aldrian, E. 2019. Characteristics of two mesoscale convective systems (MCSs) over the Greater Jakarta: case of heavy rainfall period 15–18 January 2013. Geosci. Lett. 6 1. DOI: 10.1186/s40562-019-0131-5.

Ferrett, S.; Yang G. Y.; Woolnough S.; Hodges, K.; Holloway, C. E. 2019. Linking extreme precipitation in Southeast Asia to equatorial waves. Q. J. R. Meteorol. Soc. 146 665–684. DOI: 10.1002/qj.3699.

Mulsandi, A.; Koesmaryono, Y.; Hidayat, R.; Faqih, A.; Sopaheluwakan, A. 2024. Detecting Indonesian Monsoon Signals and Related Features Using Space–Time Singular Value Decomposition (SVD),” Atmosphere, 15 187. DOI: 10.3390/atmos15020187.

Mulsandi, A.; Koesmaryono, Y.; Hidayat, R.; Faqih, A.; Sopaheluwakan, A. 2024. On The Interannual Variability of Indonesian Monsoon Rainfall (IMR): A Literature Review of The Role of its External Forcing. J. Meteorol. Geofis. 24 115–127. DOI: 10.31172/jmg.v24i2.1049.

Mulsandi, A.; Koesmaryono, Y.; Hidayat, R.; Faqih, A.; Sopaheluwakan, A. 2024. Evaluasi Performa Indeks Monsun Ausmi dan Wnpmi di Wilayah Indonesia. J. Sains Teknol. Modif. Cuaca. 22 61–70. Available: https://ejurnal.bppt.go.id/index.php/JSTMC/article/view/4705.

Chen, F.; Li, H.; Hu, S.; Jiang, S.; Li, J.; Wu, R. 2023. Formation Mechanisms of the ‘5·31’ Record-Breaking Extreme Heavy Rainfall Process in South China in 2021. Atmosphere. 14 872. DOI: 10.3390/atmos14050872.

Zhang, Y.; Zang, S.; Shen, X.; Fan, G. 2021. Observed Changes of Rain-Season Precipitation in China From 1960 to 2018. Int. J. Environ. Res. Public Health 18 10031. DOI: 10.3390/ijerph181910031.

Kalbarczyk, R.; Kalbarczyk, E. 2024. Risk of Natural Hazards Caused by Extreme Precipitation in Poland in 1951–2020. Water. 16 1705. DOI: 10.3390/w16121705.

Rocha-Ruiz, D.; Martínez, C.; Gómez, A. 2022. Comparison of Machine Learning Models for Predicting Rainfall in Tropical Regions: The Colombian Case. Appl. Inform. 1643 106–120.

Guo, Y.; Cao, X.; B. Liu, and K. Peng, “El Niño Index Prediction Using Deep Learning with Ensemble Empirical Mode Decomposition. Symmetry. 12 893. DOI: 10.3390/SYM12060893.

Uddin, M. J.; Li, Y.; Tamim, M. Y.; Miah, M. B.; Ahmed, S. M. S. 2022. Extreme Rainfall Indices Prediction With Atmospheric Parameters and Ocean–Atmospheric Teleconnections Using a RF Model. J. Appl. Meteorol. Climatol. 61 651–667. DOI: 10.1175/jamc-d-21-0170.1.


Full Text: PDF

DOI: 10.24815/jn.v25i3.48605

Refbacks

  • There are currently no refbacks.