Skip to main content

A novel framework for predicting daily reference evapotranspiration using interpretable machine learning techniques

Research Abstract

Abstract Accurate estimation of daily reference evapotranspiration (ETo) is crucial for sustainable water resource management and irrigation scheduling, especially in water-scarce regions like Arizona. The standardized Penman–Monteith (PM) method is costly and requires specialized instruments and expertise, making it generally impractical for commercial growers. This study developed 35 ETo models to predict daily ETo across Coolidge, Maricopa, and Queen Creek in Pinal County, Arizona. Seven input combinations of daily meteorological variables were used for training and testing five machine learning (ML) models: Artificial Neural Network (ANN), Random Forest (RF), Extreme Gradient Boosting (XGBoost), Categorical Boosting (CatBoost), and Support Vector Machine (SVM). Four statistical indicators, coefficient of determination (R2), the normalized root-mean-squared error (RMSEn), mean absolute error (MAE), and simulation error (Se), were used to evaluate the ML models’ performance in comparison with the FAO-56 PM standardized method. The SHapley Additive exPlanations (SHAP) method was used to interpret each meteorological variable’s contribution to the model predictions. Overall, the 35 ETo-developed models showed an excellent to fair performance in predicting daily ETo over the three weather stations. Employing ANN10, RF10, XGBoost10, CatBoost10, and SVM10, incorporating all ten meteorological variables, yielded the highest accuracies during training and testing periods (0.994 ≤ R2 ≤ 1.0, 0.729 ≤ RMSEn ≤ 3.662, 0.030 ≤ MAE ≤ 0.181 mm·day−1, and 0.833 ≤ Se ≤ 2.295). Excluding meteorological variables caused a gradual decline in ET-developed models’ performance across the stations. However, 3-variable models using only maximum, minimum, and average temperatures (Tmax, Tmin, and Tave) predicted ETo well across the three stations during testing (17.655 ≤ RMSEn ≤ 13.469 and Se ≤ 15.45%). Results highlighted that Tmax, solar radiation (Rs), and wind speed at 2 m height (U2) are the most influential factors affecting ETo at the central Arizona sites, followed by extraterrestrial solar radiation (Ra) and Tave. In contrast, humidity-related variables (RHmin, RHmax, and RHave), along with Tmin and precipitation (Pr), had minimal impact on the model’s predictions. The results are informative for assisting growers and policymakers in developing effective water management strategies, especially for arid regions like central Arizona.

Research Authors
Elsayed Ahmed Elsadek, Mosaad Ali Hussein Ali, Clinton Williams, Kelly R Thorp, Diaa Eldin M Elshikha
Research Date
Research Member