Suitable Proportion Sample of Holdout Validation for Spatial Rainfall Interpolation in Surrounding the Makassar Strait

Giarno Giarno, Muhammad Pramono Hadi, Slamet Suprayogi, Sigit Heru Murti



Spatial rainfall interpolation requires a suitable number of validation samples to maintain accuracy. Generally, the larger the area that can be predicted, the better the interpolation. In addition, the data used for validation should be kept separate from the modelling data, and the number of available samples determines the optimal proportion of independent sites. The objective of this study is to determine the optimal sample ratio for holdout validation of interpolation methods; the area surrounding the Makassar Strait was chosen as the study location because of its daily rainfall variation. The accuracy of each sample selection is tested using correlation, root mean square error (RMSE), mean absolute error (MAE) and contingency table indicators. The results show that accuracy depends on the proportion of data used for modelling: the more data used for interpolation, the better the accuracy. However, if the rain gauge data are separated by province, the accuracy varies with the proportion of independent samples. For rainfall interpolation, it is recommended to use at least 75% of the data sites to maintain accuracy. A comparison between kriging and inverse distance weighting (IDW) indicates that IDW performs better. Moreover, rainfall characteristics affect both the accuracy and the appropriate proportion of independent samples.


validation, independent sample, spatial interpolation, rainfall, Makassar Strait
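
The holdout procedure summarised in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names (`idw_predict`, `holdout_scores`), the IDW power of 2, the random split, the 1 mm rain/no-rain threshold, and the choice of probability of detection (POD) and false alarm ratio (FAR) as contingency indicators are all assumptions made for illustration. The 75% modelling fraction reads the recommended minimum as the share of sites used for interpolation.

```python
import numpy as np


def idw_predict(xy_train, z_train, xy_test, power=2.0):
    """Inverse distance weighting: weighted mean of training values,
    with weights 1 / distance**power (power=2 is an assumed default)."""
    # Pairwise distances, shape (n_test, n_train).
    d = np.linalg.norm(xy_test[:, None, :] - xy_train[None, :, :], axis=2)
    d = np.maximum(d, 1e-12)  # avoid division by zero at coincident sites
    w = 1.0 / d ** power
    return (w @ z_train) / w.sum(axis=1)


def holdout_scores(xy, z, train_fraction=0.75, rain_threshold=1.0, seed=0):
    """Randomly hold out (1 - train_fraction) of the sites, interpolate
    them from the remaining sites, and score the predictions."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(z))
    n_train = int(round(train_fraction * len(z)))
    train, test = idx[:n_train], idx[n_train:]

    pred = idw_predict(xy[train], z[train], xy[test])
    obs = z[test]
    err = pred - obs

    # 2x2 contingency table for rain / no-rain (threshold is an assumption).
    hits = np.sum((pred >= rain_threshold) & (obs >= rain_threshold))
    misses = np.sum((pred < rain_threshold) & (obs >= rain_threshold))
    false_alarms = np.sum((pred >= rain_threshold) & (obs < rain_threshold))

    return {
        "n_test": len(test),
        "rmse": float(np.sqrt(np.mean(err ** 2))),
        "mae": float(np.mean(np.abs(err))),
        "corr": float(np.corrcoef(pred, obs)[0, 1]),
        "pod": hits / (hits + misses) if hits + misses else float("nan"),
        "far": false_alarms / (hits + false_alarms)
        if hits + false_alarms else float("nan"),
    }
```

Calling `holdout_scores` repeatedly with different `train_fraction` values (for example 0.5, 0.75, 0.9) gives one way to compare how the scores change with the proportion of independent sites.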

