Skip to main content
U.S. flag

An official website of the United States government

A spatial machine learning model developed from noisy data requires multiscale performance evaluation: Predicting depth to bedrock in the Delaware River Basin, USA

June 23, 2024

Spatial machine learning models can be developed from observations with substantial unexplainable variability, sometimes called ‘noise’. Traditional point-scale metrics (e.g., R2) alone can be misleading when evaluating these models. We present a multi-scale performance evaluation (MPE) using two additional scales (distributional and geostatistical). We apply the MPE framework to predictions of depth to bedrock (DTB) in the Delaware River Basin. Geostatistical analysis shows that approximately one third of the DTB variance is at spatial scale smaller than 2 km. Hence, we interpret our point-scale R2 of 0.3 (testing data) to be sufficient for regional-scale modelling. Bias-correction methods improve performance at two of the three MPE scales: point-scale change is negligible, while distributional and geostatistical performance improves. In contrast, bias correction applied to a global DTB model does not improve MPE performance. This work encourages scale-appropriate performance evaluations to enable effective model intercomparison.

Publication Year 2024
Title A spatial machine learning model developed from noisy data requires multiscale performance evaluation: Predicting depth to bedrock in the Delaware River Basin, USA
DOI 10.1016/j.envsoft.2024.106124
Authors Phillip J. Goodling, Kenneth Belitz, Paul Stackelberg, Brandon J. Fleming
Publication Type Article
Publication Subtype Journal Article
Series Title Environmental Modelling & Software
Index ID 70255572
Record Source USGS Publications Warehouse
USGS Organization Maryland Water Science Center; National Water Quality Program; WMA - Earth System Processes Division; Maryland-Delaware-District of Columbia Water Science Center