Skip to main content
U.S. flag

An official website of the United States government

Cost-Benefit Analysis of Computer Resources for Machine Learning

January 1, 2007

Machine learning describes pattern-recognition algorithms - in this case, probabilistic neural networks (PNNs). These can be computationally intensive, in part because of the nonlinear optimizer, a numerical process that calibrates the PNN by minimizing a sum of squared errors. This report suggests efficiencies that are expressed as cost and benefit. The cost is computer time needed to calibrate the PNN, and the benefit is goodness-of-fit, how well the PNN learns the pattern in the data. There may be a point of diminishing returns where a further expenditure of computer resources does not produce additional benefits. Sampling is suggested as a cost-reduction strategy. One consideration is how many points to select for calibration and another is the geometric distribution of the points. The data points may be nonuniformly distributed across space, so that sampling at some locations provides additional benefit while sampling at other locations does not. A stratified sampling strategy can be designed to select more points in regions where they reduce the calibration error and fewer points in regions where they do not. Goodness-of-fit tests ensure that the sampling does not introduce bias. This approach is illustrated by statistical experiments for computing correlations between measures of roadless area and population density for the San Francisco Bay Area. The alternative to training efficiencies is to rely on high-performance computer systems. These may require specialized programming and algorithms that are optimized for parallel performance.

Publication Year 2007
Title Cost-Benefit Analysis of Computer Resources for Machine Learning
DOI 10.3133/ofr20071398
Authors Richard A. Champion
Publication Type Report
Publication Subtype USGS Numbered Series
Series Title Open-File Report
Series Number 2007-1398
Index ID ofr20071398
Record Source USGS Publications Warehouse
USGS Organization Western Geographic Science Center