Skip to main content
U.S. flag

An official website of the United States government

An evaluation of sampling and full enumeration strategies for Fisher Jenks classification in big data settings

August 1, 2017

Large data contexts present a number of challenges to optimal choropleth map classifiers. Application of optimal classifiers to a sample of the attribute space is one proposed solution. The properties of alternative sampling-based classification methods are examined through a series of Monte Carlo simulations. The impacts of spatial autocorrelation, number of desired classes, and form of sampling are shown to have significant impacts on the accuracy of map classifications. Tradeoffs between improved speed of the sampling approaches and loss of accuracy are also considered. The results suggest the possibility of guiding the choice of classification scheme as a function of the properties of large data sets.

Publication Year 2017
Title An evaluation of sampling and full enumeration strategies for Fisher Jenks classification in big data settings
DOI 10.1111/tgis.12236
Authors Sergio J. Rey, Philip A. Stephens, Jason R. Laura
Publication Type Article
Publication Subtype Journal Article
Series Title Transactions in GIS
Index ID 70194256
Record Source USGS Publications Warehouse
USGS Organization Astrogeology Science Center